Skip to content
View Asimawad's full-sized avatar
πŸ˜ƒ
..
πŸ˜ƒ
..

Organizations

@instadeepai

Block or report Asimawad

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Asimawad/README.md

Hi, I'm Asim

AI Research Engineer based in Cape Town, South Africa πŸ‡ΏπŸ‡¦

I specialize in Multi-Agent Reinforcement Learning and LLM Agents & Engineering

Currently at InstaDeep working on MARL research, I'm currently focused on combining Contrastive Goal Conditioned Reinforcement Learnining and Unsupervised Environment Design (UED) in Multi Agent settings.

πŸŽ“ MSc in AI from University of Cape Town & AIMS South Africa (Google DeepMind Scholar)


πŸ”¬ What I Work On

  • Multi-Agent RL β€” Contrastive learning, goal-conditioned RL, and curriculum strategies in JAX
  • LLM Agents β€” Autonomous agents for ML engineering, scientific discovery, and code generation
  • Inference-Time Scaling β€” Making open-source LLMs competitive with proprietary models
  • LLM Engineering β€” Fine-tuning, RLHF (PPO/GRPO/DPO), vLLM serving, distributed training

Skills

I'm good with Python JAX/Flax PyTorch vLLM HuggingFace TRL Unsloth LangGraph LangSmith LiteLLM Hydra Docker TPU/GPU

Website LinkedIn Email

Pinned Loading

  1. ITS-bench ITS-bench Public

    Forked from openai/mle-bench

    Bench-marking Inference time scaling strategies on MLE-bench for measuring how well AI agents perform at machine learning engineering

    Python

  2. Arabic-to-Swahili-Machine-Translation Arabic-to-Swahili-Machine-Translation Public

    Graduation Project

    Jupyter Notebook

  3. Bayesian-Deep-Active-Learning Bayesian-Deep-Active-Learning Public

    Active Learning experiments using Bayesian neural networks (BNNs)

    Jupyter Notebook

  4. tunix-jax-llms tunix-jax-llms Public

    Python

  5. aide-agent aide-agent Public

    automatic tree search llm based agent

    Python