Asim A.Osman Asimawad

Hi, I'm Asim

AI Research Engineer based in Cape Town, South Africa 🇿🇦

I specialize in Multi-Agent Reinforcement Learning and LLM Agents & Engineering

Currently at InstaDeep working on MARL research, I'm currently focused on combining Contrastive Goal Conditioned Reinforcement Learnining and Unsupervised Environment Design (UED) in Multi Agent settings.

🎓 MSc in AI from University of Cape Town & AIMS South Africa (Google DeepMind Scholar)

🔬 What I Work On

Multi-Agent RL — Contrastive learning, goal-conditioned RL, and curriculum strategies in JAX
LLM Agents — Autonomous agents for ML engineering, scientific discovery, and code generation
Inference-Time Scaling — Making open-source LLMs competitive with proprietary models
LLM Engineering — Fine-tuning, RLHF (PPO/GRPO/DPO), vLLM serving, distributed training

Skills

I'm good with Python JAX/Flax PyTorch vLLM HuggingFace TRL Unsloth LangGraph LangSmith LiteLLM Hydra Docker TPU/GPU

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Asim A.Osman Asimawad

Organizations

Block or report Asimawad

Hi, I'm Asim

🔬 What I Work On

Skills

Pinned Loading

Uh oh!