AI Research Engineer based in Cape Town, South Africa 🇿🇦
I specialize in Multi-Agent Reinforcement Learning and LLM Agents & Engineering.
Currently at InstaDeep working on MARL research, where I focus on combining Contrastive Goal-Conditioned Reinforcement Learning and Unsupervised Environment Design (UED) in multi-agent settings.
MSc in AI from the University of Cape Town & AIMS South Africa (Google DeepMind Scholar)
- Multi-Agent RL: Contrastive learning, goal-conditioned RL, and curriculum strategies in JAX
- LLM Agents: Autonomous agents for ML engineering, scientific discovery, and code generation
- Inference-Time Scaling: Making open-source LLMs competitive with proprietary models
- LLM Engineering: Fine-tuning, RLHF (PPO/GRPO/DPO), vLLM serving, distributed training
I'm good with Python, JAX/Flax, PyTorch, vLLM, HuggingFace TRL, Unsloth, LangGraph, LangSmith, LiteLLM, Hydra, Docker, and TPU/GPU.