-
Notifications
You must be signed in to change notification settings - Fork 95
Pull requests: NVIDIA-NeMo/Automodel
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: optimized TP plan lookup in NeMo-RL by qualname
#1547
opened Mar 15, 2026 by
ZhiyuLi-Nvidia
Loading…
ci: Updating testing path to /opt/Automodel, update codecov settings
#1544
opened Mar 13, 2026 by
thomasdhc
Loading…
3 tasks
refactor: rename encoder -> retrieval naming convention
#1536
opened Mar 12, 2026 by
oliverholworthy
Loading…
3 tasks done
fix: NaN loss in NemotronH from-scratch pretraining with FSDP2
community-request
#1527
opened Mar 11, 2026 by
chloechiaw
Loading…
fix: skip initialize_weights for all NemotronH variants (including MoE)
#1526
opened Mar 11, 2026 by
terrykong
Loading…
3 tasks
feat: add pipeline parallelism support for knowledge distillation
#1500
opened Mar 9, 2026 by
Separius
Loading…
fix: skip model.to(device) after checkpoint loading (tied params + FSDP)
#1489
opened Mar 8, 2026 by
terrykong
Loading…
1 of 2 tasks
feat: MFU logging in train recipes
community-request
#1413
opened Feb 28, 2026 by
SwekeR-463
Loading…
1 of 3 tasks
docs: add retriever docs
docs-only
With great power comes great responsibility.
#1407
opened Feb 27, 2026 by
akoumpa
Loading…
3 tasks
fix: cherry-pick combined projection fixes (#1324, #1357) into r0.2.1
#1388
opened Feb 25, 2026 by
HuiyingLi
Loading…
2 tasks
Previous Next
ProTip!
Updated in the last three days: updated:>2026-03-12.