Skip to content

Pull requests: NVIDIA-NeMo/RL

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: Add Nemotron‑3 Nano 30B A3B GRPO nightly tests (FSDP2, +LoRA) documentation Improvements or additions to documentation
#1866 opened Feb 3, 2026 by RayenTian Loading…
4 tasks
fix: prevent crash in rollout metric calculation when just 1 value CI:L1 Run doctests, unit tests, and functional tests
#1864 opened Feb 2, 2026 by terrykong Loading…
4 tasks
feat: add val_at_end for all algorithms CI:L1 Run doctests, unit tests, and functional tests
#1863 opened Feb 2, 2026 by terrykong Loading…
4 tasks
fix: add log_plot to the logger interface CI:L1 Run doctests, unit tests, and functional tests
#1862 opened Feb 2, 2026 by terrykong Loading…
4 tasks
chore: add assert for tp4 batch variant accuracy issue CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1861 opened Feb 2, 2026 by yuki-97 Loading…
feat: add way of excluding generation backends documentation Improvements or additions to documentation
#1855 opened Jan 31, 2026 by terrykong Loading…
ci: Add secrets detector CI Relating to CI
#1854 opened Jan 30, 2026 by chtruong814 Loading…
4 tasks
Mdp
#1849 opened Jan 29, 2026 by shanmugamr1992 Loading…
4 tasks
Add Muon post-training support documentation Improvements or additions to documentation
#1848 opened Jan 29, 2026 by ashors1 Draft
4 tasks
feat: Added save_optimizer flag to control saving optimizer or not in checkpointing CI:L0 Run doctests and unit tests community-request needs-follow-up Issue needs follow-up
#1843 opened Jan 29, 2026 by odedovadia Loading…
1 task
feat: enforce monotonicity config option
#1840 opened Jan 29, 2026 by cmunley1 Loading…
4 tasks
feat: Mask sequences with high logprob error CI:L1 Run doctests, unit tests, and functional tests
#1838 opened Jan 29, 2026 by yfw Loading…
4 tasks
fix: allow multi epoch training for async grpo CI:L0 Run doctests and unit tests
#1836 opened Jan 28, 2026 by parthchadha Loading…
4 tasks
Mcore dp coordinator implementation initial
#1833 opened Jan 27, 2026 by shanmugamr1992 Loading…
4 tasks
ci: introduce renovate to deal with bumping our dependencies CI Relating to CI
#1823 opened Jan 23, 2026 by terrykong Draft
4 tasks
perf: Update cudnn to 9.14 CI:L2 Run doctests, unit tests, functional tests, and convergence tests deepseek Related to deepseek 671b
#1820 opened Jan 23, 2026 by guyueh1 Loading…
4 tasks
feat: Implement ProRLv2 recipe CI:L1 Run doctests, unit tests, and functional tests documentation Improvements or additions to documentation
#1809 opened Jan 22, 2026 by hijkzzz Loading…
feat: unify nemogym dataset CI:L1 Run doctests, unit tests, and functional tests
#1807 opened Jan 22, 2026 by yuki-97 Draft
chore: cuda13 support CI:L2 Run doctests, unit tests, functional tests, and convergence tests
#1803 opened Jan 21, 2026 by guyueh1 Loading…
4 tasks
Bxyu/gym dynamic sampling
#1793 opened Jan 18, 2026 by bxyu-nvidia Draft
4 tasks
Feat: Megatron LoRA GRPO sync colocated [1/3]
#1790 opened Jan 17, 2026 by vadam5 Draft
4 tasks
ProTip! Adding no:label will show everything without a label.