Pull requests: NVIDIA/Megatron-LM
Fix checkpoint loading with load_main_params_from_ckpt=True for grouped weight
#4324
opened Apr 15, 2026 by
ksivaman
Member
5 tasks
remove legacy GPT code
complexity: high
Expert Review
[deprecated] Apply this label to indicate that your PR is ready for expert review.
SafeUnpickler class for safe pickle usage
complexity: low
Expert Review
Final Review
PR is in the "final review" stage
Run functional tests
Add TEFusedDenseMLP for Dense+Grouped GEMM fusion on SM100+
complexity: medium
#4318
opened Apr 15, 2026 by
sraman-rgb
5 tasks
Eliminate QPs during checkpoint loading, no need for review or merge
#4317
opened Apr 15, 2026 by
CarlosGomes98
Contributor
Draft
fix: correct 'Seperate'/'Seperated' typo in comments
community-request
#4313
opened Apr 15, 2026 by
MukundaKatta
Draft
feat: Optimize memory footprint of long-context training via fused kernel and chunking
community-request
#4312
opened Apr 15, 2026 by
terminator123
Draft
Fix fused grouped MLP wgrad hooks for DDP reduce-scatter
complexity: low
#4311
opened Apr 15, 2026 by
gdengk
Contributor
5 tasks
Add quantization type debug logging.
#4308
opened Apr 14, 2026 by
kwyss-nvidia
Contributor
Draft
5 tasks
Move inference context bookkeeping to CPU with ContextGPUView
#4306
opened Apr 14, 2026 by
lmcafee-nvidia
Contributor
Draft
8 tasks
checkpoint integrity verification
complexity: medium
Expert Review
Run functional tests
ci: add sync-skills workflow, rename CLAUDE.md → AGENTS.md, move .claude/skills → skills/
Approved
All necessary approvals have been made
complexity: low
Fix aux loss computation with per-token loss and dynamic-cp
#4302
opened Apr 14, 2026 by
xiaoyao0115
Contributor
Draft
5 tasks
Inference | Per-block MoE routing storage for prefix caching
complexity: medium
Final Review
#4301
opened Apr 14, 2026 by
lmcafee-nvidia
Contributor
2 of 3 tasks