-
Notifications
You must be signed in to change notification settings - Fork 227
Pull requests: bigscience-workshop/Megatron-DeepSpeed
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: correct model_type comparison bug, f-string typo, and bare except clauses
#410
opened Apr 24, 2026 by
Ricardo-M-L
Loading…
3 tasks done
Startup: add argument-consistency checks & summary table (Fixes #124)
#409
opened Jun 20, 2025 by
MagellaX
Loading…
fix(training): correct rank-zero log messages, Print total model size once at startup (rank-0) – Fixes #123
#408
opened Jun 20, 2025 by
MagellaX
Loading…
Bump black from 21.4b0 to 24.3.0
dependencies
Pull requests that update a dependency file
#402
opened Mar 20, 2024 by
dependabot
Bot
Loading…
[checkpoints] replace bf16 with fp32 checkpoint weights
#327
opened Aug 10, 2022 by
stas00
Contributor
Loading…
a branch combining layer-norm-auto-sync and ds_ckpt_reshape
#292
opened Jun 29, 2022 by
stas00
Contributor
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.