Pull requests: deepspeedai/DeepSpeed
Add single parameter allgather optimization for zero3
#7661 opened Oct 31, 2025 by aeeeeeep

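For context, the existing public path for materializing a ZeRO-3 partitioned parameter is deepspeed.zero.GatheredParameters; below is a minimal sketch of the single-parameter case this PR appears to target (the PR's internal fast path is an assumption from its title, and `engine.module.weight` is a hypothetical attribute).

```python
import deepspeed

# Minimal sketch, not the PR's implementation: gather one ZeRO-3
# partitioned parameter via the existing public API. The PR title
# suggests an optimized allgather path for this single-parameter case.
def read_full_weight(engine):
    param = engine.module.weight  # hypothetical attribute, for illustration
    with deepspeed.zero.GatheredParameters(param):
        # Inside the context every rank sees the full tensor; on exit
        # the parameter is re-partitioned across the data-parallel group.
        return param.detach().clone()
```
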
Resolve a 0-dim tensor slicing bug in _get_state_without_padding
#7659 opened Oct 30, 2025 by therealnaveenkamal

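The failure mode named in the title: optimizer state such as Adam's scalar step count can be stored as a 0-dim tensor, and slicing one raises IndexError. A minimal illustration follows; the guard shown is an assumption, not the PR's actual fix.

```python
import torch

scalar_state = torch.tensor(3.0)   # 0-dim tensor, e.g. Adam's "step"
padded_state = torch.zeros(8)      # 1-dim state padded for alignment

def strip_padding(t, true_numel):
    # Slicing a 0-dim tensor raises IndexError, so return scalars as-is.
    if t.dim() == 0:
        return t
    return t[:true_numel]

print(strip_padding(padded_state, 5).shape)  # torch.Size([5])
print(strip_padding(scalar_state, 5))        # tensor(3.)
```
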
Allow separate learning rates "muon_lr" and "adam_lr" for the Muon optimizer
#7658 opened Oct 30, 2025 by delock

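A hypothetical config sketch based only on the PR title; the key names come from the title, but their exact placement in DeepSpeed's optimizer config is an assumption.

```python
# Hypothetical sketch: separate learning rates for Muon-updated matrices
# and the Adam fallback (e.g. embeddings, norms, scalars). Placement of
# these keys in the DeepSpeed config is an assumption.
ds_config = {
    "train_batch_size": 32,
    "optimizer": {
        "type": "Muon",
        "params": {
            "muon_lr": 2e-2,  # lr for parameters updated with Muon
            "adam_lr": 3e-4,  # lr for parameters that fall back to Adam
        },
    },
}
```
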
HF2UCP: Converting a pytorch_model.bin or .safetensors checkpoint to UCP
#7212 opened Apr 10, 2025 by Schwidola0607

[bugfix] Update results of state_dict loading and embedding resizing to secondary partitions (hpz)
#7130 opened Mar 11, 2025 by cyr0930

Fix: pipeline model with MoE causes an error when sending grads
#7055 opened Feb 19, 2025 by wukong1992

Add pyproject.toml with legacy build backend to keep most logic in setup.py
#7033 opened Feb 13, 2025 by loadams

Enable high-performance automatic tensor parallelism (auto TP) for MoE models on multiple GPUs/HPUs
#6964 opened Jan 21, 2025 by gyou2021

Update sharded_moe.py to support top-2 gating with Tutel
#6948 opened Jan 14, 2025 by xenshinu

Fix: forbid repeated deepspeed.initialize on training objects
#6874 opened Dec 16, 2024 by traincheck-team

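A minimal sketch of the anti-pattern the title describes, assuming the PR turns a second initialize call on already-wrapped objects into an explicit error.

```python
import torch
import deepspeed

model = torch.nn.Linear(8, 8)
ds_config = {"train_batch_size": 1}  # minimal config, for illustration

engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config)

# Re-initializing the same training objects is the misuse this PR rejects
# (assumed from the title); shown commented out:
# deepspeed.initialize(model=model,
#                      model_parameters=model.parameters(),
#                      config=ds_config)
```
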
Training ops kernels: speeding up Llama-based MoE architectures
#6734 opened Nov 8, 2024 by RezaYazdaniAminabadi (Draft)