-
Notifications
You must be signed in to change notification settings - Fork 369
Pull requests: THUDM/slime
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Fix][VLM] Fix image_patch_size for vision preprocessing
#1227
opened Dec 26, 2025 by
coding-famer
Loading…
update default paths and disable offloading for AMD qwen3-4B training
#1225
opened Dec 26, 2025 by
Vivicai1005
Loading…
fix: replace blocking sleep with async sleep and fix file handle leak
#1200
opened Dec 24, 2025 by
lancerts
Loading…
[WIP] Implement RDMA P2P weight update using TransferEngine
#1164
opened Dec 20, 2025 by
JD-ETH
Loading…
[FEATURE] Add tool call support for multi-turn SFT with delta-based loss masking
#1159
opened Dec 20, 2025 by
Surya-Gunukula
Loading…
tau-bench: offline stub user + tool parsing fallback
#1158
opened Dec 19, 2025 by
Fengzdadi
Loading…
Add tau2-bench training cookbook and implementation
#1156
opened Dec 19, 2025 by
jbarnes850
Loading…
fix: fix 8B VLM true on policy issue
run-ci-short
#1155
opened Dec 19, 2025 by
nanjiangwill
Loading…
[WIP] Add TerminalBench eval delegate + quickstart
#1154
opened Dec 19, 2025 by
XinyuJiangCMU
•
Draft
6 tasks done
[FSDP][1/n] Support LoRA training for FSDP backend.
#1140
opened Dec 17, 2025 by
GuanxingLu
Loading…
4 tasks
[On Policy Distillation] resolve log prob dimension mismatch in on-policy distillation with CP > 1
#1135
opened Dec 17, 2025 by
Yuchen-Cao
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.