System Info
accelerate 1.10.1
os: Linux #25-Ubuntu SMP Wed Mar 30 15:54:22 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
python: 3.10
numpy: 1.26.0
torch version: 2.5.1+cu118
Information
Tasks
Reproduction
accelerate fsdp model
if accelerator.sync_gradients and train_config.max_grad_norm > 0:
all_norm = accelerator.clip_grad_norm_(model.parameters(), train_config.max_grad_norm)
Expected behavior
No RuntimeError
System Info
accelerate 1.10.1 os: Linux #25-Ubuntu SMP Wed Mar 30 15:54:22 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux python: 3.10 numpy: 1.26.0 torch version: 2.5.1+cu118Information
Tasks
no_trainerscript in theexamplesfolder of thetransformersrepo (such asrun_no_trainer_glue.py)Reproduction
accelerate fsdp model
if accelerator.sync_gradients and train_config.max_grad_norm > 0:
all_norm = accelerator.clip_grad_norm_(model.parameters(), train_config.max_grad_norm)
Expected behavior
No RuntimeError