Commit 6e24e68

e2e example tested, cleanup needed, readme to be updated
Signed-off-by: Suguna Velury <[email protected]>
1 parent 9021845 commit 6e24e68

4 files changed: +491 -33 lines changed

examples/llm_ptq/fsdp2.yaml

Lines changed: 26 additions & 0 deletions
@@ -0,0 +1,26 @@
+compute_environment: LOCAL_MACHINE
+debug: false
+distributed_type: FSDP
+downcast_bf16: 'no'
+enable_cpu_affinity: false
+fsdp_config:
+  fsdp_activation_checkpointing: true
+  fsdp_auto_wrap_policy: TRANSFORMER_BASED_WRAP
+  fsdp_cpu_ram_efficient_loading: true
+  fsdp_offload_params: false
+  fsdp_reshard_after_forward: true
+  fsdp_state_dict_type: FULL_STATE_DICT
+  fsdp_transformer_layer_cls_to_wrap: LlamaDecoderLayer
+  fsdp_use_orig_params: true
+  fsdp_version: 2
+machine_rank: 0
+main_training_function: main
+mixed_precision: 'no'
+num_machines: 2
+num_processes: 16
+rdzv_backend: c10d
+same_network: true
+tpu_env: []
+tpu_use_cluster: false
+tpu_use_sudo: false
+use_cpu: false
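
Reviewer note: the config sets num_machines: 2 and num_processes: 16. Assuming num_processes is the total across all machines (which is how the Accelerate launcher treats it), that implies 8 processes per machine. The sketch below is a hypothetical sanity check, not part of this commit; the helper name processes_per_machine is an assumption introduced for illustration, and the values are copied by hand from fsdp2.yaml rather than parsed from the file.

```python
# Hypothetical helper (not part of the commit): sanity-check the launch
# topology implied by fsdp2.yaml before starting a multi-node run.


def processes_per_machine(config: dict) -> int:
    """Return how many processes each machine runs.

    Assumes `num_processes` is the total across all machines and that
    every machine runs the same number of processes.
    """
    num_processes = int(config["num_processes"])
    num_machines = int(config["num_machines"])
    if num_machines < 1:
        raise ValueError("num_machines must be at least 1")
    if num_processes % num_machines != 0:
        raise ValueError(
            f"num_processes ({num_processes}) is not divisible by "
            f"num_machines ({num_machines})"
        )
    return num_processes // num_machines


# Values copied by hand from examples/llm_ptq/fsdp2.yaml in this commit.
topology = {"num_machines": 2, "num_processes": 16, "machine_rank": 0}
per_machine = processes_per_machine(topology)
print(per_machine)  # 8
```

If the per-machine count does not match the number of GPUs available on each node, num_processes or num_machines likely needs adjusting before launch.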