File tree Expand file tree Collapse file tree 1 file changed +14
-1
lines changed Expand file tree Collapse file tree 1 file changed +14
-1
lines changed Original file line number Diff line number Diff line change @@ -36,6 +36,9 @@ deepspeed: deepspeed_configs/zero1.json
36
36
### Usage {#sec-deepspeed-usage}
37
37
38
38
``` {.bash}
39
+ # Fetch deepspeed configs (if not already present)
40
+ axolotl fetch deepspeed_configs
41
+
39
42
# Passing arg via config
40
43
axolotl train config.yml
41
44
@@ -48,10 +51,20 @@ axolotl train config.yml --deepspeed deepspeed_configs/zero1.json
48
51
We provide default configurations for:
49
52
50
53
- ZeRO Stage 1 (` zero1.json ` )
54
+ - ZeRO Stage 1 with torch compile (` zero1_torch_compile.json ` )
51
55
- ZeRO Stage 2 (` zero2.json ` )
52
56
- ZeRO Stage 3 (` zero3.json ` )
57
+ - ZeRO Stage 3 with bf16 (` zero3_bf16.json ` )
58
+ - ZeRO Stage 3 with bf16 and CPU offload params(` zero3_bf16_cpuoffload_params.json ` )
59
+ - ZeRO Stage 3 with bf16 and CPU offload params and optimizer (` zero3_bf16_cpuoffload_all.json ` )
60
+
61
+ ::: {.callout-tip}
62
+
63
+ Choose the configuration that offloads the least amount to memory while still being able to fit on VRAM for best performance.
53
64
54
- Choose based on your memory requirements and performance needs.
65
+ Start from Stage 1 -> Stage 2 -> Stage 3.
66
+
67
+ :::
55
68
56
69
## FSDP {#sec-fsdp}
57
70
You can’t perform that action at this time.
0 commit comments