Skip to content

Commit f25426c

Browse files
committed
b300 dsv3 bf16 hang fix
Signed-off-by: Malay Nagda <malayn@nvidia.com>
1 parent c3836cd commit f25426c

File tree

1 file changed

+1
-0
lines changed

1 file changed

+1
-0
lines changed

scripts/performance/configs/deepseek/deepseek_workload_base_configs.py

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -149,6 +149,7 @@
149149
DEEPSEEK_V3_PRETRAIN_CONFIG_B300_V2 = replace(
150150
DEEPSEEK_V3_PRETRAIN_CONFIG_B300_V1,
151151
global_batch_size=4096,
152+
recompute_modules=["mla_up_proj", "moe_act", "layernorm", "mlp", "moe"],
152153
)
153154
DEEPSEEK_V3_PRETRAIN_CONFIG_B300_BF16_V2 = replace(
154155
DEEPSEEK_V3_PRETRAIN_CONFIG_B300_V2,

0 commit comments

Comments
 (0)