Skip to content

Commit b7f9358

Browse files
committed
Add TEST ONLY header to deepseek toy model config
Signed-off-by: Yuhe Zhang <yuhe@polarr.co>
1 parent f9b9e23 commit b7f9358

File tree

1 file changed

+7
-0
lines changed

1 file changed

+7
-0
lines changed

examples/llm_finetune/deepseek/deepseek_moe_lora_small_for_test.yaml

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,10 @@
1+
# ------------------------------------------------------------------------------------------------
2+
# TEST ONLY: Small DeepSeek-style MoE for E2E Testing
3+
# ------------------------------------------------------------------------------------------------
4+
# This is NOT a real model config. It creates a randomly initialized small MoE
5+
# for verifying the training pipeline works end-to-end without downloading real weights.
6+
# DO NOT use this for actual training.
7+
# ------------------------------------------------------------------------------------------------
18

29
step_scheduler:
310
global_batch_size: 32

0 commit comments

Comments
 (0)