Skip to content

Commit ea0d0a6

Browse files
committed
fix flags
1 parent c6f438d commit ea0d0a6

File tree

1 file changed

+7
-7
lines changed
  • docs/tutorials/posttraining

1 file changed

+7
-7
lines changed

docs/tutorials/posttraining/rl.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -117,7 +117,7 @@ Run the following command for GRPO:
117117
python3 -m src.MaxText.rl.train_rl src/MaxText/configs/rl.yml \
118118
model_name=${MODEL} \
119119
tokenizer_path=${TOKENIZER} \
120-
load_parameters_path=${MAXTEXT_CKPT_PATH} \
120+
load_parameters_path=${MAXTEXT_CKPT_PATH}/0/items \
121121
run_name=${RUN_NAME} \
122122
base_output_directory=${BASE_OUTPUT_DIRECTORY} \
123123
hf_access_token=${HF_TOKEN}
@@ -136,12 +136,12 @@ Run the following command for GSPO:
136136

137137
```
138138
python3 -m src.MaxText.rl.train_rl src/MaxText/configs/rl.yml \
139-
model_name=llama3.1-8b \
140-
tokenizer_path=meta-llama/Llama-3.1-8B-Instruct \
141-
load_parameters_path=gs://path/to/checkpoint/0/items \
142-
run_name=$WORKLOAD \
143-
base_output_directory=$OUTPUT_PATH \
144-
hf_access_token=$HF_TOKEN \
139+
model_name=${MODEL} \
140+
tokenizer_path=${TOKENIZER} \
141+
load_parameters_path=${MAXTEXT_CKPT_PATH}/0/items \
142+
run_name=${RUN_NAME} \
143+
base_output_directory=${BASE_OUTPUT_DIRECTORY} \
144+
hf_access_token=${HF_TOKEN} \
145145
loss_algo=gspo-token
146146
```
147147

0 commit comments

Comments
 (0)