docs/tutorials/posttraining
1 file changed: +7 -7 lines changed
@@ -117,7 +117,7 @@ Run the following command for GRPO:
 python3 -m src.MaxText.rl.train_rl src/MaxText/configs/rl.yml \
   model_name=${MODEL} \
   tokenizer_path=${TOKENIZER} \
-  load_parameters_path=${MAXTEXT_CKPT_PATH} \
+  load_parameters_path=${MAXTEXT_CKPT_PATH}/0/items \
   run_name=${RUN_NAME} \
   base_output_directory=${BASE_OUTPUT_DIRECTORY} \
   hf_access_token=${HF_TOKEN}
@@ -136,12 +136,12 @@ Run the following command for GSPO:

 ```
 python3 -m src.MaxText.rl.train_rl src/MaxText/configs/rl.yml \
-  model_name=llama3.1-8b \
-  tokenizer_path=meta-llama/Llama-3.1-8B-Instruct \
-  load_parameters_path=gs://path/to/checkpoint/0/items \
-  run_name=$WORKLOAD \
-  base_output_directory=$OUTPUT_PATH \
-  hf_access_token=$HF_TOKEN \
+  model_name=${MODEL} \
+  tokenizer_path=${TOKENIZER} \
+  load_parameters_path=${MAXTEXT_CKPT_PATH}/0/items \
+  run_name=${RUN_NAME} \
+  base_output_directory=${BASE_OUTPUT_DIRECTORY} \
+  hf_access_token=${HF_TOKEN} \
   loss_algo=gspo-token
 ```

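Both commands now pass the checkpoint location with a `/0/items` suffix appended to `${MAXTEXT_CKPT_PATH}`. The following is a minimal shell sketch of how that value could be composed, assuming `MAXTEXT_CKPT_PATH` holds the checkpoint root directory. The bucket path and the `LOAD_PARAMETERS_PATH` variable are hypothetical and do not appear in the tutorial.

```shell
# Hypothetical checkpoint root, for illustration only; substitute your own.
MAXTEXT_CKPT_PATH="gs://my-bucket/maxtext-ckpt/"

# Strip one trailing slash (if present) so the joined path has no "//",
# then append the "/0/items" suffix used by the updated commands.
LOAD_PARAMETERS_PATH="${MAXTEXT_CKPT_PATH%/}/0/items"

echo "load_parameters_path=${LOAD_PARAMETERS_PATH}"
```

Stripping the trailing slash first is a defensive choice: it keeps the result the same whether or not the root path was written with a final `/`.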