From 00ab2efde2151d761ab70723ee21d368e4ba89dd Mon Sep 17 00:00:00 2001 From: Aditya Prakash <55011564+AdityaPrakash-26@users.noreply.github.com> Date: Sun, 12 Oct 2025 09:50:23 -0400 Subject: [PATCH] Fix typo in training_args parameter name Its num_generations, not num_generation https://huggingface.co/docs/trl/en/grpo_trainer#trl.GRPOConfig --- chapters/en/chapter12/4.mdx | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/chapters/en/chapter12/4.mdx b/chapters/en/chapter12/4.mdx index 769bb561c..4e440bd61 100644 --- a/chapters/en/chapter12/4.mdx +++ b/chapters/en/chapter12/4.mdx @@ -89,7 +89,7 @@ training_args = GRPOConfig( # Essential parameters output_dir="output", num_train_epochs=3, - num_generation=4, # Number of completions to generate for each prompt + num_generations=4, # Number of completions to generate for each prompt per_device_train_batch_size=4, # We want to get all generations in one device batch # Optional but useful gradient_accumulation_steps=2,