2 parents fc404e9 + 89b2957 commit 59bdf7e
articles/ai-services/openai/how-to/reinforcement-fine-tuning.md
@@ -176,11 +176,11 @@ Models which we're supporting as grader models are:
 "model": string,
 "pass_threshold": number,
 "range": number[],
- "sampling_parameters": {
+ "sampling_params": {
   "seed": number,
   "top_p": number,
   "temperature": number,
-  "max_completion_tokens": number,
+  "max_completions_tokens": number,
   "reasoning_effort": "low" | "medium" | "high"
 }
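Grader configs written against the old key names can be updated with the two renames from this hunk (`sampling_parameters` to `sampling_params`, `max_completion_tokens` to `max_completions_tokens`). The following is a minimal sketch, assuming the config is held as a plain Python dict; `migrate_grader_config` and the sample values are hypothetical and not part of any SDK or of the documented article.

```python
from copy import deepcopy

# Old key -> new key, taken from the two changed lines in the hunk.
TOP_LEVEL_RENAMES = {"sampling_parameters": "sampling_params"}
NESTED_RENAMES = {"max_completion_tokens": "max_completions_tokens"}


def migrate_grader_config(config: dict) -> dict:
    """Return a copy of `config` with the old key names replaced (hypothetical helper)."""
    migrated = deepcopy(config)  # leave the caller's dict untouched

    # Rename the sampling block itself.
    for old, new in TOP_LEVEL_RENAMES.items():
        if old in migrated:
            migrated[new] = migrated.pop(old)

    # Rename keys inside the sampling block, if one is present.
    params = migrated.get("sampling_params")
    if isinstance(params, dict):
        for old, new in NESTED_RENAMES.items():
            if old in params:
                params[new] = params.pop(old)

    return migrated


# Illustrative values only; the model name is a placeholder.
old_config = {
    "model": "example-grader-model",
    "pass_threshold": 0.5,
    "range": [0, 1],
    "sampling_parameters": {
        "seed": 42,
        "top_p": 1.0,
        "temperature": 1.0,
        "max_completion_tokens": 512,
        "reasoning_effort": "medium",
    },
}
new_config = migrate_grader_config(old_config)
```

The helper copies the input before renaming, so a config that already uses the new names passes through unchanged.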