You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/en/get_started/customization.md
+54-13Lines changed: 54 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,14 +19,15 @@ Below is a summary of all available customization interfaces and their purposes.
19
19
|[`--custom-loss-function-path`](#9-custom-loss-function---custom-loss-function-path)| Implement custom training loss computation. |
20
20
|[`--custom-tis-function-path`](#10-custom-tisrs-function---custom-tis-function-path)| Implement custom importance sampling for off-policy correction. |
21
21
|[`--custom-reward-post-process-path`](#11-reward-post-processing---custom-reward-post-process-path)| Custom post-processing of rewards before advantage computation. |
22
-
|[`--custom-rollout-log-function-path`](#12-logging-functions)| Custom logging for training rollouts. |
23
-
|[`--custom-eval-rollout-log-function-path`](#12-logging-functions)| Custom logging for evaluation rollouts. |
24
-
|[`--data-source-path`](#13-data-source---data-source-path)| Override the data source for rollout prompts. |
25
-
|[`--eval-function-path`](#14-evaluation-function---eval-function-path)| Override the rollout function specifically for evaluation. |
26
-
|[`--custom-megatron-init-path`](#15-megatron-hooks)| Custom initialization after Megatron setup. |
27
-
|[`--custom-megatron-before-log-prob-hook-path`](#15-megatron-hooks)| Custom logic before log probability computation. |
28
-
|[`--custom-megatron-before-train-step-hook-path`](#15-megatron-hooks)| Custom logic before each training step. |
29
-
|[`--slime-router-middleware-paths`](#16-slime-router-middleware---slime-router-middleware-paths)| Add custom middleware to slime router. |
22
+
|[`--custom-convert-samples-to-train-data-path`](#12-samples-to-train-data-conversion---custom-convert-samples-to-train-data-path)| Override the conversion of samples to training data format. |
23
+
|[`--custom-rollout-log-function-path`](#13-logging-functions)| Custom logging for training rollouts. |
24
+
|[`--custom-eval-rollout-log-function-path`](#13-logging-functions)| Custom logging for evaluation rollouts. |
25
+
|[`--data-source-path`](#14-data-source---data-source-path)| Override the data source for rollout prompts. |
26
+
|[`--eval-function-path`](#15-evaluation-function---eval-function-path)| Override the rollout function specifically for evaluation. |
27
+
|[`--custom-megatron-init-path`](#16-megatron-hooks)| Custom initialization after Megatron setup. |
28
+
|[`--custom-megatron-before-log-prob-hook-path`](#16-megatron-hooks)| Custom logic before log probability computation. |
29
+
|[`--custom-megatron-before-train-step-hook-path`](#16-megatron-hooks)| Custom logic before each training step. |
30
+
|[`--slime-router-middleware-paths`](#17-slime-router-middleware---slime-router-middleware-paths)| Add custom middleware to slime router. |
0 commit comments