You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/en/notes/guide/pipelines/ReasoningPipeline.md
+50-18Lines changed: 50 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -24,7 +24,54 @@ The main processes of the pipeline include:
24
24
2.**Answer Generation and Processing**: Processing based on standard answers or model-generated answers for problems, including format filtering, length filtering, and correctness verification.
25
25
3.**Data Deduplication**: Deduplicating generated question-answer data to ensure dataset quality.
Additionally, you can choose to run any other Pipeline code according to your needs, and the execution method is similar. Next, we will introduce the operators used in the Pipeline and how to configure parameters.
73
+
74
+
## 3. Data Flow and Pipeline Logic
28
75
29
76
### 1. **Input Data**
30
77
@@ -240,21 +287,6 @@ Finally, the output data generated by the pipeline will contain the following co
240
287
***primary\_category**: Primary category of the problem
241
288
***secondary\_category**: Secondary category of the problem
242
289
243
-
## 3. Execution Methods
244
-
245
-
The pipeline executes different configurations through simple Python commands to meet different data needs:
246
-
247
-
***Strong reasoning instruction fine-tuning data synthesis**:
0 commit comments