File tree
4 files changed
+36
-15
lines changed- docs
- source_en/Instruction/GRPO/DeveloperGuide
- source/Instruction/GRPO/DeveloperGuide
- examples/train/grpo/plugin
- swift/trainers/rlhf_trainer
4 files changed
+36
-15
lines changedLines changed: 13 additions & 5 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
2 | 2 |
| |
3 |
| - | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
4 | 8 |
| |
5 | 9 |
| |
6 | 10 |
| |
| |||
13 | 17 |
| |
14 | 18 |
| |
15 | 19 |
| |
16 |
| - | |
| 20 | + | |
17 | 21 |
| |
18 |
| - | |
| 22 | + | |
19 | 23 |
| |
20 |
| - | |
| 24 | + | |
21 | 25 |
| |
| 26 | + | |
| 27 | + | |
22 | 28 |
| |
23 | 29 |
| |
24 | 30 |
| |
25 | 31 |
| |
26 | 32 |
| |
27 | 33 |
| |
28 | 34 |
| |
| 35 | + | |
| 36 | + | |
| 37 | + | |
29 | 38 |
| |
30 | 39 |
| |
31 | 40 |
| |
32 |
| - | |
33 | 41 |
| |
34 | 42 |
| |
35 | 43 |
| |
|
Lines changed: 13 additions & 6 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
1 | 1 |
| |
2 | 2 |
| |
3 |
| - | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
4 | 8 |
| |
5 | 9 |
| |
6 | 10 |
| |
| |||
12 | 16 |
| |
13 | 17 |
| |
14 | 18 |
| |
| 19 | + | |
15 | 20 |
| |
16 |
| - | |
17 | 21 |
| |
18 |
| - | |
| 22 | + | |
19 | 23 |
| |
20 |
| - | |
| 24 | + | |
21 | 25 |
| |
| 26 | + | |
| 27 | + | |
22 | 28 |
| |
23 | 29 |
| |
24 | 30 |
| |
25 | 31 |
| |
26 | 32 |
| |
27 | 33 |
| |
28 | 34 |
| |
| 35 | + | |
| 36 | + | |
| 37 | + | |
29 | 38 |
| |
30 | 39 |
| |
31 | 40 |
| |
32 |
| - | |
33 |
| - | |
34 | 41 |
| |
35 | 42 |
| |
36 | 43 |
| |
|
Lines changed: 9 additions & 3 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
462 | 462 |
| |
463 | 463 |
| |
464 | 464 |
| |
465 |
| - | |
| 465 | + | |
| 466 | + | |
| 467 | + | |
466 | 468 |
| |
467 | 469 |
| |
468 | 470 |
| |
| |||
521 | 523 |
| |
522 | 524 |
| |
523 | 525 |
| |
524 |
| - | |
| 526 | + | |
525 | 527 |
| |
526 | 528 |
| |
| 529 | + | |
| 530 | + | |
527 | 531 |
| |
528 | 532 |
| |
529 | 533 |
| |
| |||
639 | 643 |
| |
640 | 644 |
| |
641 | 645 |
| |
642 |
| - | |
| 646 | + | |
| 647 | + | |
| 648 | + | |
643 | 649 |
| |
644 | 650 |
| |
645 | 651 |
| |
|
Lines changed: 1 addition & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
907 | 907 |
| |
908 | 908 |
| |
909 | 909 |
| |
910 |
| - | |
| 910 | + | |
911 | 911 |
| |
912 | 912 |
| |
913 | 913 |
| |
|
0 commit comments