Commit 2ea56e9
authored
fix(async_rl): add docs and fix typos for embodiment async rl (RLinf#790)
* docs: add docs for async ppo
* feat: support multi rollout epoch for async ppo
* chore: fix typos in async ppo's yamls
* fix: let async env worker also support multi rollout epoch
* fix: fix correct critic_warmup_steps
* feat: add resume training for async ppo
* fix: convert version to tensor in case perf regression
* chore: move version update logic to set global step
* fix: fix condition when sac does not set global step
---------
Signed-off-by: Bo Dai <daibo@infini-ai.com>1 parent 015ba0f commit 2ea56e9
File tree
20 files changed
+940
-78
lines changed- docs
- source-en/rst_source/tutorials/rlalg
- source-zh/rst_source/tutorials/rlalg
- examples/embodiment/config
- rlinf
- runners
- workers
- actor
- env
- rollout/hf
- tests/e2e_tests/embodied
20 files changed
+940
-78
lines changedLarge diffs are not rendered by default.
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
8 | | - | |
| 8 | + | |
9 | 9 | | |
10 | 10 | | |
11 | 11 | | |
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
| 17 | + | |
17 | 18 | | |
18 | 19 | | |
19 | 20 | | |
| |||
28 | 29 | | |
29 | 30 | | |
30 | 31 | | |
| 32 | + | |
0 commit comments