Commit c1eccd2

fix faq
1 parent dae30ac commit c1eccd2

2 files changed, with 4 additions and 4 deletions

docs/sphinx_doc/source/tutorial/faq.md

Lines changed: 2 additions & 2 deletions
@@ -94,7 +94,7 @@ ray start --head

 **A:** The following parameters may be helpful:

-- For trainer, adjust `actor_rollout_ref.actor.ppo_micro_batch_size_per_gpu` when `actor_rollout_ref.actor.use_dynamic_bsz=false`; adjust `actor_rollout_ref.actor.ppo_max_token_len_per_gpu` and `actor_rollout_ref.actor.ulysses_sequence_parallel_size` when `actor_rollout_ref.actor.use_dynamic_bsz=true`. Setting `actor_rollout_ref.actor.entropy_from_logits_with_chunking=true` may also help.
+- For trainer, adjust `trainer.max_token_len_per_gpu` when `trainer.use_dynamic_bsz=false`; adjust `trainer.ppo_max_token_len_per_gpu` and `trainer.ulysses_sequence_parallel_size` when `trainer.use_dynamic_bsz=true`. Setting `trainer.trainer_config.actor_rollout_ref.actor.entropy_from_logits_with_chunking=true` may also help.
 - For explorer, adjust `explorer.rollout_model.tensor_parallel_size`.

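The dotted parameter names in the updated FAQ line read as paths into a nested configuration. The following is a minimal sketch of that reading, assuming each dot maps onto one level of nesting (the hunk itself does not confirm this). The helper `nest` is hypothetical and not part of Trinity-RFT, and every value is a placeholder rather than a recommendation.

```python
from typing import Any


def nest(overrides: dict[str, Any]) -> dict[str, Any]:
    """Expand dotted keys such as 'trainer.use_dynamic_bsz' into nested dicts."""
    root: dict[str, Any] = {}
    for dotted_key, value in overrides.items():
        node = root
        *parents, leaf = dotted_key.split(".")
        for part in parents:
            # Create intermediate levels on demand.
            node = node.setdefault(part, {})
        node[leaf] = value
    return root


# Parameter names are taken from the updated FAQ line; all values are placeholders.
overrides = {
    "trainer.use_dynamic_bsz": True,
    "trainer.ppo_max_token_len_per_gpu": 16384,
    "trainer.ulysses_sequence_parallel_size": 2,
    "trainer.trainer_config.actor_rollout_ref.actor.entropy_from_logits_with_chunking": True,
    "explorer.rollout_model.tensor_parallel_size": 2,
}

config = nest(overrides)
```

Under this assumption, the chunked-entropy switch sits four levels below `trainer`, while the batch-size settings sit directly under it.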

@@ -113,7 +113,7 @@ To debug a new workflow, use Trinity-RFT's debug mode with the following steps:

 1. Launch the inference model via `trinity debug --config <config_file_path> --module inference_model`

-2. Debug the workflow in another terminal via `trinity debug --config <config_file_path> --module workflow --output_file <output_file_path> --plugin_dir <plugin_dir>`
+2. Debug the workflow in another terminal via `trinity debug --config <config_file_path> --module workflow --output-file <output_file_path> --plugin-dir <plugin_dir>`

 Please refer to {ref}`Workflow Development Guide <Workflows>` section for details.
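The flag rename in this hunk (underscores to hyphens) is easy to get wrong when the commands are generated by a script. The following sketch builds both debug commands from keyword arguments and always emits hyphenated option names. The helper `debug_command` is hypothetical, and the config path, output path, and plugin directory are placeholders; only the subcommand and flag names come from the FAQ text.

```python
import shlex


def debug_command(config_path: str, module: str, **options: str) -> list[str]:
    """Build a `trinity debug` argv; option names are written with hyphens."""
    argv = ["trinity", "debug", "--config", config_path, "--module", module]
    for name, value in options.items():
        # Python keyword names use underscores; the CLI flags use hyphens.
        argv += [f"--{name.replace('_', '-')}", value]
    return argv


# Step 1: launch the inference model (placeholder config path).
inference_cmd = debug_command("config.yaml", "inference_model")

# Step 2: debug the workflow in another terminal (placeholder paths).
workflow_cmd = debug_command(
    "config.yaml",
    "workflow",
    output_file="debug_output",
    plugin_dir="plugins",
)

print(shlex.join(inference_cmd))
print(shlex.join(workflow_cmd))
```

The resulting lists can be passed to `subprocess.run` as they are, which avoids shell quoting issues with paths that contain spaces.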

docs/sphinx_doc/source_zh/tutorial/faq.md

Lines changed: 2 additions & 2 deletions
@@ -93,7 +93,7 @@ ray start --head

 **A:** The following parameters may be helpful:

-- For trainer: when `actor_rollout_ref.actor.use_dynamic_bsz=false`, adjust `actor_rollout_ref.actor.ppo_micro_batch_size_per_gpu`; when `actor_rollout_ref.actor.use_dynamic_bsz=true`, adjust `actor_rollout_ref.actor.ppo_max_token_len_per_gpu` and `actor_rollout_ref.actor.ulysses_sequence_parallel_size`. Setting `actor_rollout_ref.actor.entropy_from_logits_with_chunking=true` may also help.
+- For trainer: when `trainer.use_dynamic_bsz=false`, adjust `trainer.max_token_len_per_gpu`; when `trainer.use_dynamic_bsz=true`, adjust `trainer.ppo_max_token_len_per_gpu` and `trainer.ulysses_sequence_parallel_size`. Setting `trainer.trainer_config.actor_rollout_ref.actor.entropy_from_logits_with_chunking=true` may also help.
 - For explorer: adjust `explorer.rollout_model.tensor_parallel_size`

 ## Part 3: Debugging Methods
@@ -113,7 +113,7 @@ trinity run --config grpo_gsm8k/gsm8k.yaml 2>&1 | tee debug.log

 1. Launch the inference model: `trinity debug --config <config_file_path> --module inference_model`

-2. Debug the workflow in another terminal: `trinity debug --config <config_file_path> --module workflow --output_file <output_file_path> --plugin_dir <plugin_dir>`
+2. Debug the workflow in another terminal: `trinity debug --config <config_file_path> --module workflow --output-file <output_file_path> --plugin-dir <plugin_dir>`

 For more details, see the {ref}`Workflow Development Guide <Workflows>` section.
