Commit b0577ed

add message for dapo
1 parent 18d219c commit b0577ed

3 files changed: +3 -2 lines changed

examples/dapo_math/README.md

Lines changed: 1 addition & 1 deletion
@@ -1,5 +1,5 @@
 # DAPO on DAPO-MATH-17k dataset [WIP]

-This example shows the usage of DAPO on the [DAPO-MATH-17k](https://huggingface.co/datasets/open-r1/DAPO-Math-17k-Processed) dataset.
+Note this example only shows the usage of GRPO on the [DAPO-MATH-17k](https://huggingface.co/datasets/open-r1/DAPO-Math-17k-Processed) dataset. We plan to implement DAPO algorithm soon.

 The config file is located in [`dapo.yaml`](dapo.yaml).

examples/dapo_math/dapo.yaml

Lines changed: 1 addition & 1 deletion
@@ -2,7 +2,7 @@ project: Trinity-RFT-example
 name: dapo
 checkpoint_root_dir: ${oc.env:TRINITY_CHECKPOINT_ROOT_DIR,./checkpoints}
 model:
-  model_path: ${oc.env:TRINITY_MODEL_PATH,Qwen/Qwen2.5-1.5B-Instruct}
+  model_path: ${oc.env:TRINITY_MODEL_PATH} # Suggest using larger model on this dataset
   max_response_tokens: 20480
   max_model_len: 21504
 algorithm:
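
The hunk above drops the fallback value from the `oc.env` interpolation, so `TRINITY_MODEL_PATH` presumably has to be exported before the example is launched. The following is a minimal standard-library sketch of that "environment variable with optional default" lookup. `resolve_env` is a hypothetical helper written for illustration; it is not OmegaConf's implementation, and the exact error OmegaConf raises for a missing variable may differ.

```python
import os
from typing import Mapping, Optional


def resolve_env(
    name: str,
    default: Optional[str] = None,
    env: Mapping[str, str] = os.environ,
) -> str:
    """Hypothetical stand-in for an env lookup with an optional fallback.

    Mirrors the intent of `${oc.env:NAME,default}` vs. `${oc.env:NAME}`:
    with a default, a missing variable is tolerated; without one, it is an error.
    """
    if name in env:
        return env[name]
    if default is not None:
        return default
    raise KeyError(f"Environment variable '{name}' is not set and no default was given")


# Before this commit: a missing variable falls back to the small default model.
before = resolve_env("TRINITY_MODEL_PATH", "Qwen/Qwen2.5-1.5B-Instruct", env={})

# After this commit: the variable must be provided explicitly.
# "/path/to/model" is a placeholder, not a path taken from the repository.
after = resolve_env("TRINITY_MODEL_PATH", env={"TRINITY_MODEL_PATH": "/path/to/model"})
```

Under this reading, forgetting to set the variable fails at config resolution time instead of silently training the 1.5B model, which matches the inline comment suggesting a larger model for this dataset.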

trinity/common/verl_config.py

Lines changed: 1 addition & 0 deletions
@@ -81,6 +81,7 @@ class FSDPConfig:
     wrap_policy: WrapPolicy = field(default_factory=WrapPolicy)
     fsdp_size: int = -1
     forward_prefetch: bool = False
+    model_dtype: Optional[str] = "fp32"


 @dataclass
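
To illustrate how the new field behaves, here is a small self-contained sketch of a dataclass with the same three scalar fields. `FSDPConfigSketch` is a hypothetical, trimmed-down copy made for this note: it omits `wrap_policy` and any other fields of the real `FSDPConfig`, and the `"bf16"` value is only an assumed example of an override, not a value confirmed by the commit.

```python
from dataclasses import asdict, dataclass, replace
from typing import Optional


@dataclass
class FSDPConfigSketch:
    """Hypothetical, reduced stand-in for FSDPConfig (wrap_policy omitted)."""

    fsdp_size: int = -1
    forward_prefetch: bool = False
    # Field added by this commit; defaults to full precision.
    model_dtype: Optional[str] = "fp32"


# Existing call sites that do not mention model_dtype pick up the default.
default_cfg = FSDPConfigSketch()

# An override changes only the dtype; "bf16" is an assumed example value.
bf16_cfg = replace(default_cfg, model_dtype="bf16")

print(asdict(default_cfg))
print(asdict(bf16_cfg))
```

Because the field has a default, adding it should not break code that constructs the config without naming `model_dtype`.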
