Skip to content

Commit 7f660c6

Browse files
committed
update path
1 parent 87e3c70 commit 7f660c6

File tree

1 file changed

+5
-5
lines changed

1 file changed

+5
-5
lines changed

README.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -27,9 +27,9 @@ To support this method, we construct a high-quality dataset, **VerInstruct**, wi
2727

2828
## Data & Trained Models
2929

30-
- [VerInstruct (22k instruction-following examples with verifiable signals)](./data/)
31-
- [R1-Distill-Qwen-7B-VerIF](./models/qwen2-7b-verif/), based on DeepSeek-R1-R1-Distill-Qwen-7B
32-
- [TULU3-VerIF](./models/tulu3-8b-verif/), based on Llama-3.1-Tulu-3-8B-SFT
30+
- [VerInstruct (22k instruction-following examples with verifiable signals)](data)
31+
- [R1-Distill-Qwen-7B-VerIF](model), based on DeepSeek-R1-R1-Distill-Qwen-7B
32+
- [TULU3-VerIF](model), based on Llama-3.1-Tulu-3-8B-SFT
3333

3434
---
3535

@@ -50,12 +50,12 @@ This repo is forked from [verl](https://github.com/volcengine/verl). We sincerel
5050
Please refer to the original [verl documentation](https://github.com/volcengine/verl) for environment setup.
5151

5252
### Step 1: Preprocess Data
53-
Download data from [here](). Use `./examples/data_preprocess/if_prompts.py` to preprocess VerInstruct.
53+
Download data from [here](#data). Use `./examples/data_preprocess/if_prompts.py` to preprocess VerInstruct.
5454
> Make sure to add the import path for `./verl/utils/reward_score/local_server` at the top of each function.
5555
5656
### Step 2: Setup the Verifier Model
5757
For **soft constraint verification**, use an LLM-based verifier. You may:
58-
- Use our own trained [verifier]() based on R1-Distilled-Qwen-7B
58+
- Use our own trained [verifier](#verifier) based on R1-Distilled-Qwen-7B
5959
- Use **QwQ-32B** as the verifier
6060

6161
We suggest using **SGLang** or **vLLM** for deployment.

0 commit comments

Comments
 (0)