Skip to content

Commit 3dc083d

Browse files
committed
feat: optimized r1 pipeline for efficient code RL
1 parent 154512d commit 3dc083d

File tree

17 files changed

+1026
-299
lines changed

17 files changed

+1026
-299
lines changed

.gitignore

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,6 @@
11
# code r1
22
data/
3+
models/
34

45
**/*.pt
56
**/checkpoints

README-code-r1.md

Lines changed: 10 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,16 @@
1-
Use the setup script below to avoid environment issues:
1+
# Code R1
2+
3+
## Setup
24

35
```bash
6+
# For sandboxing
7+
sudo add-apt-repository ppa:deki/firejail
8+
sudo apt-get update
9+
sudo apt-get install firejail firejail-profiles
10+
11+
# For training
412
pip install vllm==0.6.3 torch==2.4.0 ray
5-
pip3 install flash-attn --no-build-isolation
13+
pip install flash-attn --no-build-isolation
614
pip install -e . # For verl integration
715
pip install wandb IPython matplotlib gpustat
816
```

0 commit comments

Comments
 (0)