Skip to content

Commit 43531d6

Browse files
authored
Bumping version to 0.3.2 (#358)
1 parent d9799d9 commit 43531d6

File tree

10 files changed

+25
-15
lines changed

10 files changed

+25
-15
lines changed

README.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -82,6 +82,7 @@ Trinity-RFT is a flexible, general-purpose framework for reinforcement fine-tuni
8282

8383
## 🚀 News
8484

85+
* [2025-10] [[Release Notes](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.2)] Trinity-RFT v0.3.2 released: bug fixes and advanced task selection & scheduling.
8586
* [2025-10] [[Release Notes](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.1)] Trinity-RFT v0.3.1 released: multi-stage training support, improved agentic RL examples, LoRA support, debug mode and new RL algorithms.
8687
* [2025-09] [[Release Notes](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.0)] Trinity-RFT v0.3.0 released: enhanced Buffer, FSDP2 & Megatron support, multi-modal models, and new RL algorithms/examples.
8788
* [2025-08] Introducing [CHORD](https://github.com/modelscope/Trinity-RFT/tree/main/examples/mix_chord): dynamic SFT + RL integration for advanced LLM fine-tuning ([paper](https://arxiv.org/pdf/2508.11408)).
@@ -177,14 +178,14 @@ uv sync --extra dev --extra flash_attn
177178
If you just want to use the package without modifying the code:
178179

179180
```bash
180-
pip install trinity-rft==0.3.1
181+
pip install trinity-rft
181182
pip install flash-attn==2.8.1
182183
```
183184

184185
Or with `uv`:
185186

186187
```bash
187-
uv pip install trinity-rft==0.3.1
188+
uv pip install trinity-rft
188189
uv pip install flash-attn==2.8.1
189190
```
190191

README_zh.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -83,6 +83,7 @@ Trinity-RFT 是一个灵活、通用的大语言模型(LLM)强化微调(RF
8383

8484
## 🚀 新闻
8585

86+
* [2025-10] [[发布说明](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.2)] Trinity-RFT v0.3.2 发布:修复若干 Bug 并支持进阶的任务选择和调度。
8687
* [2025-10] [[发布说明](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.1)] Trinity-RFT v0.3.1 发布:多阶段训练支持、改进的智能体 RL 示例、LoRA 支持、调试模式和全新 RL 算法。
8788
* [2025-09] [[发布说明](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.0)] Trinity-RFT v0.3.0 发布:增强的 Buffer、FSDP2 & Megatron 支持,多模态模型,以及全新 RL 算法/示例。
8889
* [2025-08] 推出 [CHORD](https://github.com/modelscope/Trinity-RFT/tree/main/examples/mix_chord):动态 SFT + RL 集成,实现进阶 LLM 微调([论文](https://arxiv.org/pdf/2508.11408))。
@@ -176,14 +177,14 @@ uv sync --extra dev --extra flash_attn
176177
如果您只需使用 Trinity-RFT 而不打算修改代码:
177178

178179
```bash
179-
pip install trinity-rft==0.3.1
180+
pip install trinity-rft
180181
pip install flash-attn==2.8.1
181182
```
182183

183184
或使用 `uv`
184185

185186
```bash
186-
uv pip install trinity-rft==0.3.1
187+
uv pip install trinity-rft
187188
uv pip install flash-attn==2.8.1
188189
```
189190

docs/sphinx_doc/source/tutorial/trinity_installation.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -65,14 +65,14 @@ uv sync --extra dev --extra flash_attn
6565
If you just want to use the package without modifying the code:
6666

6767
```bash
68-
pip install trinity-rft==0.3.1
68+
pip install trinity-rft
6969
pip install flash-attn==2.8.1
7070
```
7171

7272
Or with `uv`:
7373

7474
```bash
75-
uv pip install trinity-rft==0.3.1
75+
uv pip install trinity-rft
7676
uv pip install flash-attn==2.8.1
7777
```
7878

docs/sphinx_doc/source_zh/tutorial/trinity_installation.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -65,14 +65,14 @@ uv sync --extra dev --extra flash_attn
6565
如果您只需使用 Trinity-RFT 而不打算修改代码:
6666

6767
```bash
68-
pip install trinity-rft==0.3.1
68+
pip install trinity-rft
6969
pip install flash-attn==2.8.1
7070
```
7171

7272
或使用 `uv`
7373

7474
```bash
75-
uv pip install trinity-rft==0.3.1
75+
uv pip install trinity-rft
7676
uv pip install flash-attn==2.8.1
7777
```
7878

pyproject.toml

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
44

55
[project]
66
name = "trinity-rft"
7-
version = "0.4.0.dev0"
7+
version = "0.3.2"
88
authors = [
99
{name="Trinity-RFT Team", email="[email protected]"},
1010
]
@@ -87,6 +87,10 @@ doc = [
8787
"sphinxcontrib-mermaid",
8888
]
8989

90+
mm = [
91+
"qwen-vl-utils",
92+
]
93+
9094
flash_attn = [
9195
"flash-attn==2.8.1"
9296
]

scripts/docker/Dockerfile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ RUN apt update && apt install -y \
2626
# copy the Trinity-RFT dir into the workspace
2727
COPY . .
2828

29-
RUN pip install --upgrade pip && pip install -e .[dev] && pip install flash_attn==2.8.1 --no-build-isolation
29+
RUN pip install --upgrade pip && pip install -e .[mm,dev] && pip install flash_attn==2.8.1 --no-build-isolation
3030

3131
# Set Env variables
3232

scripts/docker_for_megatron/Dockerfile

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ COPY . .
2727

2828
# Install Trinity-RFT with Megatron
2929
RUN pip install --upgrade pip \
30-
&& pip install -e .[dev] \
30+
&& pip install -e .[mm,dev] \
3131
&& pip install flash_attn==2.8.1 --no-build-isolation \
3232
&& pip install -e .[megatron] \
3333
&& pip install -v --disable-pip-version-check --no-cache-dir --no-build-isolation \

tests/common/vllm_test.py

Lines changed: 5 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -261,11 +261,13 @@ async def test_model_len(self):
261261

262262
response = openai_client.chat.completions.create(model=model_id, messages=messages[1:], n=1)
263263
self.assertEqual(len(response.choices), 1)
264-
print(response.choices[0].message.content)
265264
exps = self.model_wrapper.extract_experience_from_history()
266265
self.assertEqual(len(exps), 1)
267-
# only generate max_model_len - prompt_len tokens
268-
self.assertEqual(len(exps[0].tokens), self.max_model_len)
266+
# only generate max_response_tokens tokens
267+
self.assertEqual(
268+
len(exps[0].tokens),
269+
response.usage.prompt_tokens + self.config.model.max_response_tokens,
270+
)
269271

270272

271273
class TestAPIServer(RayUnittestBaseAysnc):

trinity/__init__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
11
# -*- coding: utf-8 -*-
22
"""Trinity-RFT (Reinforcement Fine-Tuning)"""
33

4-
__version__ = "0.4.0.dev0"
4+
__version__ = "0.3.2"

trinity/common/models/vllm_model.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -51,6 +51,8 @@ def __init__(
5151
os.environ["VLLM_RAY_PER_WORKER_GPUS"] = str(int(config.use_v1))
5252
os.environ["VLLM_WORKER_MULTIPROC_METHOD"] = "spawn"
5353
os.environ["VLLM_ENABLE_V1_MULTIPROCESSING"] = "0"
54+
if get_vllm_version() >= parse_version("0.11.0"):
55+
os.environ["VLLM_ALLREDUCE_USE_SYMM_MEM"] = "0"
5456
self.default_sampling_params = vllm.SamplingParams(
5557
n=1,
5658
temperature=0.0,

0 commit comments

Comments
 (0)