Skip to content

Commit 30ca213

Browse files
authored
Release Trinity-RFT 0.3.2 (#369)
1 parent 3640ae2 commit 30ca213

File tree

4 files changed

+17
-5
lines changed

4 files changed

+17
-5
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -83,7 +83,7 @@ Trinity-RFT is a flexible, general-purpose framework for reinforcement fine-tuni
8383
## 🚀 News
8484

8585
* [2025-11] Introducing [BOTS](https://github.com/modelscope/Trinity-RFT/tree/main/examples/bots): online RL task selection for efficient LLM fine-tuning ([paper](https://arxiv.org/pdf/2510.26374)).
86-
* [2025-10] [[Release Notes](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.2)] Trinity-RFT v0.3.2 released: bug fixes and advanced task selection & scheduling.
86+
* [2025-11] [[Release Notes](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.2)] Trinity-RFT v0.3.2 released: bug fixes and advanced task selection & scheduling.
8787
* [2025-10] [[Release Notes](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.1)] Trinity-RFT v0.3.1 released: multi-stage training support, improved agentic RL examples, LoRA support, debug mode and new RL algorithms.
8888
* [2025-09] [[Release Notes](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.0)] Trinity-RFT v0.3.0 released: enhanced Buffer, FSDP2 & Megatron support, multi-modal models, and new RL algorithms/examples.
8989
* [2025-08] Introducing [CHORD](https://github.com/modelscope/Trinity-RFT/tree/main/examples/mix_chord): dynamic SFT + RL integration for advanced LLM fine-tuning ([paper](https://arxiv.org/pdf/2508.11408)).

README_zh.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -83,8 +83,8 @@ Trinity-RFT 是一个灵活、通用的大语言模型(LLM)强化微调(RF
8383

8484
## 🚀 新闻
8585

86-
* [2025-11] 推出 [BOTS](https://github.com/modelscope/Trinity-RFT/tree/main/examples/bots)在线RL任务选择,实现高效LLM微调[论文](https://arxiv.org/pdf/2510.26374))。
87-
* [2025-10] [[发布说明](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.2)] Trinity-RFT v0.3.2 发布:修复若干 Bug 并支持进阶的任务选择和调度。
86+
* [2025-11] 推出 [BOTS](https://github.com/modelscope/Trinity-RFT/tree/main/examples/bots)在线 RL 任务选择,实现高效 LLM 微调[论文](https://arxiv.org/pdf/2510.26374))。
87+
* [2025-11] [[发布说明](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.2)] Trinity-RFT v0.3.2 发布:修复若干 Bug 并支持进阶的任务选择和调度。
8888
* [2025-10] [[发布说明](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.1)] Trinity-RFT v0.3.1 发布:多阶段训练支持、改进的智能体 RL 示例、LoRA 支持、调试模式和全新 RL 算法。
8989
* [2025-09] [[发布说明](https://github.com/modelscope/Trinity-RFT/releases/tag/v0.3.0)] Trinity-RFT v0.3.0 发布:增强的 Buffer、FSDP2 & Megatron 支持,多模态模型,以及全新 RL 算法/示例。
9090
* [2025-08] 推出 [CHORD](https://github.com/modelscope/Trinity-RFT/tree/main/examples/mix_chord):动态 SFT + RL 集成,实现进阶 LLM 微调([论文](https://arxiv.org/pdf/2508.11408))。

examples/bots/workflow/bots_math_boxed_workflow.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -19,12 +19,12 @@ def format_messages(self):
1919
return self.task_desc
2020

2121
@property
22-
def task_desc(self) -> Union[str, None]:
22+
def task_desc(self) -> Union[str, None]: # type: ignore [override]
2323
prompt_key = self.format_args.prompt_key
2424
return nested_query(prompt_key, self.raw_task) # type: ignore
2525

2626
@property
27-
def truth(self) -> Union[str, None]:
27+
def truth(self) -> Union[str, None]: # type: ignore [override]
2828
response_key = self.format_args.response_key
2929
return nested_query(response_key, self.raw_task)
3030

trinity/common/workflows/workflow.py

Lines changed: 12 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -60,6 +60,18 @@ def to_workflow(
6060
auxiliary_models=auxiliary_models,
6161
)
6262

63+
# Deprecated property, will be removed in the future
64+
@property
65+
def task_desc(self) -> Union[str, None]:
66+
prompt_key = self.format_args.prompt_key
67+
return self.raw_task[prompt_key] if prompt_key in self.raw_task else None # type: ignore
68+
69+
# Deprecated property, will be removed in the future
70+
@property
71+
def truth(self) -> Union[str, None]:
72+
response_key = self.format_args.response_key
73+
return self.raw_task[response_key] if response_key in self.raw_task else None # type: ignore
74+
6375
def to_dict(self) -> dict:
6476
return self.raw_task # type: ignore
6577

0 commit comments

Comments
 (0)