Refactor MathWorkflows #60

garyzhang99 · 2025-05-28T08:18:30Z

Description

As the title says.
This PR also enable training using base model, as the added BaseModelWorkflow shows.

Checklist

Please check the following items before code is ready to be reviewed.

Code has passed all tests
Docstrings have been added/updated in Google Style
Documentation has been updated
Code is ready for review

pan-x-c · 2025-05-28T09:32:49Z

/run-unittest

github-actions · 2025-05-28T09:47:32Z

Summary

Tests 📝	Passed ✅	Failed ❌	Skipped ⏭️	Pending ⏳	Other ❓	Flaky 🍂	Duration ⏱️
25	25	0	0	0	0	0	847ms

Failed Tests

No failed tests ✨

Flaky Tests

No flaky tests ✨

Skipped

No skipped tests ✨

Tests

Test Name	Status	Duration
tests/buffer/queue_test.py::TestQueueBuffer::test_queue_buffer	✅	4ms
tests/buffer/sql_test.py::TestSQLBuffer::test_create_sql_buffer	✅	1ms
tests/common/config_test.py::TestConfig::test_all_examples_are_valid	✅	1ms
tests/common/config_test.py::TestConfig::test_load_default_config	✅	5ms
tests/common/experience_test.py::TestExperienceConversion::test_batch_conversion	✅	1ms
tests/common/experience_test.py::TestExperienceConversion::test_experience_model_experience_conversion	✅	1ms
tests/common/vllm_test.py::TestModelWrapperSyncV0::test_generate	✅	44ms
tests/common/vllm_test.py::TestModelWrapperAsyncV0::test_generate	✅	43ms
tests/common/vllm_test.py::TestModelWrapperAsyncTPV0::test_generate	✅	52ms
tests/common/vllm_test.py::TestModelWrapperAsyncTPV1::test_generate	✅	53ms
tests/common/vllm_test.py::TestModelWrapperAsyncV1::test_generate	✅	40ms
tests/common/vllm_test.py::TestAPIServer::test_api	✅	22ms
tests/common/vllm_test.py::TestTokenizer::test_assistant_token_mask	✅	1ms
tests/explorer/explorer_test.py::BaseExplorerCase::test_explorer	✅	1ms
tests/explorer/explorer_test.py::TestExplorerCountdownEval::test_explorer	✅	123ms
tests/explorer/explorer_test.py::TestExplorerCountdownNoEval::test_explorer	✅	129ms
tests/explorer/runner_pool_test.py::RunnerPoolTest::test_runner_pool	✅	19ms
tests/explorer/runner_pool_test.py::RunnerPoolTest::test_runner_pool_with_auxiliary_models	✅	3ms
tests/explorer/workflow_test.py::WorkflowTest::test_gsm8k_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_complex_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_fraction_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_workflow	✅	1ms
tests/explorer/workflow_test.py::WorkflowTest::test_math_workflow_with_different_system_prompt	✅	1ms
tests/trainer/trainer_test.py::BaseTrainerCase::test_trainer	✅	1ms
tests/trainer/trainer_test.py::TestTrainerCountdown::test_trainer	✅	295ms

Github Test Reporter by CTRF 💚

pan-x-c · 2025-05-28T14:50:32Z

trinity/common/workflows/math_workflows.py

+        )
+
+    def reset(self, task: Task):
+        if task.format_args.system_prompt is None:


The system_prompt here is a bit confusing

It is basically the same as the old MathWorkflow. Any recommendation on how to improve it?

pan-x-c · 2025-05-28T14:53:42Z

trinity/common/workflows/math_workflows.py

+        self,
+        model: ModelWrapper,
+        task: Task,
+        auxiliary_models: Optional[List[openai.OpenAI]] = None,


We should add a workflow_args: Dict field to the __init__ method of Workflow interface

trinity/common/workflows/workflow.py

问昊 and others added 6 commits May 28, 2025 15:51

change math workflows

cf9845b

add new math workflows

a1c4cc1

fix styles

ffd317a

pass all tests

758a086

add workflows to init

f8088b4

pass pre-commit

d0c64a3

garyzhang99 requested review from hiyuchang, pan-x-c and yanxi-chen May 28, 2025 08:18

pan-x-c reviewed May 28, 2025

View reviewed changes

yanxi-chen reviewed May 29, 2025

View reviewed changes

trinity/common/workflows/workflow.py Outdated Show resolved Hide resolved

remove chinese comments and typos

66d5c43

pan-x-c mentioned this pull request Jun 9, 2025

Add workflow_args for fine-grained control #73

Merged

4 tasks

garyzhang99 closed this Jun 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Refactor MathWorkflows #60

Refactor MathWorkflows #60

Uh oh!

garyzhang99 commented May 28, 2025

Uh oh!

pan-x-c commented May 28, 2025

Uh oh!

github-actions bot commented May 28, 2025

Uh oh!

pan-x-c May 28, 2025

Uh oh!

garyzhang99 May 29, 2025

Uh oh!

pan-x-c May 28, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Refactor MathWorkflows #60

Refactor MathWorkflows #60

Uh oh!

Conversation

garyzhang99 commented May 28, 2025

Description

Checklist

Uh oh!

pan-x-c commented May 28, 2025

Uh oh!

github-actions bot commented May 28, 2025

Summary

Failed Tests

Flaky Tests

Skipped

Tests

Uh oh!

pan-x-c May 28, 2025

Choose a reason for hiding this comment

Uh oh!

garyzhang99 May 29, 2025

Choose a reason for hiding this comment

Uh oh!

pan-x-c May 28, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants