[On Policy Distillation] Fix on-policy distillation example readme issues #1353

zfan2356 · 2026-01-07T08:01:46Z

Hi ~, Currently the instructions for the pre-training setup in the on-policy distillation example README are somewhat confusing. I’ve refined and clarified them so that the preparation commands can now be run sequentially without confusion. and align with the run-qwen3-8B-opd.sh script.

fix opd readme issues

5d29cbb

zfan2356 closed this Jan 8, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[On Policy Distillation] Fix on-policy distillation example readme issues #1353

[On Policy Distillation] Fix on-policy distillation example readme issues #1353

Uh oh!

zfan2356 commented Jan 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

[On Policy Distillation] Fix on-policy distillation example readme issues #1353

[On Policy Distillation] Fix on-policy distillation example readme issues #1353

Uh oh!

Conversation

zfan2356 commented Jan 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant