Let LLMs Break Free from Overthinking via Self-Braking Tuning

🔗 arXiv | 📄 PDF | 🌐 Project Page

Haoran Zhao1,2*, Yuchen Yan1*, Yongliang Shen1†, Haolei Xu1, Wenqi Zhang1, Kaitao Song3, Jian Shao1, Weiming Lu1, Jun Xiao1, Yueting Zhuang1
1Zhejiang University, 2Tianjin University, 3Microsoft Research Asia
Preprint. Under review.
*Equal Contribution, †Corresponding Author


Overview of Self-Braking Tuning: Through a specialized data construction method and training strategy, our self-braking model is able to spontaneously halt overthinking.

📝 About

Self-Braking Tuning is a framework that enables large reasoning models to autonomously identify and terminate redundant reasoning, regulating their own reasoning processes without relying on external control mechanisms. For fine-tuning we use the Megatron-LM framework, with the relevant parameters specified in configs/train.yaml; for evaluation we use the vLLM framework as the inference engine, with the corresponding parameters in configs/evaluation.yaml. This repository provides the complete data construction pipeline, which can be applied to nearly any long-chain reasoning dataset to generate corresponding self-braking data.

🛠️ Preparation Steps Before Starting

In Let LLMs Break Free from Overthinking via Self-Braking Tuning, we performed self-braking tuning on the OpenR1-Math dataset. The approach applies to any long-chain reasoning dataset, as long as the reasoning segments are wrapped in <think> and </think> tags. Before training, we recommend keeping the model's max_position_embeddings at 32,768; to extend the context length from 4k to 32k, we raise the RoPE frequency base to 300k.
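As a minimal sketch, a check like the following (a hypothetical helper, not part of this repo) can verify that each sample in your dataset wraps its reasoning segment as required before you run the construction pipeline:

```python
import re

# Matches a single <think>...</think> reasoning block, possibly spanning lines.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

def has_valid_reasoning_segment(sample: str) -> bool:
    """Return True if the sample contains exactly one tagged reasoning block."""
    return len(THINK_RE.findall(sample)) == 1

# Example: a well-formed sample passes, an untagged one does not.
ok = has_valid_reasoning_segment("<think>Factor the quadratic...</think> The answer is 3.")
bad = has_valid_reasoning_segment("No reasoning tags here.")
```

Samples that fail such a check should be re-tagged or dropped before building the self-braking data.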

Our method requires access to an LLM, and the recommended way to provide this is by setting:

export APIKEY=<your_key>

Tip: As a convenient default, we use the OpenAI API. For large-scale datasets, however, it is recommended to deploy open-source models locally with vLLM or another serving framework, and to use techniques such as batch processing for better scalability and cost efficiency.
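A minimal sketch of how a script might pick up the exported key and fail fast if it is missing (load_api_key is illustrative, not a function in this repo):

```python
import os

def load_api_key(env_var: str = "APIKEY") -> str:
    """Read the API key exported above; raise a clear error if it is not set."""
    key = os.environ.get(env_var)
    if not key:
        raise RuntimeError(f"Set {env_var} before running the data construction scripts.")
    return key
```

The returned key can then be passed to whichever client (OpenAI API or a locally deployed vLLM endpoint) you use for data construction.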

🚀 Quick Start

1. Install Dependencies

pip install -r requirements.txt

2. Download

python models/model_download.py
python data/datasets/download_benchmarks.py

3. Download the Baseline Dataset

python data/datasets/download_OpenR1-Math.py

4. Preprocess Data

python data/preprocessing/build_sbt-e.py
python data/preprocessing/build_sbt-d.py

5. Configure and Run Training / Evaluation

Refer to the configuration settings in the following files:

  • train.yaml: Training settings
  • evaluation.yaml: Evaluation settings

📖 Citation

If you find our work helpful, please consider citing it:

@misc{zhao2025letllmsbreakfree,
      title={Let LLMs Break Free from Overthinking via Self-Braking Tuning}, 
      author={Haoran Zhao and Yuchen Yan and Yongliang Shen and Haolei Xu and Wenqi Zhang and Kaitao Song and Jian Shao and Weiming Lu and Jun Xiao and Yueting Zhuang},
      year={2025},
      eprint={2505.14604},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2505.14604}, 
}

📬 Contact Us

If you have any questions, please contact us by email: [email protected]
