You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: optillm/autothink/README.md
+17-2Lines changed: 17 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,7 +15,7 @@ AutoThink combines several advanced techniques to optimize the thinking process
15
15
16
16
### 1. Query Classification
17
17
18
-
AutoThink uses the `adaptive-classifier/llm-router` model to classify incoming queries:
18
+
AutoThink uses the `adaptive-classifier/llm-router`[model](https://huggingface.co/adaptive-classifier/llm-router) to classify incoming queries:
19
19
20
20
-**HIGH**: Complex queries requiring deep reasoning, multi-step calculations, or thorough exploration
21
21
-**LOW**: Simpler queries requiring less extensive reasoning
@@ -29,7 +29,7 @@ Based on the classification, AutoThink allocates different token budgets for the
29
29
30
30
### 3. Steering Vectors
31
31
32
-
AutoThink uses pre-extracted steering vectors from datasets like `codelion/Qwen3-0.6B-pts-steering-vectors`. These vectors represent different reasoning patterns:
32
+
AutoThink uses pre-extracted steering vectors from [datasets](https://huggingface.co/datasets?other=pts) like `codelion/Qwen3-0.6B-pts-steering-vectors`. These vectors represent different reasoning patterns:
33
33
34
34
-**Depth and thoroughness**: Encourages detailed, step-by-step reasoning
35
35
-**Numerical accuracy**: Promotes precise calculations and verification
@@ -93,3 +93,18 @@ response = autothink_decode(
93
93
-**Enhanced Reasoning**: Steering vectors guide the model toward better reasoning patterns
94
94
-**Efficiency**: Better performance without increasing model size
95
95
-**Customizability**: Can be tailored for different domains using domain-specific steering vector datasets
96
+
97
+
98
+
## Citation
99
+
100
+
If you use this approach in your research, please cite:
101
+
102
+
```bibtex
103
+
@article{autothink,
104
+
title={AutoThink: efficient inference for reasoning LLMs},
0 commit comments