Skip to content

Commit 8d01cde

Browse files
committed
Update README.md
1 parent 6a7aef9 commit 8d01cde

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

README.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -137,6 +137,7 @@ pip install -r requirements.txt
137137
| Mixture of Agents | `moa` | Combines responses from multiple critiques |
138138
| Monte Carlo Tree Search | `mcts` | Uses MCTS for decision-making in chat responses |
139139
| PV Game | `pvg` | Applies a prover-verifier game approach at inference time |
140+
| [Deep Confidence](optillm/deepconf) | N/A for proxy | Implements confidence-guided reasoning with multiple intensity levels for enhanced accuracy |
140141
| CoT Decoding | N/A for proxy | Implements chain-of-thought decoding to elicit reasoning without explicit prompting |
141142
| Entropy Decoding | N/A for proxy | Implements adaptive sampling based on the uncertainty of tokens during generation |
142143
| Thinkdeeper | N/A for proxy | Implements the `reasoning_effort` param from OpenAI for reasoning models like DeepSeek R1 |
@@ -161,6 +162,7 @@ pip install -r requirements.txt
161162
| GenSelect | `genselect` | Generative Solution Selection - generates multiple candidates and selects the best based on quality criteria |
162163
| Web Search | `web_search` | Performs Google searches using Chrome automation (Selenium) to gather search results and URLs |
163164
| [Deep Research](optillm/plugins/deep_research) | `deep_research` | Implements Test-Time Diffusion Deep Researcher (TTD-DR) for comprehensive research reports using iterative refinement |
165+
| [Proxy](optillm/plugins/proxy) | `proxy` | Load balancing and failover across multiple LLM providers with health monitoring and round-robin routing |
164166

165167
We support all major LLM providers and models for inference. You need to set the correct environment variable and the proxy will pick the corresponding client.
166168

@@ -703,6 +705,7 @@ python -m pytest tests/
703705
## References
704706
- [Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques](https://arxiv.org/abs/2506.08060)
705707
- [AutoThink: efficient inference for reasoning LLMs](https://dx.doi.org/10.2139/ssrn.5253327) - [Implementation](optillm/autothink)
708+
- [Deep Think with Confidence: Confidence-guided reasoning and inference-time scaling](https://arxiv.org/abs/2508.15260) - [Implementation](optillm/deepconf)
706709
- [Self-Discover: Large Language Models Self-Compose Reasoning Structures
707710
](https://arxiv.org/abs/2402.03620) - [Implementation](optillm/plugings/deepthink)
708711
- [CePO: Empowering Llama with Reasoning using Test-Time Compute](https://cerebras.ai/blog/cepo) - [Implementation](optillm/cepo)

0 commit comments

Comments
 (0)