Merge pull request #189 from codelion/small-fixes

codelion · web-flow · commit 202f0c4bb379 · 2025-05-25T08:07:00.000+08:00
Small fixes
diff --git a/README.md b/README.md
@@ -366,6 +366,7 @@ Check this log file for connection issues, tool execution errors, and other diag
 | Plugin                  | Slug               | Description                                                                                    |
 | ----------------------- | ------------------ | ---------------------------------------------------------------------------------------------- |
 | System Prompt Learning  | `spl`              | Implements what [Andrej Karpathy called the third paradigm](https://x.com/karpathy/status/1921368644069765486) for LLM learning, this enables the model to acquire program solving knowledge and strategies |
+| Deep Think              | `deepthink`        | Implements a Gemini-like Deep Think approach using inference time scaling for reasoning LLMs |
 | Long-Context Cerebras Planning and Optimization              | `longcepo`              | Combines planning and divide-and-conquer processing of long documents to enable infinite context  |
 | MCP Client              | `mcp`              | Implements the model context protocol (MCP) client, enabling you to use any LLM with any MCP Server  |
 | Router                  | `router`           | Uses the [optillm-modernbert-large](https://huggingface.co/codelion/optillm-modernbert-large) model to route requests to different approaches based on the user prompt |
@@ -564,6 +565,8 @@ called patchflows. We saw huge performance gains across all the supported patchf
 
 ## References
 - [AutoThink: efficient inference for reasoning LLMs](https://dx.doi.org/10.2139/ssrn.5253327) - [Implementation](optillm/autothink)
+- [Self-Discover: Large Language Models Self-Compose Reasoning Structures
+](https://arxiv.org/abs/2402.03620) - [Implementation](optillm/plugings/deepthink)
 - [CePO: Empowering Llama with Reasoning using Test-Time Compute](https://cerebras.ai/blog/cepo) - [Implementation](optillm/cepo)
 - [LongCePO: Empowering LLMs to efficiently leverage infinite context](https://cerebras.ai/blog/longcepo) - [Implementation](optillm/plugins/longcepo)
 - [Chain of Code: Reasoning with a Language Model-Augmented Code Emulator](https://arxiv.org/abs/2312.04474) - [Inspired the implementation of coc plugin](optillm/plugins/coc_plugin.py)
diff --git a/optillm/plugins/deepthink/README.md b/optillm/plugins/deepthink/README.md
@@ -133,4 +133,4 @@ The Deep Think approach has also demonstrated:
 ## References
 
 - Zhou, P. et al. "SELF-DISCOVER: Large Language Models Self-Compose Reasoning Structures" (2024)
-- Uncertainty-routed chain-of-thought approaches in advanced reasoning systems
+- Uncertainty-routed chain-of-thought approaches in advanced reasoning systems is from the paper "Gemini: A Family of Highly Capable Multimodal Models" (2023), https://arxiv.org/abs/2312.11805