Skip to content

Commit 202f0c4

Browse files
authored
Merge pull request #189 from codelion/small-fixes
Small fixes
2 parents 6fdfef6 + dda39ed commit 202f0c4

File tree

2 files changed

+4
-1
lines changed

2 files changed

+4
-1
lines changed

README.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -366,6 +366,7 @@ Check this log file for connection issues, tool execution errors, and other diag
366366
| Plugin | Slug | Description |
367367
| ----------------------- | ------------------ | ---------------------------------------------------------------------------------------------- |
368368
| System Prompt Learning | `spl` | Implements what [Andrej Karpathy called the third paradigm](https://x.com/karpathy/status/1921368644069765486) for LLM learning, this enables the model to acquire program solving knowledge and strategies |
369+
| Deep Think | `deepthink` | Implements a Gemini-like Deep Think approach using inference time scaling for reasoning LLMs |
369370
| Long-Context Cerebras Planning and Optimization | `longcepo` | Combines planning and divide-and-conquer processing of long documents to enable infinite context |
370371
| MCP Client | `mcp` | Implements the model context protocol (MCP) client, enabling you to use any LLM with any MCP Server |
371372
| Router | `router` | Uses the [optillm-modernbert-large](https://huggingface.co/codelion/optillm-modernbert-large) model to route requests to different approaches based on the user prompt |
@@ -564,6 +565,8 @@ called patchflows. We saw huge performance gains across all the supported patchf
564565

565566
## References
566567
- [AutoThink: efficient inference for reasoning LLMs](https://dx.doi.org/10.2139/ssrn.5253327) - [Implementation](optillm/autothink)
568+
- [Self-Discover: Large Language Models Self-Compose Reasoning Structures
569+
](https://arxiv.org/abs/2402.03620) - [Implementation](optillm/plugings/deepthink)
567570
- [CePO: Empowering Llama with Reasoning using Test-Time Compute](https://cerebras.ai/blog/cepo) - [Implementation](optillm/cepo)
568571
- [LongCePO: Empowering LLMs to efficiently leverage infinite context](https://cerebras.ai/blog/longcepo) - [Implementation](optillm/plugins/longcepo)
569572
- [Chain of Code: Reasoning with a Language Model-Augmented Code Emulator](https://arxiv.org/abs/2312.04474) - [Inspired the implementation of coc plugin](optillm/plugins/coc_plugin.py)

optillm/plugins/deepthink/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -133,4 +133,4 @@ The Deep Think approach has also demonstrated:
133133
## References
134134

135135
- Zhou, P. et al. "SELF-DISCOVER: Large Language Models Self-Compose Reasoning Structures" (2024)
136-
- Uncertainty-routed chain-of-thought approaches in advanced reasoning systems
136+
- Uncertainty-routed chain-of-thought approaches in advanced reasoning systems is from the paper "Gemini: A Family of Highly Capable Multimodal Models" (2023), https://arxiv.org/abs/2312.11805

0 commit comments

Comments
 (0)