coco12373/Awesome-LLM-Resources-List

 
 


🌟 Awesome LLM Resources

A Curated Collection of LLM resources. 💡✨

🌐 Updated: 22nd of June 2025

'Serverless' Hosting of Private/OS Models

Platform/Tool Rel. Scale Down OS 🔓 GH Start One-Click Dev Exp. Free-Tier
Baseten 2019 > 15 min 🔴 GitHub followers Guide 🟡 👍 $30
Modal 2021 < 1 min 🔴 GitHub followers Helpers 👍 $30/m
HF Endpoints 2023 > 15 min 🔴 GitHub followers None Needed 😓
Replicate 2019 < 1 min 🔴 GitHub followers Guide 🟡 🤷
SageMaker (Serverless) 2017 N/A 🔴 GitHub followers N/A 300,000s
Lambda w/ EFS (AWS) 2014 < 1 min 🔴 GitHub followers Guide
RunPod Serverless 2022 > 30s 🔴 GitHub followers N/A 🤷
BentoML 2019 > 5 min GitHub Repo stars GitHub followers Gallery 🟡 👍 🆓 $10

Note: these platforms can usually do more than LLM serving.

🧮 Serverless Compute Pricing & Limits – Lambda vs Modal (on CPU)

| Platform | 💵 Compute Unit | 📥 Per-Request Fee | 🆓 Free Tier | ⏱️ Max Timeout | 🚦 Concurrency Limit |
|---|---|---|---|---|---|
| AWS Lambda + API GW | GB-sec @ $0.000016667 | $0.20/M Lambda + $1.00/M HTTP API calls | 1M req + 400k GB-s/mo (12 mo) + 1M API calls/mo | 15 min | 1,000 per region (can request more) |
| Modal | CPU-s @ $0.0000131 + GiB-s @ $0.00000222 | ❌ No per-request fee | $30/mo compute credits (Starter) | Func: 24h ⎮ HTTP: 150s → 303 redirect | Starter: 100 containers / 200 req/s ⎮ Team: 1,000 containers |
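
To compare the two platforms for a concrete workload, the rates in the table can be plugged into a small estimator. This is a rough sketch, not an official calculator: the `Workload` fields are hypothetical, the Modal CPU rate is assumed to apply per core-second, GB and GiB are treated as interchangeable, and free tiers and credits are ignored.

```python
from dataclasses import dataclass

# Rates copied from the table above (USD).
LAMBDA_GB_SECOND = 0.000016667
LAMBDA_PER_MILLION_REQUESTS = 0.20
API_GW_PER_MILLION_CALLS = 1.00
MODAL_CPU_SECOND = 0.0000131   # assumed to be billed per core-second
MODAL_GIB_SECOND = 0.00000222


@dataclass
class Workload:
    """Hypothetical monthly workload; every field is an assumption."""
    requests: int
    seconds_per_request: float
    memory_gb: float
    cpu_cores: float = 1.0  # only used for the Modal estimate


def lambda_monthly_cost(w: Workload) -> float:
    """Compute charge (GB-seconds) plus Lambda and HTTP API per-request fees."""
    gb_seconds = w.requests * w.seconds_per_request * w.memory_gb
    compute = gb_seconds * LAMBDA_GB_SECOND
    per_request = (w.requests / 1_000_000) * (
        LAMBDA_PER_MILLION_REQUESTS + API_GW_PER_MILLION_CALLS
    )
    return compute + per_request


def modal_monthly_cost(w: Workload) -> float:
    """CPU and memory are billed separately; there is no per-request fee."""
    busy_seconds = w.requests * w.seconds_per_request
    cpu = busy_seconds * w.cpu_cores * MODAL_CPU_SECOND
    memory = busy_seconds * w.memory_gb * MODAL_GIB_SECOND
    return cpu + memory


if __name__ == "__main__":
    workload = Workload(requests=1_000_000, seconds_per_request=1.0, memory_gb=1.0)
    print(f"Lambda + API GW: ${lambda_monthly_cost(workload):.2f}")
    print(f"Modal:           ${modal_monthly_cost(workload):.2f}")
```

Both functions bill only for busy time, so idle containers, cold starts, and data transfer are outside this estimate.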

Access Off-the-Shelf OS Models (via API):

Platform/Tool Released GitHub
Together.ai N/A 🔴
Fireworks.ai N/A GitHub followers
Replicate 2019 GitHub followers
Groq N/A GitHub followers
DeepInfra N/A GitHub followers
Bedrock N/A GitHub followers
Lepton N/A GitHub followers
Fal.ai N/A GitHub followers
VertexAI N/A GitHub followers

Local Inference

Framework Browser Chat 🖥️ Organization Open Source GitHub
Llama.cpp ggerganov GitHub Repo stars GitHub followers
Ollama Ollama GitHub Repo stars GitHub followers
gpt4all Nomic.ai GitHub Repo stars GitHub followers
LMStudio LMStudio AI 🔴 GitHub followers
OpenLLM BentoML GitHub Repo stars GitHub followers

LLM Serving Frameworks

Framework Open Source GitHub
vLLM GitHub Repo stars GitHub followers
OpenLLM GitHub Repo stars GitHub followers
TGI (Text Generation Inference) GitHub Repo stars GitHub followers
TensorRT LLM GitHub Repo stars GitHub followers
Ray Serve GitHub Repo stars GitHub followers
LMDeploy GitHub Repo stars GitHub followers
Ollama GitHub Repo stars GitHub followers
MLC-LLM GitHub Repo stars GitHub followers

Building Open-Source LLM Web Chat UIs

Tool Organization Description Open Source GitHub
Text Generation WebUI oobabooga A Gradio web UI for Large Language Models. GitHub Repo stars GitHub followers
Jan AI Jan HQ An open source alternative to ChatGPT that runs 100% offline on your computer. Multiple engine support (llama.cpp, TensorRT-LLM). GitHub Repo stars GitHub followers
AnythingLLM Mintplex Labs The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more. GitHub Repo stars GitHub followers
Superagent Superagent AI Allows developers to add powerful AI assistants to their applications using LLMs and RAG. GitHub Repo stars GitHub followers
Bionic-GPT Bionic GPT A ChatGPT replacement offering generative AI advantages while maintaining strict data confidentiality. GitHub Repo stars GitHub followers
Open WebUI Open WebUI A user-friendly web interface for interacting with Large Language Models (LLMs). GitHub Repo stars GitHub followers
Xyne xynehq A sleek, minimal web chat interface for interacting with Large Language Models. GitHub Repo stars GitHub followers
Assistant UI assistant-ui An open-source ChatGPT-like interface with a clean and responsive design. GitHub Repo stars GitHub followers
Scira zaidmukaddam An AI-powered search interface that leverages LLMs for intelligent search results. GitHub Repo stars GitHub followers
Onyx onyx-dot-app A customizable and extendable web chat UI for interacting with large language models. GitHub Repo stars GitHub followers
NextChat ChatGPTNextWeb A Next.js-based, open-source ChatGPT clone for seamless web interaction. GitHub Repo stars GitHub followers

Rent GPUs (Fine-Tuning, Deploying, Training)

Platform Templates Beginner Friendly GitHub
Brev.dev Fine-tuning GitHub followers
Modal Fine-tuning GitHub followers
Hyperbolic AI None GitHub followers
RunPod None GitHub followers
Paperspace Fine-tuning GitHub followers
Colab Small models only GitHub followers

Fine-Tuning with No-Code UI

Tool Beginner Friendly Open Source GitHub
Together.ai N/A
Hugging Face AutoTrain GitHub Repo stars
AutoML GitHub Repo stars
LLaMA-Factory GitHub Repo stars
H2O LLM Studio GitHub Repo stars

Fine-Tuning Frameworks

Framework Open Source GitHub
Axolotl GitHub Repo stars GitHub followers
Unsloth GitHub Repo stars GitHub followers

OS Agentic/AI Workflow

Framework Open Source Beginner Friendly Released GitHub
LangChain GitHub Repo stars 2022 GitHub followers
LlamaIndex GitHub Repo stars 2023 GitHub followers
Swarms GitHub Repo stars 2023 GitHub followers
CrewAI GitHub Repo stars 2023 GitHub followers
Autogen GitHub Repo stars 2023 GitHub followers
AutoChain GitHub Repo stars 2023 GitHub followers
SuperAGI GitHub Repo stars 2023 GitHub followers
AILegion GitHub Repo stars 2023 GitHub followers
MemGPT (Letta) GitHub Repo stars 2023 GitHub followers
uAgents GitHub Repo stars 2023 GitHub followers
AGiXT GitHub Repo stars 2023 GitHub followers
Dify GitHub Repo stars 2024 GitHub followers
TaskingAI GitHub Repo stars 2024 GitHub followers
Bee Agent Framework GitHub Repo stars 2024 GitHub followers
Swarms GitHub Repo stars 2024 GitHub followers
IoA GitHub Repo stars 2024 GitHub followers
Upsonic GitHub Repo stars 2024 GitHub followers
Parlant GitHub Repo stars 2024 GitHub followers
Rig GitHub Repo stars 2024 GitHub followers
eliza GitHub Repo stars 2024 GitHub followers
TensorZero GitHub Repo stars 2024 GitHub followers
AgentDock GitHub Repo stars 2025 GitHub followers

Top Agentic Frameworks

Framework Open Source Beginner Friendly Released GitHub
LangGraph GitHub Repo stars 2023 GitHub followers
Flowise GitHub Repo stars 2023 GitHub followers
Langroid GitHub Repo stars 2023 GitHub followers
smolagents GitHub Repo stars 2024 GitHub followers
Semantic Kernel GitHub Repo stars 2023 GitHub followers
Atomic Agents GitHub Repo stars 2024 GitHub followers
Agno GitHub Repo stars 2024 GitHub followers
PydanticAI GitHub Repo stars 2024 GitHub followers
Mastra GitHub Repo stars 2025 GitHub followers

Agentic Frameworks: Core Capabilities

| Framework | Memory & RAG | Multimodality | Multi-agent Support | Observability |
|---|---|---|---|---|
| AgentDock | Built-in RAG system; knowledge base integration | 🟢 Multi-modal (text, voice, tools, APIs) | Visual workflow orchestration & agent chains | Comprehensive LLM traceability & credit tracking |
| Agno | Integrated memory & vector DB/RAG | 🟢 Native (text, image, audio, video) | Supervisor-worker roles | Built-in cloud dashboard/logging |
| LangGraph | Persistent state; easy external integration | 🔸 Primarily text; extendable via nodes | Hierarchical orchestration | LangSmith integration & graph editor |
| SmolAgents | Built-in short-term; custom long-term | 🔸 Vision agents via VLMs | Modular multi-agent composition | Minimal; relies on external logging |
| Mastra | Persistent workflows; native RAG pipelines | 🟢 Multi-modal via integrations | Native multi-agent workflows | Built-in OpenTelemetry dashboards |
| Pydantic AI | DI-based memory & RAG integration | 🔸 Text-first; multimodal via custom DI | Type-safe manual orchestration | Limited; Python logging/OpenTelemetry |
| Atomic Agents | Per-agent memory & RAG (vector DB) | 🟢 Native multi-modal | Explicit chaining of workflows | Minimal; external instrumentation recommended |
| Autogen | Short-term built-in; external long-term | 🔸 Text-mainly; extensible | Emergent, free-form collaboration | Moderate; internal logging, no dashboard |
| CrewAI | Stateful memory & team-based RAG | 🟢 Diverse modalities (text, image, etc.) | Supervisor-led multi-team workflows | Integrated dashboards for logging & monitoring |

See this Google Sheet for more columns.

Visual AI Agent Builders

Tool Organization Description Open Source GitHub
Rivet Ironclad A visual builder to design and deploy AI agent workflows. GitHub Repo stars GitHub followers
PySpur PySpur-Dev A tool to build and visualize AI agents seamlessly. GitHub Repo stars GitHub followers
Flowise FlowiseAI A no‑code, visual platform for designing AI agent workflows. GitHub Repo stars GitHub followers

💬 Model Call Pricing for Agent Systems (text-only: 2,000 tokens in, 100 tokens out; flat rate)

| Model | 💵 $ / Call | 💯 100 Calls | 🧮 1,000 Calls | 🔁 30,000 Calls |
|---|---|---|---|---|
| Gemini Flash 2.0 | $0.00024 | $0.02 | $0.24 | $7.20 |
| GPT-4o mini | $0.00144 | $0.14 | $1.44 | $43.20 |
| GPT-4.1 | $0.00480 | $0.48 | $4.80 | $144.00 |
| Gemini Pro 2.5 | $0.00350 | $0.35 | $3.50 | $105.00 |
| Claude Haiku 3.5 | $0.00200 | $0.20 | $2.00 | $60.00 |
| Claude Sonnet 4 | $0.00750 | $0.75 | $7.50 | $225.00 |
| GPT-4o | $0.01200 | $1.20 | $12.00 | $360.00 |
| OpenAI o3 | $0.02400 | $2.40 | $24.00 | $720.00 |
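
The volume columns are the per-call price multiplied by the number of calls, rounded to cents. A minimal sketch of that arithmetic, using three per-call prices copied from the table; the function names and the budget helper are illustrative, not part of any provider SDK:

```python
from decimal import Decimal, ROUND_HALF_UP

CENT = Decimal("0.01")

# Per-call prices (USD) copied from three rows of the table above.
PRICE_PER_CALL = {
    "GPT-4o mini": Decimal("0.00144"),
    "GPT-4.1": Decimal("0.00480"),
    "Claude Sonnet 4": Decimal("0.00750"),
}


def cost_for_calls(model: str, calls: int) -> Decimal:
    """Total cost for a number of calls, rounded to whole cents."""
    total = PRICE_PER_CALL[model] * calls
    return total.quantize(CENT, rounding=ROUND_HALF_UP)


def calls_within_budget(model: str, budget_usd: str) -> int:
    """How many whole calls fit into a budget (hypothetical helper)."""
    return int(Decimal(budget_usd) / PRICE_PER_CALL[model])


if __name__ == "__main__":
    for model in PRICE_PER_CALL:
        row = [cost_for_calls(model, n) for n in (100, 1_000, 30_000)]
        print(model, row)
```

`Decimal` is used instead of floats so that sub-cent prices multiply and round without binary floating-point drift.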

Agentic Tools (for “building”)

Tool Organization Description Open Source GitHub
browser-use browser-use Integrates browser functionalities into agentic workflows. GitHub Repo stars GitHub followers
code2prompt mufeedvh A CLI tool that converts a codebase into a single LLM prompt. GitHub Repo stars GitHub followers
note-gen codexu Automatically generates notes and documentation from your code. GitHub Repo stars GitHub followers
refly refly-ai Automates code refactoring and prompt generation tasks. GitHub Repo stars GitHub followers
potpie potpie-ai A toolkit for prototyping and building AI agent pipelines. GitHub Repo stars GitHub followers
AgentStack AgentOps-AI A comprehensive stack for constructing and deploying AI agents. GitHub Repo stars GitHub followers
browser lightpanda-io A browser‑based tool designed for integrating agentic functionalities. GitHub Repo stars GitHub followers
Memary kingjulio8238 A memory module for retaining context in agent workflows. GitHub Repo stars GitHub followers
open-canvas langchain-ai An open-source web app for collaborating with agents on documents and code, built by LangChain. GitHub Repo stars GitHub followers
agent-service-toolkit JoshuaC215 A toolkit for building and deploying agent-based services. GitHub Repo stars GitHub followers

Virtual Brains

Tool Organization Description Open Source GitHub
Leon leon-ai An open‑source personal assistant and automation platform powered by AI. GitHub Repo stars GitHub followers
Khoj khoj-ai A virtual brain for organizing and retrieving your knowledge using AI. GitHub Repo stars GitHub followers

AI Agents

Framework Organization Open Source Released GitHub
GPT Engineer GPT Engineer Org GitHub Repo stars 2023 GitHub followers
XAgent OpenBMB GitHub Repo stars 2023 GitHub followers
Bolt.new StackBlitz GitHub Repo stars 2023 GitHub followers
Goose Block GitHub Repo stars 2023 GitHub followers
AI Hedge Fund virattt GitHub Repo stars 2023 GitHub followers
FinRobot AI4Finance Foundation GitHub Repo stars 2024 GitHub followers
STORM Stanford OVAL GitHub Repo stars 2024 GitHub followers
Multion MULTI-ON 🔴 N/A GitHub followers
Minion Minion AI 🔴 N/A GitHub followers

Long-Term Memory

Provider Community Founded GitHub ⭐ Stars Open Source
Letta 💬 Active dev community Oct 2023 GitHub followers 17k ✅ Apache-2.0
Zep 🤝 Moderate community Aug 2024 GitHub followers 11.6k ⚠️ Graphiti CE (Apache-2.0)
MemoRAG 🧪 Small research group Sep 2024 GitHub followers 1.8k ✅ Apache-2.0
Memary 🧠 Niche community April 2024 GitHub followers 2.3k ✅ MIT
Cognee 🔄 Moderate Aug 2023 GitHub followers 5.8k ✅ Apache-2.0
Mem0 🚀 Fast-growing June 2023 GitHub followers 35.2k ✅ Apache-2.0

Memory Features Comparison

| Provider | Based | Optional KG | Self-Editing / Agentic | Rolling Summaries | Categories |
|---|---|---|---|---|---|
| Letta | 🧮 Vector | ⚠️ Partial | ✅ Yes | ⚠️ Partial (memory blocks) | ✅ Yes |
| Zep | 🧠 KG | - | ✅ Yes | ✅ Auto chat summarization | ✅ Yes |
| MemoRAG | 🧮 Vector | ❌ No | ✅ Yes | ❌ Uses long-range model | ❌ No |
| Memary | 🧠 KG | - | ✅ Yes | ⚠️ Plans “rewind” feature | ✅ Yes |
| Cognee | 🧠 KG | - | ✅ Yes | ❌ No auto summaries | ✅ Yes |
| Mem0 | 🧮 Vector | ❌ No | ✅ Yes | ❌ Not explicit | ✅ Yes |

Enterprise Security (Cloud-Based Use)

Provider Enterprise Security
Mem0 🔐 Hosted with encryption, org/project roles, GDPR-friendly delete. Uses Graphlit (SOC 2 not stated).
Letta ☁️ Self-hosted or managed server. User auth & ID-partitioned memory. Graphlit-based. No public SSO details.
Zep ✅ SOC 2 Type 2. Encrypted at rest/in transit, access controls, JWT, and deletion API ("Right to be Forgotten").
MemoRAG 🏠 Self-host
Memary 🏠 Self-host
Cognee 🏠 Self-host

Pricing by Monthly Message Volume

| Provider | 1K msgs/mo | 10K msgs/mo | 100K msgs/mo | 1M msgs/mo |
|---|---|---|---|---|
| Mem0 | 🆓 Free | 🆓 Free–$29 | 💵 $249 | 🏢 Enterprise (custom) |
| Letta | 🆓 Free | 💵 $20 | 💵 $750 | 🏢 Enterprise (custom) |
| Zep | 🆓 Free | 🆓 Free | 💵 ~ $112.50 | 💵 ~ $1,237 |
| MemoRAG | 💻 GPU Server (~$150–300/mo) | 💻 GPU Server (~$150–300/mo) | 💻 Multi-GPU ($500+) | 🖥️ Cluster ($1K+/mo) |
| Self-host | 🖥️ Small VM (~$15/mo) | 🖥️ Small VM (~$15–20/mo) | 🖥️ Medium VM ($50–$100/mo) | 🖥️ Large VM ($200+/mo) |

Evaluation Frameworks and add-ons

Framework Open Source Beginner Friendly Released GitHub
TruLens GitHub Repo stars 2023 GitHub followers
Promptfoo GitHub Repo stars 2023 GitHub followers
DeepEval GitHub Repo stars 2024 GitHub followers
RAGAS GitHub Repo stars 2023 GitHub followers
OpenAI Evals GitHub Repo stars 2023 GitHub followers
LangChain OpenEvals GitHub Repo stars 2025 GitHub followers
LangChain AgentEvals GitHub Repo stars 2025 GitHub followers
LlamaIndex Eval GitHub Repo stars 2023 GitHub followers

Evaluation Frameworks: Core Differences

Framework Pytest / CLI Runner Metrics Ready-made Synthetic Data Gen Offline Judge Model-Agnostic Safety Red-Team Custom Metrics (setup speed)
DeepEval 🟢 deepeval test 40 + 🟢 deepeval create-dataset 🟢 🟢 🟢 🟢 🟢 G-Eval builder — minutes (one function)
RAGAS ✖ (script asserts) 6 core RAG + 🔸 🟢 KG-based Q-gen 🟢 🟢 🔸 DIY 🟢 AspectCritic one-liner — minutes
MLflow Evaluate ✖ (mlflow.evaluate) 3-4 ✖ BYO 🔸 possible 🔸 🟢 🟢 Subclass scorer — few lines, ~hour
OpenAI Evals 🟢 CLI orchestrator ~10 templates 🔸 helper script 🟢 🟢 Full Python/YAML eval — flexible but slower

Co-Pilots

Framework Open Source GitHub
Aider GitHub Repo stars GitHub followers
Cursor GitHub Repo stars GitHub followers
Continue GitHub Repo stars GitHub followers

Voice API

Framework Open Source GitHub
VAPI.ai 🔴 GitHub followers
Bland.ai 🔴 N/A
CallAnnie 🔴 N/A
RealtimeTTS GitHub Repo stars GitHub followers
RealtimeSTT GitHub Repo stars GitHub followers
Coqui TTS GitHub Repo stars GitHub followers

Open Source TTS Models

Model License Stars/Likes Downloads (Last Month) Repository
Kokoro-82M Apache 2.0 ⭐ 3.16k (HF) 📥 557,392 Hugging Face
Zonos-v0.1-transformer Apache 2.0 ⭐ 249 (HF) 📥 24,240 Hugging Face
XTTS-v2 Non-Commercial ❤️ 368 (HF) 📥 2,545,850 Hugging Face
ChatTTS AGPL-3.0 N/A N/A GitHub
MeloTTS MIT N/A N/A GitHub

For more TTS models and rankings, check out the TTS Leaderboard.

LLM Application Frameworks

Tool Organization Description Open Source GitHub
Eino CloudWeGo A lightweight LLM application framework for scalable AI solutions. GitHub Repo stars GitHub followers
Conversation Knowledge Mining Solution Accelerator Microsoft A solution accelerator for integrating conversation intelligence and knowledge mining using LLMs. GitHub Repo stars GitHub followers
Olmocr AllenAI An OCR framework optimized for integration with language models. GitHub Repo stars GitHub followers
PDFMathTranslate Byaidu A tool for converting and translating mathematical content in PDFs using LLMs. GitHub Repo stars GitHub followers
Podcastfy souzatharsis A tool to generate podcasts from written content using LLMs. GitHub Repo stars GitHub followers
Pandas AI sinaptik-ai Brings LLM-powered analytics to pandas dataframes. GitHub Repo stars GitHub followers
Ramalama containers A tool that uses containers to simplify running and serving AI models locally. GitHub Repo stars GitHub followers
Robyn facebookexperimental An open-source Marketing Mix Modeling (MMM) package from Meta Marketing Science. GitHub Repo stars GitHub followers
ExtractThinker enoch3712 A tool for extracting and synthesizing insights from textual data using LLMs. GitHub Repo stars GitHub followers

OS RAG Frameworks

Framework Organization Open Source Released GitHub
Haystack deepset.ai GitHub Repo stars 2023 GitHub followers
RAGflow Infiniflow GitHub Repo stars 2024 GitHub followers
txtai Neuml GitHub Repo stars 2022 GitHub followers
LLM App Pathway GitHub Repo stars 2023 GitHub followers
Cognita Truefoundry GitHub Repo stars 2024 GitHub followers
R2R SciPhi AI GitHub Repo stars 2024 GitHub followers
Raptor Parth Sarthi GitHub Repo stars 2024 GitHub followers
LightRAG HKUDS GitHub Repo stars 2023 GitHub followers
PIKE-RAG Microsoft GitHub Repo stars 2024 GitHub followers
KAG OpenSPG GitHub Repo stars 2024 GitHub followers
MemoRAG qhjqhj00 GitHub Repo stars 2023 GitHub followers

See RAG_Techniques if you get stuck (not always needed)

🔍 Vector DBs – FOSS, Performance, Pricing, DevX

| Vector DB | License | ⚡ Perf / Throughput | ⏱️ Latency (Real-World) | ☁️ Cloud Pricing / Free Tier | 💻 Dev Experience |
|---|---|---|---|---|---|
| Qdrant | Apache 2.0 | 🥇 Highest RPS, lowest latency in single-node bench (≥4× vs prev. run) | p95 < 10ms for 1M vecs (1 thread) | Always-on 1GB free; pay-go ≈ $0.014/hr | REST + gRPC; 7 lang clients; filter-aware HNSW; hybrid support; Python “embedded” mode |
| Milvus / Zilliz Cloud | Apache 2.0 | 🚀 Fastest index build; RPS trails Qdrant for high-dim vecs | p95 ≈ 10–20ms @ 1M 768-dim (DiskANN, vendor data) | 5GB free; serverless $0.30/GB-mo; dedicated from $99/mo | New SDK v2 (async, schema cache); Python/Go/Java/Node support |
| Weaviate | BSD-3-Clause | ⚙️ Least bench gains, but decent recall (95%+) and throughput | “Low-ms” claimed; users report 100–300ms if misconfigured | Starts $25/mo; 14-day sandbox free | GraphQL + REST; strong SDKs (Py/TS/Go/Java); easy RAG + hybrid templates |
| pgvector | PostgreSQL License | 🔥 28× lower p95 & 16× higher QPS vs Pinecone s1 @ 99% recall (50M) | p95 < 50ms @ 50M 768-dim (Timescale test) | Neon/Supabase offer free Postgres with pgvector (0.5–1GB, ~200h CPU) | Pure SQL; supports joins + ACID; great for hybrid text + dense queries |
| Redis 8 Vector | AGPLv3 / RSAL / SSPL | 🧵 3.4× higher QPS vs Qdrant, 4× vs Milvus @ ≥0.98 recall | Sub-ms avg, <10ms under load (vendor); 9.7× lower than Aurora+pgvector | Redis Cloud: 30MB free, pay-go from $5/mo; Flex $0.007/hr | Redis Vector Library + RAG helpers; OM clients for .NET/Py/JS; fast setup |

💾 Vector DB Cloud Pricing (2000-char Chunks, ~768-dim)

| # Chunks | Data Size | 🟣 Milvus / Zilliz Cloud (Serverless) | 🟢 Qdrant Cloud | 🟡 Weaviate Cloud (“Standard”) |
|---|---|---|---|---|
| 10k | ~0.07 GB | 🆓 Free – within 5 GB tier | 🆓 Free – fits in 1 GB RAM / 4 GB disk | $25 base + $1.2 dim fee ≈ $26 |
| 100k | ~0.67 GB | 🆓 Still under 5 GB | 🆓 Fits with compression in 4 GB disk | $25 + $12.0 dim fee ≈ $37 |
| 1M | ~6.7 GB | 💵 ≈ $2 storage; add vCU fees or $99 dedicated | 💵 Needs 10 GB cluster → ≈ $20/mo | $25 + $120.5 dim fee ≈ $145 |
| 10M | ~67 GB | 💵 ≈ $20 storage; + compute: $100–150 total | 💵 Needs 64+ GB → $120–150/mo estimate | $25 + $1,204 dim fee ≈ $1,230 |

🧠 Embedding Cost – OpenAI (Small Model, per Chunk Size)

| # Chunks | 📏 1,000 Chars | 📏 2,000 Chars | 📏 3,000 Chars |
|---|---|---|---|
| 1,000 | $0.005 | $0.01 | $0.015 |
| 10,000 | $0.05 | $0.10 | $0.15 |
| 100,000 | $0.50 | $1.00 | $1.50 |
| 1,000,000 | $5.00 | $10.00 | $15.00 |
| 10,000,000 | $50.00 | $100.00 | $150.00 |
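
The rows from 10,000 chunks upward are consistent with a simple estimate: characters are converted to tokens, then tokens are priced per million. The sketch below reproduces them under two assumptions inferred from the table and not stated in it: roughly 4 characters per token, and $0.02 per million tokens. Both should be checked against current provider pricing.

```python
CHARS_PER_TOKEN = 4            # assumption: rough average for English text
USD_PER_MILLION_TOKENS = 0.02  # assumption: inferred from the table rows


def embedding_cost_usd(num_chunks: int, chars_per_chunk: int) -> float:
    """Estimate the one-off cost of embedding a corpus of equal-sized chunks."""
    tokens = num_chunks * chars_per_chunk / CHARS_PER_TOKEN
    return tokens / 1_000_000 * USD_PER_MILLION_TOKENS


if __name__ == "__main__":
    for chunks in (10_000, 100_000, 1_000_000):
        row = [embedding_cost_usd(chunks, size) for size in (1_000, 2_000, 3_000)]
        print(f"{chunks:>9,} chunks:", ", ".join(f"${c:.2f}" for c in row))
```

Re-embedding after a chunking or model change incurs the same cost again, so the estimate is per indexing pass.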

AI Tools (for “using”)

Tool Organization Description Open Source GitHub
magic-resume JOYCEQL An AI-powered tool for generating resumes. GitHub Repo stars GitHub followers
VideoCaptioner WEIFENG2333 An AI tool for automatically generating video captions. GitHub Repo stars GitHub followers
DeepSeekAI DeepLifeStudio Browser extension for invoking the DeepSeek AI large model. GitHub Repo stars GitHub followers
logocreator Nutlope A tool for creating logos using AI. GitHub Repo stars GitHub followers
blinkshot Nutlope A real-time AI image generator powered by Flux through Together AI. GitHub Repo stars GitHub followers
pollinations pollinations A tool for generating creative images and artwork using AI. GitHub Repo stars GitHub followers
PromptWizard microsoft A tool to generate, manage, and optimize prompts for AI models. GitHub Repo stars GitHub followers
Open-Interface AmberSahdev Control Any Computer Using LLMs. GitHub Repo stars GitHub followers
wut shobrook LLM for the terminal GitHub Repo stars GitHub followers

Training/Optimization

Tool Organization Description Open Source GitHub
transformerlab-app transformerlab An application for training and optimizing transformer models. GitHub Repo stars GitHub followers
fluxgym cocktailpeanut A simple web UI for training FLUX LoRA models with low VRAM. GitHub Repo stars GitHub followers
AutoGPTQ AutoGPTQ An easy-to-use LLM quantization package based on the GPTQ algorithm. GitHub Repo stars GitHub followers

AI Models

Tool Organization Description Open Source GitHub
WALDO stephansturges An AI model for visual reasoning and object detection. GitHub Repo stars GitHub followers
Janus deepseek-ai A family of unified multimodal understanding and generation models. GitHub Repo stars GitHub followers
ModernBERT AnswerDotAI A modernized version of BERT for natural language processing tasks. GitHub Repo stars GitHub followers
Magma microsoft A foundation model for multimodal AI agents. GitHub Repo stars GitHub followers
Cosmos-Nemotron NVlabs An AI model for advanced image and video processing. GitHub Repo stars GitHub followers
Paints-UNDO lllyasviel An interactive AI model for image generation and editing. GitHub Repo stars GitHub followers

Monitoring

Tool Organization Description Open Source GitHub
helicone Helicone A platform for monitoring and analyzing AI model performance. GitHub Repo stars GitHub followers
langwatch langwatch A tool for monitoring outputs and performance of language models. GitHub Repo stars GitHub followers

Infrastructure

Tool Organization Description Open Source GitHub
gpustack gpustack A toolkit for managing GPU infrastructure for AI workloads. GitHub Repo stars GitHub followers
harbor av A repository for containerized AI infrastructure management. GitHub Repo stars GitHub followers

Research Papers on Chain-of-Thought Prompting

| Publication Date | Title | 🔗 | Authors | Organization | Technique |
|---|---|---|---|---|---|
| January 28, 2022 | Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | 🔗 | Jason Wei, et al. | Google Research | CoT Prompting |
| March 21, 2022 | Self-Consistency Improves Chain of Thought Reasoning in Language Models | 🔗 | Xuezhi Wang et al. | Google Research | CoT with Self-Consistency |
| May 21, 2022 | Least-to-Most Prompting Enables Complex Reasoning in Large Language Models | 🔗 | Denny Zhou et al. | Google Research | Least-to-Most Prompting |
| May 24, 2022 | Large Language Models are Zero-Shot Reasoners | 🔗 | Takeshi Kojima, et al. | The University of Tokyo, Google Research | Zero-shot-CoT |
| October 6, 2022 | ReAct: Synergizing Reasoning and Acting in Language Models | 🔗 | Shunyu Yao et al. | Princeton University, Google Research | ReAct |
| April 11, 2023 | Teaching Large Language Models to Self-Debug | 🔗 | Xinyun Chen, et al. | Google DeepMind, UC Berkeley | Self-Debugging |
| May 6, 2023 | Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models | 🔗 | Lei Wang et al. | Singapore Management University | Plan-and-Solve Prompting |
| May 31, 2023 | Let’s Verify Step by Step | 🔗 | Hunter Lightman, et al. | OpenAI | Verification for CoT |
| October 3, 2023 | Large Language Models Cannot Self-Correct Reasoning Yet | 🔗 | Jie Huang, et al. | Google DeepMind, University of Illinois Urbana-Champaign | Self-Correction in LLMs |
| November 2023 | Universal Self-Consistency for Large Language Model Generation | 🔗 | Xinyun Chen, Renat Aksitov, Uri Alon, Jie Ren, Kefan Xiao, Pengcheng Yin, Sushant Prakash, Charles Sutton, Xuezhi Wang, Denny Zhou | Google DeepMind | Universal Self-Consistency |
| May 17, 2023 | Tree of Thoughts: Deliberate Problem Solving with Large Language Models | 🔗 | Shunyu Yao, et al. | Princeton University, Google DeepMind | Tree-of-Thought |
| February 15, 2024 | Chain-of-Thought Reasoning Without Prompting | 🔗 | Xuezhi Wang, Denny Zhou | Google DeepMind | Chain-of-Thought Decoding |
| March 21, 2024 | ChainLM: Empowering Large Language Models with Improved Chain-of-Thought Prompting | 🔗 | Xiaoxue Cheng et al. | Renmin University of China | CoTGenius |
| June 2024 | Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models | 🔗 | Andy Zhou, Kai Yan, Michal Shlapentokh-Rothman, Haohan Wang, Yu-Xiong Wang | University of Illinois Urbana-Champaign | Language Agent Tree Search (LATS) |
| May 2024 | Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | 🔗 | Yuxi Xie, et al. | National University of Singapore, Google DeepMind | MCTS |
| September 18, 2024 | To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning | 🔗 | Zayne Sprague, et al. | The University of Texas at Austin, Johns Hopkins University, Princeton University | Meta-analysis of CoT |
| September 25, 2024 | Chain of Thoughtlessness? An Analysis of CoT in Planning | 🔗 | Kaya Stechly, et al. | Arizona State University | Analysis of CoT in Planning |
| October 18, 2024 | Supervised Chain of Thought | 🔗 | Xiang Zhang, Dujian Ding | University of British Columbia | Supervised Chain of Thought |
| October 24, 2024 | A Theoretical Understanding of Chain-of-Thought: Coherent Reasoning and Error-Aware Demonstration | 🔗 | Yingqian Cui, et al. | Michigan State University, Amazon | Theoretical Analysis of CoT |
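
As a concrete illustration of one technique in this list, self-consistency samples several independent reasoning paths for the same question and returns the most frequent final answer. The sketch below shows only that voting step. `sample_answer` is a hypothetical callable standing in for an LLM call that returns the final answer of one sampled chain of thought, and `fake_sampler` is a toy replacement for demonstration.

```python
import random
from collections import Counter
from typing import Callable


def self_consistent_answer(
    sample_answer: Callable[[str], str],
    question: str,
    num_samples: int = 5,
) -> str:
    """Sample several answers and return the majority vote.

    Ties are resolved in favour of the answer that was sampled first.
    """
    answers = [sample_answer(question) for _ in range(num_samples)]
    return Counter(answers).most_common(1)[0][0]


def fake_sampler(question: str) -> str:
    """Toy stand-in for an LLM: usually right, sometimes wrong."""
    return random.choices(["42", "41"], weights=[0.8, 0.2])[0]


if __name__ == "__main__":
    print(self_consistent_answer(fake_sampler, "What is 6 * 7?", num_samples=9))
```

In practice the sampler would call a model with a non-zero temperature and parse the final answer out of each generated reasoning chain before voting.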

CoT Implementations

Implementation Link Author GitHub Stars GitHub Followers
CoT chain-of-thought-hub Franx Yao Stars Followers
CoT optillm Codelion Stars Followers
CoT auto-cot Amazon Science Stars Followers
CoT g1 BKlieger Groq Stars Followers
Decoding CoT optillm/cot_decoding.py Codelion Stars Followers
Tree of Thoughts tree-of-thought-llm Princeton NLP Stars Followers
Tree of Thoughts tree-of-thoughts Kye Gomez Stars Followers
Tree of Thoughts saplings Shobrook Stars Followers
MCTS optillm/mcts.py Codelion Stars Followers
Graph of Thoughts graph-of-thoughts SPCL Stars Followers
Other CPO SAIL SG Stars Followers
Other Everything-of-Thoughts-XoT Microsoft Stars Followers

CoT Fine-Tuned Models & Datasets

Models

Model Name Author Size Link
CoT-T5-3B KAIST AI 3B 🔗
CoT-T5-11B KAIST AI 11B 🔗
Llama-3.2V-11B-cot Xkev 11B 🔗
Llama-3.1-8B-Instruct-Reasoner-1o1_v0.3 Lyte 8B 🔗

Datasets

Dataset Name Author Data Size Likes Link
chain-of-thought-sharegpt Isaiah Bjork 7.14k rows 🌟 8 🔗
CoT-Collection KAIST AI 1.84 million rows 🌟 122 🔗
Reasoner-1o1-v0.3-HQ Lyte 370 rows 🌟 7 🔗
OpenLongCoT-Pretrain qq8933 103k rows 🌟 86 🔗

Learning Resources

Tool Organization Description Open Source GitHub
awesome-cursorrules PatrickJS A curated list of resources and guides on cursorrules. GitHub Repo stars GitHub followers
ai-engineering-hub patchy631 A hub of AI engineering learning resources, tutorials, and best practices. GitHub Repo stars GitHub followers
GenAI_Agents NirDiamant Resources and examples for building Generative AI Agents. GitHub Repo stars GitHub followers
learn-agentic-ai panaversity Learning materials for understanding and building agentic AI. GitHub Repo stars GitHub followers
awesome-generative-ai steven2358 A curated list of generative AI resources and projects. GitHub Repo stars GitHub followers
awesome-mcp-servers punkpeye A curated collection of awesome MCP servers resources. GitHub Repo stars GitHub followers
GenAI-Showcase mongodb-developer A showcase of innovative Generative AI projects. GitHub Repo stars GitHub followers
well-architected-iac-analyzer aws-samples A tool to analyze and ensure well-architected Infrastructure as Code practices. GitHub Repo stars GitHub followers
llama-cookbook meta-llama A collection of recipes and guides for working with LLaMA models. GitHub Repo stars GitHub followers
optillm codelion Resources for optimizing LLM usage and performance. GitHub Repo stars GitHub followers
cursor.directory pontusab A directory of tools and resources related to cursor-based workflows. GitHub Repo stars GitHub followers
