Open-source Large Language Model (LLM) driven autonomous agents that can automatically solve various tasks.
- AutoGPT - AutoGPT is the vision of the power of AI accessible to everyone, to use and to build on.
- gpt-engineer - Specify what you want it to build, the AI asks for clarification, and then builds it.
- gpt-researcher - GPT-based autonomous agent that does comprehensive online research on any given topic.
- JARVIS - A system to connect LLMs with the ML community.
- babyagi - An example of an AI-powered task management system.
- AgentGPT - 🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
- OpenDevin - a platform for autonomous software engineers, powered by AI and LLMs.
- XAgent - An Autonomous LLM Agent for Complex Task Solving
- ShortGPT - 🚀🎬Experimental AI framework for automated short/video content creation.
- KwaiAgents - A generalized information-seeking agent system with Large Language Models (LLMs).
- ProAgent - An LLM-based Agent for the New Automation Paradigm - Agentic Process Automation
- Agent-E - Agent-E is an agent-based system that aims to automate actions on the user's computer. At the moment it focuses on automation within the browser. The system is based on the AutoGen agent framework.
- Wordware - A web-hosted IDE where non-technical domain experts work with AI Engineers to build task-specific AI agents. It approaches prompting as a new programming language rather than low/no-code blocks.
- GenAgent - Build Collaborative AI Systems with Automated Workflow Generation - Case Studies on ComfyUI
- MLE-agent - Your intelligent companion for seamless AI engineering and research. 🔍 Integrates with arXiv and Papers with Code to provide better code/research plans. 🧰 Supports OpenAI, Anthropic, Ollama, and others. 🎆 Code RAG.
- Notte - Notte is the fastest, most reliable framework for Browser Using Agents.
- OpenLens AI - Fully Autonomous Research Agent for Health / Medicine
- DeepAnalyze - Agentic LLM that autonomously completes the full data science pipeline from preparation to analyst-grade reports.
- KodeAgent - The Minimal Agent Engine, enabling seamless integration with your platform. KodeAgent offers tool-calling (ReAct) and sandboxed code-executing (CodeAct) agents, supported by planning and observation.
Open-source Large Language Model (LLM) driven multi-agent systems that can automatically solve various tasks.
- MetaGPT - 🌟 The Multi-Agent Framework: Given a one-line requirement, return PRD, Design, Tasks, Repo
- ChatDev - Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
- DevOpsGPT - Multi-agent system for AI-driven software development.
- GenoMAS - Multi-agent framework for robust automation of scientific analysis workflows, such as gene expression analysis.
- Giselle - Giselle is an agentic workflow builder that empowers you to create AI-driven solutions with ease.
- EvoAgentX - EvoAgentX is building a self-evolving ecosystem of AI agents and provides an automated framework for evaluating and evolving agentic workflows.
- RadOps - RadOps is an AI-powered, multi-agent platform that automates DevOps workflows with human-level reasoning.
- Hivemoot - Framework for AI agent teams that build real software on GitHub: agents get roles, propose features, vote, review code, and ship autonomously. Colony is the first project built this way.
- Orchard Kit - Six zero-dependency Python modules for autonomous agent governance and cognitive architecture: runtime security, confabulation detection, self-audit, agent discovery, cognitive architecture, and collective cognition.
Exploring endless possibilities with open-source agent social simulation.
- generative_agents - Interactive Simulacra of Human Behavior
- camel - 🐫 Communicative Agents for “Mind” Exploration of Large Language Model Society (NeurIPS 2023)
- ai-town - deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
- GPTTeam - The main objective of this project is to explore the potential of GPT models in enhancing multi-agent productivity and effective communication.
- ChatArena - ChatArena is a library that provides multi-agent language game environments and facilitates research about autonomous LLM agents and their social interactions.
- Camel-AutoGPT - Watch two agents 🤝 collaborate and solve tasks together, unlocking endless possibilities in #ConversationalAI, 🎮 gaming, 📚 education, and more! 🔥
- Cache-to-Cache - Direct semantic communication between LLMs via KV-cache fusion, removing token-by-token latency for multi-agent collaboration.
- mem0 - Mem0 provides a smart, self-improving memory layer for Large Language Models, enabling personalized AI experiences across applications.
- musecl-memory - Zero-dependency file-based memory sync for AI agents using bash, git, and markdown. Lightweight alternative to vector DBs for agent persistence.
- composio - Composio equips agents with well-crafted tools empowering them to tackle complex tasks
- Agentic Radar - Open-source CLI security scanner for agentic workflows. Scans your workflow’s source code, detects vulnerabilities, and generates an interactive visualization along with a detailed security report.
- agentlego - Enhance LLM agents with versatile tool APIs
- Metorial - Integration gateway that links AI agents to 600+ tools via unified MCP/OAuth interface with built-in scaling and monitoring.
- APort Agent Guardrails - Pre-action authorization for OpenClaw and agent frameworks. `before_tool_call` plugin, 40+ blocked patterns, local or API. Setup: `npx @aporthq/agent-guardrails`
- Agent OS - A kernel architecture for governing autonomous AI agents. Intercepts actions mid-execution with deterministic policy enforcement, POSIX-inspired primitives, and MCP server for Claude Desktop.
- AgentGuard - Lightweight observability and runtime guardrails for AI agents — loop detection, budget enforcement, cost tracking, and deterministic replay. Zero dependencies, LangChain integration.
Quickly build and customize agents.
- langchain - ⚡ Building applications with LLMs through composability ⚡
- awesome-langchain - 😎 Awesome list of tools and projects with the awesome LangChain framework
- llama_index - LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
- crewAI - Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
- PraisonAI - Production-ready Multi-AI Agents framework with self-reflection. Fastest agent instantiation (3.77μs), 100+ LLM support, MCP integration, agentic workflows (route/parallel/loop/repeat), built-in memory, and both Python & JavaScript SDKs.
- agents - An Open-source Framework for Autonomous Language Agents
- AutoGen - AutoGen is a framework that enables the development of LLM applications using multiple agents that can converse with each other to solve tasks.
- TaskWeaver - A code-first agent framework for seamlessly planning and executing data analytics tasks.
- AgentVerse - AgentVerse is designed to facilitate the deployment of multiple LLM-based agents in various applications. AgentVerse primarily provides two frameworks: task-solving and simulation.
- AgentFlow - Trainable multi-agent framework coordinating planner, executor, verifier, generator via in-the-flow optimization.
- SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
- Swarms - Enterprise-grade multi-agent framework for orchestrating intelligent AI agents at scale. Designed for production environments with hierarchical swarms, parallel processing, and robust infrastructure.
- AutoChain - Build lightweight, extensible, and testable LLM Agents
- modelscope-agent - An agent framework connecting models in ModelScope with the world
- notte - 🔥 Reliable Browser AI agents framework for building and deploying web automation agents with hybrid workflows combining AI and traditional scripting
- AppAgent - A novel LLM-based multimodal agent framework designed to operate smartphone applications.
- superagent - 🥷 The open framework for building AI Assistants
- Voice Lab - A comprehensive testing and evaluation framework for voice agents across language models, prompts, and agent personas.
- AgentSquare - Automatic LLM Agent Search In Modular Design Space
- MixedVoices - An Open source tool for analyzing and evaluating AI Voice agents. Track and visualize performance through call analysis and flow charts. Run complex simulations before pushing to production.
- KaibanJS - KaibanJS is a JavaScript-native framework for building and managing multi-agent systems with a Kanban-inspired approach.
- Upsonic - Upsonic is a reliable agent framework supporting MCP, offering trusted agent workflows with verification layers.
- Mastra - Mastra is an opinionated TypeScript framework that helps you build AI applications and features quickly.
- LLMling-Agent - Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends with human-in-the-loop integration
- Hector - Pure A2A-Native Declarative AI Agent Platform
- VoltAgent - An open source TypeScript Framework for building AI agents with built-in LLM observability.
- pydantic-collab - A multi-agent framework powered by Pydantic-AI, enabling collaboration via handoffs and consultations. Supports pre-built and custom agent topologies, shared memory, and Logfire observability.
- alive - Minimal autonomous AI agent framework in a single Python file. Production-hardened through 80+ sessions of real autonomous operation with persistent memory, adaptive wake intervals, circuit breakers, and graceful degradation.
Benchmarks to evaluate LLM-as-Agent across a variety of environments.
- AgentBench - A Comprehensive Benchmark to Evaluate LLMs as Agents
- agentops - Python SDK for agent monitoring, LLM cost tracking, benchmarking, and more. Integrates with most LLMs and agent frameworks like CrewAI, LangChain, and AutoGen.
- langtrace - Langtrace 🔍 is an open-source, OpenTelemetry-based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations, and metrics for popular LLMs, LLM frameworks, vector DBs, and more. Integrate using TypeScript or Python.
- LiveMCP-101 - Benchmark of 101 real-world MCP tool-use queries with plan-based evaluation highlighting agent orchestration gaps.
- ToolBench - An open platform for training, serving, and evaluating large language models for tool learning.
- GenoTEX - A benchmark for evaluating LLM agents on end-to-end gene expression data analysis, featuring comprehensive gene-trait association analysis with expert-curated annotations.
- LLM-Agent-Benchmark-List - A benchmark list for evaluation of large language models.
- agbenchmark - by AutoGPT
- open-operator-evals - An open-source, reproducible set of evals for web-browser-using agents
Platforms that connect LLMs with the real world.
- Agentfield - An open source Kubernetes-style control plane for deploying AI agents as distributed microservices, with built-in service discovery, durable workflows, and observability.
- OpenAgents - An Open Platform for Language Agents in the Wild
- OpenAGI - "May the Force be with LLM and Domain Experts."
- RestGPT - An LLM-based autonomous agent controlling real-world applications via RESTful APIs
- AGiXT - AGiXT is a dynamic AI Agent Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse AI providers.
- UFO - A UI-Focused Agent for Windows OS Interaction.
- aiXiv - An open access platform for AI-generated scientific research with AI and human peer review.
- aiXplain - AI platform SDK providing access to 35,000+ AI models, benchmarking tools, pipeline design, and agent building capabilities
- BidClub - AI-native investment community where agents and humans share research as equals. Agents register via REST API, get claimed by humans, and participate with skills, webhooks, and heartbeat protocol.
- Crewship - The developer-first platform for running AI agent workflows. Deploy your agents, crews, and workflows with a single command and watch them execute in real-time.
- Pinchwork - Open-source agent-to-agent task marketplace where agents delegate tasks, pick up work, and earn credits. REST API, Python SDK, LangChain/CrewAI/MCP integrations.
- Taskade Genesis - AI-powered platform for building custom AI agents, workflows, and apps using natural language.
- Yoyo - The first social network for AI agents. Connect any AI agent via MCP to post, chat, follow other agents, discover experts, and build reputation. 10 MCP tools, open source.
- A Survey on Large Language Model based Autonomous Agents
- The Rise and Potential of Large Language Model Based Agents: A Survey
- LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey
- Awesome-Papers-Autonomous-Agent - A collection of recent papers on building autonomous agents. Two topics are covered: RL-based and LLM-based agents.
- Awesome-AgenticLLM-RL-Papers - A comprehensive survey and paper collection on agentic reinforcement learning for LLMs, covering planning, tool use, memory, reasoning, and self-improvement.
- LLM-Agents-Papers - A repo listing papers related to LLM-based agents
- LLM-Agent-Paper-Digest - Papers related to LLM agents published at top conferences
- awesome-language-agents - List of language agents based on paper "Cognitive Architectures for Language Agents"
- LLMAgentPapers - Must-read Papers on LLM Agents.
- LLM Powered Autonomous Agents - Amazing blog by Lilian Weng (OpenAI), Jun 23, 2023.
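- LLM Agent Technology from First Principles (in Chinese)
- AI Agents Based on Large Language Models (in Chinese)
- Latest Research Progress on Large Language Model Agents at ICLR'24 | Focus on Agent Evaluation (in Chinese)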
- awesome-llm-powered-agent - Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...
- awesome-agents - 🤖 Awesome list of AI Agents
- awesome-ai-agents - A list of AI autonomous agents
- awesome-ai-agents - Awesome list of 100+ agentic AI resources
- Best-AI-Agents - A list of top AI agents
- ai-agent-roadmap - Explore the latest AI Agent Framework!
- Projects inspired by babyagi




