v3.7.0 #6974
mudler
announced in
Announcements
v3.7.0
#6974
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
🚀 LocalAI 3.7.0
Welcome to LocalAI 3.7.0 👋
This release introduces Agentic MCP support with full WebUI integration, a brand-new neutts TTS backend, fuzzy model search, long-form TTS chunking for chatterbox, and a complete WebUI overhaul.
We’ve also fixed critical bugs, improved stability, and enhanced compatibility with OpenAI’s APIs.
📌 TL;DR – What’s New in LocalAI 3.7.0
/v1/videosendpoint for text-to-video generation.illegal instructioncrashes.gemafindsgemma).🔥 What’s New in Detail
🤖 Agentic MCP Support – Build Intelligent, Tool-Using AI Agents
We're proud to announce full Agentic MCP support a feature for building AI agents that can reason, plan, and execute actions using external tools like web search, code execution, and data retrieval. You can use standard
chat/completionsendpoint, but powered by an agent in the background.Full documentation is available here
✨ Key Features:
POST /mcp/v1/chat/completions(OpenAI-compatible).agentconfig:max_attempts: Retry failed tool calls up to N times.max_iterations: Limit how many times the agent can loop through reasoning.enable_reasoning: Allow step-by-step thought processes (e.g., chain-of-thought).enable_re_evaluation: Re-analyze decisions when tool results are ambiguous.🖼️ WebUI enhancements
WebUI had a major overhaul:
mcpsettings enabled in the model config file.🎙️ Introducing neutts TTS Backend – Natural Speech, Low Latency
Say hello to neutts a new, lightweight TTS backend powered by Neuphonic, delivering high-quality, natural-sounding speech with minimal overhead.
🎛️ Setup Example
🐍 Whisper.cpp enhancements
whisper.cppCPU variants are now available for:avxavx2avx512fallback(no optimized instructions available)These variants are optimized for specific instruction sets and reduce crashes on older or non-AVX CPUs.
🔍 Smarter Gallery Search: Fuzzy & Case-Insensitive Matching
Searching for
gemmanow findsgemma-3,gemma2, etc. — even with typos likegemaaorgema.🧩 Improved Tool & Schema Handling – No More Crashes
We’ve fixed multiple edge cases that caused crashes or silent failures in tool usage.
✅ Fixes:
"type": ["string", "null"]now works without panics.parametersnow handled gracefully.strict_mode: true, the model must pick a tool — no more skipping.["string", "null"]in function definitions.📸 New Video Creation Endpoint: OpenAI-Compatible
LocalAI now supports OpenAI’s
/v1/videosendpoint for generating videos from text prompts.📌 Usage Example:
🧠 Qwen 3 VL in llama.cpp
Support has been added for Qwen 3 VL in llama.cpp. We have updated llama.cpp to latest! As a reminder, Qwen 3 VL and multimodal models are also compatible with our vLLM and MLX backends. Qwen 3 VL models are already available in the model gallery:
qwen3-vl-30b-a3b-instructqwen3-vl-30b-a3b-thinkingqwen3-vl-4b-instructqwen3-vl-32b-instructqwen3-vl-4b-thinkingqwen3-vl-2b-thinkingqwen3-vl-2b-instruct🚀 (CI) Gallery Updater Agent: Auto-Detect & Suggest New Models
We’ve added an autonomous CI agent that scans Hugging Face daily for new models and opens PRs to update the gallery.
✨ How It Works:
name,family, andusecases🔧 Critical Bug Fixes & Stability Improvements
can't evaluate field Name in type stringerrorsession.createdevent; removed redundantconversation.createdmessagefieldkokoroTTS🚀 The Complete Local Stack for Privacy-First AI
LocalAI
The free, Open Source OpenAI alternative. Acts as a drop-in replacement REST API compatible with OpenAI specifications for local AI inferencing. No GPU required.
Link: https://github.com/mudler/LocalAI
LocalAGI
A powerful Local AI agent management platform. Serves as a drop-in replacement for OpenAI's Responses API, supercharged with advanced agentic capabilities and a no-code UI.
Link: https://github.com/mudler/LocalAGI
LocalRecall
A RESTful API and knowledge base management system providing persistent memory and storage capabilities for AI agents. Designed to work alongside LocalAI and LocalAGI.
Link: https://github.com/mudler/LocalRecall
❤️ Thank You!
A huge THANK YOU to our growing community! With over 35,000 stars, LocalAI is a true FOSS movement — built by people, for people, with no corporate backing.
If you love privacy-first AI and open source, please:
Your support keeps this stack alive and evolving!
✅ Full Changelog
📋 Click to expand full changelog
What's Changed
Bug fixes 🐛
Exciting New Features 🎉
🧠 Models
📖 Documentation and examples
👒 Dependencies
Other Changes
128d522c04286e019666bd6ee4d18e3fbf8772e2by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to128d522c04286e019666bd6ee4d18e3fbf8772e2#637986df2c9ae4f2f1ee63d2558a9dc797b98524639bby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to86df2c9ae4f2f1ee63d2558a9dc797b98524639b#6382ca71fb9b368e3db96e028f80c4c9df6b6b370eddby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp toca71fb9b368e3db96e028f80c4c9df6b6b370edd#63853df2244df40c67dfd6ad548b40ccc507a066af2bby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to3df2244df40c67dfd6ad548b40ccc507a066af2b#6401c8223a8548ad64435266e551385fc51aca9ee8abby @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp toc8223a8548ad64435266e551385fc51aca9ee8ab#6402aeaf8a36f06b5810f5ae4bbefe26edb33925cf5eby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp toaeaf8a36f06b5810f5ae4bbefe26edb33925cf5e#64089d0882840e6c3fb62965d03af0e22880ea90e012by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to9d0882840e6c3fb62965d03af0e22880ea90e012#64108877dfc11a9322ce1990958494cf2e41c54657ebby @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp to8877dfc11a9322ce1990958494cf2e41c54657eb#641198930fded1c06e601a38903607af262f04893880by @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp to98930fded1c06e601a38903607af262f04893880#642085871a946971955c635f56bca24ea2a37fed6324by @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp to85871a946971955c635f56bca24ea2a37fed6324#6435e60f01d941bc5b7fae62dd57fee4cec76ec0ea6eby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp toe60f01d941bc5b7fae62dd57fee4cec76ec0ea6e#643411f0af5504252e453d57406a935480c909e3ff37by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to11f0af5504252e453d57406a935480c909e3ff37#6437a91dd3be72f70dd1b3cb6e252f35fa17b93f596cby @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp toa91dd3be72f70dd1b3cb6e252f35fa17b93f596c#6439a31cf36ad946a13b3a646bf0dadf2a481e89f944by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp toa31cf36ad946a13b3a646bf0dadf2a481e89f944#6440e60f241eacec42d3bd7c9edd37d236ebf35132a8by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp toe60f241eacec42d3bd7c9edd37d236ebf35132a8#6452fa882fd2b1bcb663de23af06fdc391489d05b007by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp tofa882fd2b1bcb663de23af06fdc391489d05b007#64544979e04f5dcaccb36057e059bbaed8a2f5288315by @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp to4979e04f5dcaccb36057e059bbaed8a2f5288315#6462466c1911ab736f0b7366127edee99f8ee5687417by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to466c1911ab736f0b7366127edee99f8ee5687417#64631bb4f43380944e94c9a86e305789ba103f5e62bdby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to1bb4f43380944e94c9a86e305789ba103f5e62bd#648866b0dbcb2d462e7b70ba5a69ee8c3899ac2efb1cby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to66b0dbcb2d462e7b70ba5a69ee8c3899ac2efb1c#6520ee09828cb057460b369576410601a3a09279e23cby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp toee09828cb057460b369576410601a3a09279e23c#6550cec5edbcaec69bbf6d5851cabce4ac148be41701by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp tocec5edbcaec69bbf6d5851cabce4ac148be41701#657684bf3c677857279037adf67cdcfd89eaa4ca9281by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to84bf3c677857279037adf67cdcfd89eaa4ca9281#662123c19308d8a5786c65effa4570204a881660ff31by @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp to23c19308d8a5786c65effa4570204a881660ff31#662203792ad93609fc67e41041c6347d9aa14e5e0d74by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to03792ad93609fc67e41041c6347d9aa14e5e0d74#6651a2e0088d9242bd9e57f8b852b05a6e47843b5a45by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp toa2e0088d9242bd9e57f8b852b05a6e47843b5a45#6676322c2adb753a9506f0becee134a7f75e2a6b5687by @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp to322c2adb753a9506f0becee134a7f75e2a6b5687#66770bf47a1dbba4d36f2aff4e8c34b06210ba34e688by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to0bf47a1dbba4d36f2aff4e8c34b06210ba34e688#670355945d2ef51b93821d4b6f4a9b994393344a90dbby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to55945d2ef51b93821d4b6f4a9b994393344a90db#67295d195f17bc60eacc15cfb929f9403cf29ccdf419by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to5d195f17bc60eacc15cfb929f9403cf29ccdf419#6757bbac6a26b2bd7f7c1f0831cb1e7b52734c66673bby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp tobbac6a26b2bd7f7c1f0831cb1e7b52734c66673b#6783f16c12f3f55f5bd3d6ac8cf2f31ab90a42c884d5by @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp tof16c12f3f55f5bd3d6ac8cf2f31ab90a42c884d5#68355a4ff43e7dd049e35942bc3d12361dab2f155544by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to5a4ff43e7dd049e35942bc3d12361dab2f155544#6841c62adfbd1ecdaea9e295c72d672992514a2d887cby @localai-bot in chore: ⬆️ Update ggml-org/whisper.cpp toc62adfbd1ecdaea9e295c72d672992514a2d887c#6868851553ea6b24cb39fd5fd188b437d777cb411de8by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to851553ea6b24cb39fd5fd188b437d777cb411de8#68693464bdac37027c5e9661621fc75ffcef3c19c6efby @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to3464bdac37027c5e9661621fc75ffcef3c19c6ef#689616724b5b6836a2d4b8936a5824d2ff27c52b4517by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to16724b5b6836a2d4b8936a5824d2ff27c52b4517#69254146d6a1a6228711a487a1e3e9ddd120f8d027d7by @localai-bot in chore: ⬆️ Update ggml-org/llama.cpp to4146d6a1a6228711a487a1e3e9ddd120f8d027d7#6945New Contributors
Full Changelog: v3.6.0...v3.7.0
This discussion was created from the release v3.7.0.
Beta Was this translation helpful? Give feedback.
All reactions