feat(local-ai): update image docker.io/localai/localai v4.0.0 → v4.1.0 #46706

Merged
truecharts-admin merged 1 commit into master from renovate/helm-values-docker.io-localai-localai-4.1.x
Apr 3, 2026

@truecharts-admin
Contributor

This PR contains the following updates:

| Package | Update | Change |
|---|---|---|
| docker.io/localai/localai | minor | b841e2b → 736fa3f |

Warning

Some dependencies could not be looked up. Check the Dependency Dashboard for more information.

Add the preset :preserveSemverRanges to your config if you don't want to pin your dependencies.


Release Notes

mudler/LocalAI (docker.io/localai/localai)

v4.1.0

🎉 LocalAI 4.1.0 Release! 🚀

LocalAI 4.1.0 is out! 🔥

Just weeks after the landmark 4.0, we're back with another massive drop. This release turns LocalAI into a production-grade AI platform: spin up a distributed cluster with smart routing and autoscaling, lock it down with built-in auth and per-user quotas, fine-tune models without leaving the UI, and much more. If 4.0 was the foundation, 4.1 is the control tower.

| Feature | Summary |
|---|---|
| 🌐 Distributed Mode | Run LocalAI as a cluster: smart routing, node groups, drain/resume, min/max autoscaling. |
| 🔐 Users & Auth | Built-in user management with OIDC, invite mode, API keys, and admin impersonation. |
| 📊 Quota System | Per-user usage quotas with predictive analytics and breakdown dashboards. |
| 🧪 Fine-Tuning (experimental) | Fine-tune models with TRL, auto-export to GGUF, and import back, all from the UI. |
| ⚗️ Quantization (experimental) | New backend for on-the-fly model quantization. |
| 🔧 Pipeline Editor | Visual model pipeline editor in the React UI. |
| 🤖 Standalone Agents | Run agents from the CLI with local-ai agent run. |
| 🧠 Smart Inferencing | Auto inference defaults from Unsloth, tool parsing fallback, and min_p support. |
| 🎬 Media History | Browse past generated images and media in Studio pages. |

New (long version): full setup walkthrough: https://www.youtube.com/watch?v=cMVNnlqwfw4

🚀 Key Features

🌐 Distributed Mode: scaling LocalAI horizontally

Run LocalAI as a distributed cluster and let it figure out where to send your requests. No more single-node bottlenecks.

  • Smart Routing: Requests are routed to nodes ordered by available VRAM, so the GPU with the most free memory gets the job.
  • Node Groups: Pin models to specific node groups for workload isolation (e.g., "gpu-heavy" vs "cpu-light").
  • Autoscaling: Built-in min/max autoscaler with a node reconciler that manages the lifecycle automatically.
  • Drain & Resume: Gracefully drain nodes for maintenance and bring them back with a single API call.
  • Cluster Dashboard: See your entire cluster status at a glance from the home page.
  • Smart Model Transfer: Models are transferred via S3 or peer-to-peer.
distributed-mode.mp4
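
To make the routing rule above concrete, here is a minimal sketch of "pick the node with the most free VRAM, optionally within a node group, skipping drained nodes". It is not LocalAI's implementation: the `Node` fields, the `pick_node` helper, and the sample node names are all hypothetical and exist only for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    # Hypothetical node record; field names are assumptions, not LocalAI's schema.
    name: str
    group: str
    free_vram_mb: int
    draining: bool = False


def pick_node(nodes: list[Node], group: Optional[str] = None) -> Optional[Node]:
    """Return the eligible node with the most free VRAM, or None if none qualifies."""
    eligible = [
        n for n in nodes
        if not n.draining and (group is None or n.group == group)
    ]
    # max(..., default=None) avoids an exception when no node is eligible.
    return max(eligible, key=lambda n: n.free_vram_mb, default=None)


nodes = [
    Node("a", "gpu-heavy", 8000),
    Node("b", "gpu-heavy", 24000, draining=True),  # drained for maintenance
    Node("c", "cpu-light", 0),
    Node("d", "gpu-heavy", 16000),
]
```

With this sample data, `pick_node(nodes, "gpu-heavy")` selects node `d`: node `b` has more free VRAM but is draining, so it is skipped.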

🔐 Users, Authentication & Quotas

LocalAI now ships with a complete multi-user platform — perfect for teams, classrooms, or any shared deployment.

  • User Management: Create, edit, and manage users from the React UI.
  • OIDC/OAuth: Plug in your identity provider for SSO — Google, Keycloak, Authentik, you name it.
  • Invite Mode: Restrict registration to invite-only with admin approval.
  • API Keys: Per-user API key management.
  • Admin Powers: Admins can impersonate users for debugging.
  • Quota System: Set per-user usage quotas and enforce limits.
  • Usage Analytics: Predictive usage dashboard with per-user breakdown statistics.
Users and quota:
usersquota-1775167475876.mp4
Usage metrics per user:
usage.mp4
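
The per-user quota idea can be sketched as a simple check-then-record counter. This is an illustrative sketch only: the release notes do not say what unit quotas are measured in, so counting tokens is an assumption, and `QuotaTracker` and its methods are hypothetical names rather than LocalAI APIs.

```python
from collections import defaultdict
from typing import Optional


class QuotaTracker:
    """Hypothetical in-memory per-user quota; units (tokens) are an assumption."""

    def __init__(self, limits: dict[str, int]):
        self.limits = dict(limits)      # user -> maximum allowed usage
        self.used = defaultdict(int)    # user -> usage recorded so far

    def try_consume(self, user: str, tokens: int) -> bool:
        """Record usage and return True, or return False if it would exceed the limit."""
        limit = self.limits.get(user)
        if limit is not None and self.used[user] + tokens > limit:
            return False
        self.used[user] += tokens
        return True

    def remaining(self, user: str) -> Optional[int]:
        """Remaining allowance, or None for users without a configured limit."""
        limit = self.limits.get(user)
        return None if limit is None else limit - self.used[user]
```

A rejected request leaves the recorded usage unchanged, so a user who is refused a large request can still make smaller ones within the remaining allowance.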

🧪 Fine-Tuning & Quantization

No more juggling external tools. Fine-tune and quantize directly inside LocalAI.

  • Fine-Tuning with TRL (Experimental): Train LoRA adapters with Hugging Face TRL, auto-export to GGUF, and import the result straight back into LocalAI. Includes a built-in evals framework to validate your work.
  • Quantization Backend: Spin up the new quantization backend to create optimized model variants on-the-fly.
quantize-fine-tune.mp4

🎨 UI

The React UI keeps getting better. This release adds serious power-user features:

  • Model Pipeline Editor: Visually wire up model pipelines — no YAML editing required.
  • Per-Model Backend Logs: Drill into logs scoped to individual models for laser-focused debugging.
  • Media History: Studio pages now remember your past generations — images, audio, and more.
  • Searchable Model/Backend Selector: Quickly find models and backends with inline search and filtering.
  • Structured Error Toasts: Errors now link directly to traces — one click from "something broke" to "here's why."
  • Tracing Settings: Inline tracing config restored with a cleaner UI.
talk.mp4

🤖 Agents & Inference
  • Standalone Agent Mode: Run agents straight from the terminal with local-ai agent run. Supports single-turn --prompt mode and pool-based configurations from pool.json.
  • Streaming Tool Calls: Agent mode tool calls now stream in real-time, with interleaved thinking fixed.
  • Inferencing Defaults: Automatic inference parameters are sourced from Unsloth and applied to all endpoints and gallery models, so your models work better out of the box.
  • Tool Parsing Fallback: When native tool call parsing fails, an iterative fallback parser kicks in automatically.
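
The release notes do not describe how the fallback parser works, so the following is only a sketch of one plausible "strict parse first, then scan iteratively" strategy. It assumes tool calls are JSON objects with a `name` key; `parse_tool_call` is a hypothetical helper, not LocalAI's parser.

```python
import json
from typing import Optional


def parse_tool_call(text: str) -> Optional[dict]:
    """Try a strict parse first, then scan for the first embedded JSON tool call."""
    # Strict path: the whole output is a single JSON object.
    try:
        obj = json.loads(text)
        if isinstance(obj, dict) and "name" in obj:
            return obj
    except json.JSONDecodeError:
        pass

    # Fallback path: try to decode an object starting at each "{" in turn.
    decoder = json.JSONDecoder()
    pos = text.find("{")
    while pos != -1:
        try:
            obj, _end = decoder.raw_decode(text, pos)
            if isinstance(obj, dict) and "name" in obj:
                return obj
        except json.JSONDecodeError:
            pass
        pos = text.find("{", pos + 1)
    return None
```

For example, `parse_tool_call('Sure! {"name": "get_weather", "arguments": {"city": "Rome"}} done')` recovers the embedded call even though the surrounding text makes the strict parse fail.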

🛠️ Under the Hood

  • Repeated Log Merging: Noisy terminals? Repeated log lines are now collapsed automatically.
  • Jetson/Tegra GPU Detection: First-class NVIDIA Jetson/Tegra platform detection.
  • Intel SYCL Fix: Auto-disables mmap for SYCL backends to prevent crashes.
  • llama.cpp Portability: Bundled libdl, librt, libpthread for improved cross-platform support.
  • HF_ENDPOINT Mirror: Downloader now rewrites HuggingFace URIs with HF_ENDPOINT for corporate/mirror setups.
  • Transformers >5.0: Bumped to HuggingFace Transformers >5.0 with generic model loading.
  • API Improvements: Proper 404s for missing models, unescaped model names, unified inferencing paths with automatic retry on transient errors.
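
As an illustration of the HF_ENDPOINT item above, here is a minimal sketch of rewriting a Hugging Face download URL to point at a mirror. It assumes plain `https://huggingface.co/...` URLs only; `rewrite_hf_url` is a hypothetical helper and the mirror hostname in the usage note is a placeholder, so LocalAI's actual downloader may behave differently.

```python
import os
from typing import Optional
from urllib.parse import urlsplit, urlunsplit


def rewrite_hf_url(url: str, endpoint: Optional[str] = None) -> str:
    """Swap the huggingface.co host for the mirror given by HF_ENDPOINT, if set."""
    endpoint = endpoint or os.environ.get("HF_ENDPOINT")
    if not endpoint:
        return url  # no mirror configured
    parts = urlsplit(url)
    if parts.netloc != "huggingface.co":
        return url  # leave non-Hugging Face URLs untouched
    mirror = urlsplit(endpoint)
    # Keep the original path, query, and fragment; prepend any mirror path prefix.
    path = mirror.path.rstrip("/") + parts.path
    return urlunsplit((mirror.scheme, mirror.netloc, path, parts.query, parts.fragment))
```

Calling `rewrite_hf_url("https://huggingface.co/org/model/resolve/main/m.gguf", "https://hf-mirror.example")` returns the same path on the placeholder mirror host.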

🐞 Fixes & Improvements

  • Embeddings: Implemented encoding_format=base64 for the embeddings endpoint.
  • Kokoro TTS: Fixed phonemization model not downloading during installation.
  • Realtime API: Fixed Opus codec backend selection alias in development mode.
  • Gallery Filtering: Fixed exact tag matching for model gallery filters.
  • Open Responses: Fixed required ORItemParam.Arguments field being omitted; ORItemParam.Summary now always populated.
  • Tracing: Fixed settings not loading from runtime_settings.json.
  • UI: Fixed watchdog field mapping, model list refresh on deletion, backend display in model config, MCP button ordering.
  • Downloads: Fixed directory removal during fallback attempts; improved retry logic.
  • Model Paths: Fixed baseDir assignment to use ModelPath correctly.
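
For clients consuming the new encoding_format=base64 option, the sketch below shows how such a payload can be decoded. It assumes the OpenAI-style convention in which each embedding is a base64 string of packed little-endian float32 values; the helper names are hypothetical, and the assumption should be checked against the actual LocalAI response.

```python
import base64
import struct


def encode_embedding(values: list[float]) -> str:
    """Pack floats as little-endian float32 and base64-encode them (assumed wire format)."""
    raw = struct.pack(f"<{len(values)}f", *values)
    return base64.b64encode(raw).decode("ascii")


def decode_embedding(b64: str) -> list[float]:
    """Inverse of encode_embedding: base64 string -> list of float32 values."""
    raw = base64.b64decode(b64)
    count = len(raw) // 4  # 4 bytes per float32
    return list(struct.unpack(f"<{count}f", raw))
```

The base64 form is more compact on the wire than a JSON array of decimal numbers, which is the usual reason to request it for large batches.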

❤️ Thank You

LocalAI is a community-powered FOSS movement. Every star, every PR, every bug report matters.

If you believe in privacy-first, self-hosted AI:

  • Star the repo — it helps more than you think
  • 🛠️ Contribute code, docs, or feedback
  • 📣 Share with your team, your community, your world

Let's keep building the future of open AI — together. 💪


✅ Full Changelog

Full Changelog: mudler/LocalAI@v4.0.0...v4.1.0


Configuration

📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 Automerge: Enabled.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR has been generated by Renovate Bot.

@truecharts-admin added the automerge, renovate/container, type/minor, and app/local-ai labels Apr 3, 2026
@truecharts-admin requested a review from a team April 3, 2026 04:27
@truecharts-admin enabled auto-merge (squash) April 3, 2026 04:27
@truecharts-admin added the automerge label Apr 3, 2026
github-actions bot commented Apr 3, 2026

📝 Linting results:

✔️ Linting [charts/stable/local-ai]: Passed - Took 0 seconds
Total Charts Linted: 1
Total Charts Passed: 1
Total Charts Failed: 0

✅ Linting: Passed - Took 1 seconds

Member

@Crow-Control left a comment

Auto approved automated PR

@truecharts-admin merged commit a175687 into master Apr 3, 2026
20 checks passed
@truecharts-admin deleted the renovate/helm-values-docker.io-localai-localai-4.1.x branch April 3, 2026 04:31