| title | slug |
|---|---|
AI and Machine Learning |
/ai |
Bluefin was created by engineers, but was brought to life by Jacob Schnurr and Andy Frazer. The artwork is free for you to use and will always be made by humans. It is there to remind us that open source is an ecosystem that needs to be sustained. The software we make has an effect on the world. Bluefin's AI integration will always be user controlled, with a focus on open source models and tools.
:::tip[AI is an extension of cloud native]
Bluefin's focus in AI is providing a generic API endpoint to the operating system that is controlled by the user. Just as Bluefin's operating system is built with CNCF tech like bootc and podman, this experience is powered by Agentic AI Foundation tech like goose. With a strong dash of the open source components that power RHEL Lightspeed.
:::
"Bluespeed" is our collection of Bluefin's developer experience tools and support for AI development workflows. We do this via community managed set of tool recommendations and configuration. We believe that the operating system should have more API endpoints for AI.
- Accelerate open standards in AI by shipping tools from the Agentic AI Foundation, CNCF, and other foundations
- Make it easy to run and manage local LLM
- Model management via
ramalamaand Docker Model, your choice - GPU Acceleration for both Nvidia and AMD are included out of the box and usually do not require any extra setup
- Model management via
- "Bring your own LLM" approach, it should be easy to switch between local models and hosted ones
- Forming Bluespeed presents us great swag possibilities in the future
We work closely with the RHEL Lightspeed team by shipping their code, giving feedback, and pushing the envelope where we can.
The AI Lab extension can be installed inside the included Podman Desktop to provide a graphical interface for managing local models:
Read AI Lab Extension documentation or visit its GitHub page for more information.
Install Ramalama via brew install ramalama: manage local models and is the preferred default experience. It's for people who work with local models frequently and need advanced features. It offers the ability to pull models from HuggingFace, Ollama, and any container registry. By default it pulls from ollama.com, check the Ramalama documentation for more information.
Ramalama's command line experience is similar to Podman. Bluefin sets rl as an alias for ramalama, for brevity. Examples include:
rl pull nemotron-3-nano:latest
rl run nemotron-3-nano
rl run gpt-oss:20b
You can also serve the models locally:
rl serve glm-4.7-flash
Then go to http://127.0.0.0:8080 in your browser.
Ramalama will automatically pull in anything your host needs to do the workload. The images are also stored in the same container storage as your other containers. This allows for centralized management of the models and other podman images:
❯ podman images
REPOSITORY TAG IMAGE ID CREATED SIZE
quay.io/ramalama/rocm latest 8875feffdb87 5 days ago 6.92 GB
ramalama serve will serve an OpenAI compatible endpoint at http://0.0.0.0:8080, you can use this to configure tools that do not support ramalama directly:
- Force Vulkan instead of ROCm:
ramalama serve --image quay.io/ramalama/ramalama gpt-oss:latest - Strix Halo users:
ramalama serve --image docker.io/kyuz0/amd-strix-halo-toolboxes:vulkan-radv gpt-oss:latest- Check out AMD Strix Halo Llama.cpp Toolboxes and Donato Capitella's channel for more information
Developer Mode (ujust toggle-devmode) came with Docker Engine and Docker Model Runner, letting you pull large language models from Docker Hub and HuggingFace. Choose between llama.cpp and vLLM inference engines, with support for CUDA (NVIDIA) and Vulkan backend (AMD and Intel).
Check and test the capability:
docker model version
docker model run ai/smollm2
Pull a model to cache it locally:
docker model pull ai/devstral-small-2 # from Docker Hub
docker model pull hf.co/Qwen/Qwen3-32B # from HuggingFace
brew tap ublue-os/tap
brew install ublue-os/tap/lm-studio-linux
The following AI-focused command-line tools are available via homebrew, install individually or use this command to install them all: ujust bbrew and choose the ai menu option:
| Name | Description |
|---|---|
| aichat | All-in-one AI-Powered CLI Chat & Copilot |
| block-goose-cli | Block Protocol AI agent CLI |
| claude-code | Claude coding agent with desktop integration |
| codex | Code editor for OpenAI's coding agent that runs in your terminal |
| copilot-cli | GitHub Copilot CLI for terminal assistance |
| crush | AI coding agent for the terminal, from charm.sh |
| gemini-cli | Command-line interface for Google's Gemini API |
| kimi-cli | CLI for Moonshot AI's Kimi models |
| llm | Access large language models from the command line |
| lm-studio | Desktop app for running local LLMs |
| mistral-vibe | CLI for Mistral AI models |
| mods | AI on the command-line, from charm.sh |
| opencode | AI coding agent for the terminal |
| qwen-code | CLI for Qwen3-Coder models |
| ramalama | Manage and run AI models locally with containers |
| whisper-cpp | High-performance inference of OpenAI's Whisper model |
Here is an example of using devcontainers to run agents inside containers for isolation:
<iframe width="560" height="315" src="https://www.youtube.com/embed/w3kI6XlZXZQ?si=5pygGs5E_Qedf-S8" title="YouTube video player" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe>For light chatbot usage, we recommend that users install Alpaca to manage and chat with your LLM models within a native GNOME desktop application. Alpaca supports Nvidia and AMD[^1] acceleration natively.
:::tip[Only a keystroke away]
Bluefin binds Ctrl-Alt-Backspace as a quicklaunch for Alpaca automatically after you install it!
:::
Goose Desktop is an extensible AI agent with familiar desktop interface. Developed by Block, it was recently donated to Agentic AI Foundation (AAIF), a vendor-neutral home for open source agentic AI under the Linux Foundation umbrella. Goose let you log in to multiple providers, use your own local inference, as well as utilizing instances of CLI tools like Claude Code, Cursor Agent, Codex or Gemini CLI.
Their built-in extensions covers a wide range, from memory tools, sandboxed code execution, to documents/spreadsheets editing. You can also add external MCP extensions and agent skills, create Recipes from a session, and use Scheduler to run recipes automatically.
In the meantime, you can install Goose Desktop from our Homebrew tap:
brew tap ublue-os/tap
brew install ublue-os/tap/goose-linux
Read more information about Goose Desktop on Goose documentation page
- Fresh from the oven, desktop version is currently in beta
- Enhanced UX for agentic coding, including built-in explore and review subagents, file preview, diff viewer, and terminal access
- Focused on agentic coding experiences and easy access to switch between models from multiple providers when needed
- Coming with its own server instance, but you can connect to any instances of OpenCode, including your homelab or VPS, easily.
brew tap ublue-os/experimental-tap
brew install ublue-os/experimental-tap/opencode-desktop-linux
Bluefin ships with automated troubleshooting tools:



