documentation/docs/ai.md at a612d0e8b093d5fd2a78c219ac53a77a66282b7b · projectbluefin/documentation

title	slug
AI and Machine Learning	/ai

Methodology

Bluefin was created by engineers, but was brought to life by Jacob Schnurr and Andy Frazer. The artwork is free for you to use and will always be made by humans. It is there to remind us that open source is an ecosystem that needs to be sustained. The software we make has an effect on the world. Bluefin's AI integration will always be user controlled, with a focus on open source models and tools.

:::tip[AI is an extension of cloud native]

Bluefin's focus in AI is providing a generic API endpoint to the operating system that is controlled by the user. Just as Bluefin's operating system is built with CNCF tech like bootc and podman, this experience is powered by Agentic AI Foundation tech like goose. With a strong dash of the open source components that power RHEL Lightspeed.

:::

Bluespeed

"Bluespeed" is our collection of Bluefin's developer experience tools and support for AI development workflows. We do this via community managed set of tool recommendations and configuration. We believe that the operating system should have more API endpoints for AI.

Accelerate open standards in AI by shipping tools from the Agentic AI Foundation, CNCF, and other foundations
Make it easy to run and manage local LLM
- Model management via ramalama and Docker Model, your choice
- GPU Acceleration for both Nvidia and AMD are included out of the box and usually do not require any extra setup
"Bring your own LLM" approach, it should be easy to switch between local models and hosted ones
- Goose as the primary interface for local and hosted models
- OpenCode as coding-focused agent, available in TUI & Desktop versions
- Provide access to various AI command-line tools installable via Brew
- Highlight great AI/ML applications on FlatHub in our curated section in the App Store
Forming Bluespeed presents us great swag possibilities in the future

We work closely with the RHEL Lightspeed team by shipping their code, giving feedback, and pushing the envelope where we can.

Setup Your Local LLM

AI Lab with Podman Desktop

The AI Lab extension can be installed inside the included Podman Desktop to provide a graphical interface for managing local models:

Read AI Lab Extension documentation or visit its GitHub page for more information.

Ramalama

Install Ramalama via brew install ramalama: manage local models and is the preferred default experience. It's for people who work with local models frequently and need advanced features. It offers the ability to pull models from HuggingFace, Ollama, and any container registry. By default it pulls from ollama.com, check the Ramalama documentation for more information.

Ramalama's command line experience is similar to Podman. Bluefin sets rl as an alias for ramalama, for brevity. Examples include:

rl pull nemotron-3-nano:latest
rl run nemotron-3-nano
rl run gpt-oss:20b

You can also serve the models locally:

rl serve glm-4.7-flash

Then go to http://127.0.0.0:8080 in your browser.

Ramalama will automatically pull in anything your host needs to do the workload. The images are also stored in the same container storage as your other containers. This allows for centralized management of the models and other podman images:

❯ podman images
REPOSITORY                                 TAG         IMAGE ID      CREATED        SIZE
quay.io/ramalama/rocm                      latest      8875feffdb87  5 days ago     6.92 GB

Integrating with Existing Tools

ramalama serve will serve an OpenAI compatible endpoint at http://0.0.0.0:8080, you can use this to configure tools that do not support ramalama directly:

Other Ramalama tips

Force Vulkan instead of ROCm: ramalama serve --image quay.io/ramalama/ramalama gpt-oss:latest
Strix Halo users: ramalama serve --image docker.io/kyuz0/amd-strix-halo-toolboxes:vulkan-radv gpt-oss:latest
- Check out AMD Strix Halo Llama.cpp Toolboxes and Donato Capitella's channel for more information

Docker Model Runner

Developer Mode (ujust toggle-devmode) came with Docker Engine and Docker Model Runner, letting you pull large language models from Docker Hub and HuggingFace. Choose between llama.cpp and vLLM inference engines, with support for CUDA (NVIDIA) and Vulkan backend (AMD and Intel).

Check and test the capability:

docker model version
docker model run ai/smollm2

Pull a model to cache it locally:

docker model pull ai/devstral-small-2  # from Docker Hub
docker model pull hf.co/Qwen/Qwen3-32B # from HuggingFace

LM Studio (WIP)

brew tap ublue-os/tap
brew install ublue-os/tap/lm-studio-linux

Use with AI Command Line Tools

The following AI-focused command-line tools are available via homebrew, install individually or use this command to install them all: ujust bbrew and choose the ai menu option:

Name	Description
aichat	All-in-one AI-Powered CLI Chat & Copilot
block-goose-cli	Block Protocol AI agent CLI
claude-code	Claude coding agent with desktop integration
codex	Code editor for OpenAI's coding agent that runs in your terminal
copilot-cli	GitHub Copilot CLI for terminal assistance
crush	AI coding agent for the terminal, from charm.sh
gemini-cli	Command-line interface for Google's Gemini API
kimi-cli	CLI for Moonshot AI's Kimi models
llm	Access large language models from the command line
lm-studio	Desktop app for running local LLMs
mistral-vibe	CLI for Mistral AI models
mods	AI on the command-line, from charm.sh
opencode	AI coding agent for the terminal
qwen-code	CLI for Qwen3-Coder models
ramalama	Manage and run AI models locally with containers
whisper-cpp	High-performance inference of OpenAI's Whisper model

Use CLI Agents with Devcontainers in VS Code

Here is an example of using devcontainers to run agents inside containers for isolation:

Use with AI Desktop Apps

Alpaca

For light chatbot usage, we recommend that users install Alpaca to manage and chat with your LLM models within a native GNOME desktop application. Alpaca supports Nvidia and AMD[^1] acceleration natively.

:::tip[Only a keystroke away]

Bluefin binds Ctrl-Alt-Backspace as a quicklaunch for Alpaca automatically after you install it!

:::

Configuration

Goose Desktop (WIP)

Goose Desktop is an extensible AI agent with familiar desktop interface. Developed by Block, it was recently donated to Agentic AI Foundation (AAIF), a vendor-neutral home for open source agentic AI under the Linux Foundation umbrella. Goose let you log in to multiple providers, use your own local inference, as well as utilizing instances of CLI tools like Claude Code, Cursor Agent, Codex or Gemini CLI.

Their built-in extensions covers a wide range, from memory tools, sandboxed code execution, to documents/spreadsheets editing. You can also add external MCP extensions and agent skills, create Recipes from a session, and use Scheduler to run recipes automatically.

In the meantime, you can install Goose Desktop from our Homebrew tap:

brew tap ublue-os/tap
brew install ublue-os/tap/goose-linux

Read more information about Goose Desktop on Goose documentation page

OpenCode Desktop (WIP)

Fresh from the oven, desktop version is currently in beta
Enhanced UX for agentic coding, including built-in explore and review subagents, file preview, diff viewer, and terminal access
Focused on agentic coding experiences and easy access to switch between models from multiple providers when needed
Coming with its own server instance, but you can connect to any instances of OpenCode, including your homelab or VPS, easily.

brew tap ublue-os/experimental-tap
brew install ublue-os/experimental-tap/opencode-desktop-linux

Newelle (WIP)

Install Newelle

Automated Troubleshooting (WIP)

Bluefin ships with automated troubleshooting tools:

Work in progress

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Methodology

Bluespeed

Setup Your Local LLM

AI Lab with Podman Desktop

Ramalama

Integrating with Existing Tools

Other Ramalama tips

Docker Model Runner

LM Studio (WIP)

Use with AI Command Line Tools

Use CLI Agents with Devcontainers in VS Code

Use with AI Desktop Apps

Alpaca

Configuration

Goose Desktop (WIP)

OpenCode Desktop (WIP)

Newelle (WIP)

Automated Troubleshooting (WIP)

Uh oh!

FilesExpand file tree

ai.md

Latest commit

History

ai.md

File metadata and controls

Methodology

Bluespeed

Setup Your Local LLM

AI Lab with Podman Desktop

Ramalama

Integrating with Existing Tools

Other Ramalama tips

Docker Model Runner

LM Studio (WIP)

Use with AI Command Line Tools

Use CLI Agents with Devcontainers in VS Code

Use with AI Desktop Apps

Alpaca

Configuration

Goose Desktop (WIP)

OpenCode Desktop (WIP)

Newelle (WIP)

Automated Troubleshooting (WIP)