Fresh builds of llama.cpp with AMD ROCm™ 7 acceleration for AMDChat

llamacpp-rocm


We provide nightly builds of llama.cpp with AMD ROCm™ 7 acceleration, based on TheRock, so you always get the freshest, cutting-edge builds available. Our automated pipeline targets seamless integration with 🍋 Lemonade and similar AI applications that require high-performance GPU inference.

Important

Contribution & Support Notice: While this project currently focuses on integrating llama.cpp+ROCm in a specific production context, our broader goal is to contribute meaningfully to the llama.cpp+ROCm ecosystem. We're not set up to provide comprehensive technical support, but we welcome collaborations, idea exchanges, or contributions that help advance this space.

🎯 Supported Devices

This build specifically targets the following GPU architectures:

  • gfx1151 (STX Halo GPUs) - Ryzen AI MAX+ Pro 395
  • gfx120X (RDNA4 GPUs) - includes AMD Radeon AI PRO R9700, RX 9070 XT/GRE/9070, RX 9060 XT
  • gfx110X (RDNA3 GPUs) - includes AMD Radeon PRO W7900/W7800/W7700/V710, RX 7900 XTX/XT/GRE, RX 7800 XT, RX 7700 XT

All builds include ROCm™ 7 built-in - no separate ROCm™ installation required!

🚀 Automated Builds

Our automated GitHub Actions workflow creates nightly builds for:

  • Windows and Ubuntu operating systems
  • Multiple GPU targets: gfx1151, gfx120X, gfx110X
  • ROCm™ 7 built-in - complete runtime libraries included
GPU Target   Ubuntu                     Windows
gfx110X      Download Ubuntu gfx110X    Download Windows gfx110X
gfx1151      Download Ubuntu gfx1151    Download Windows gfx1151
gfx120X      Download Ubuntu gfx120X    Download Windows gfx120X

⚡ Ready to Run: All releases include complete ROCm™ 7 runtime libraries - just download and go!


🧪 Quick Smoketest

To verify your download is working correctly:

  1. Download the appropriate build for your GPU target from our latest releases
  2. Extract the archive to your preferred directory
  3. Test with any GGUF model from Hugging Face:
llama-server -m YOUR_GGUF_MODEL_PATH -ngl 99

💡 Tip: Use -ngl 99 to offload all layers to GPU for maximum acceleration. The exact number of layers may vary by model, but 99 ensures all available layers are offloaded.
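Before pointing llama-server at a freshly downloaded model, it can help to confirm the file is actually a GGUF file and not a truncated or HTML-error download. The sketch below checks the GGUF header, which starts with the 4-byte magic b"GGUF" followed by a little-endian uint32 format version; the path used is a hypothetical placeholder, not a file shipped in this repository.

```python
import struct

def gguf_header(path):
    """Return (is_gguf, version); version is None if the magic doesn't match."""
    with open(path, "rb") as f:
        magic = f.read(4)          # GGUF files begin with the ASCII magic "GGUF"
        if magic != b"GGUF":
            return False, None
        (version,) = struct.unpack("<I", f.read(4))  # little-endian uint32 version
        return True, version

# Example (hypothetical path):
# ok, ver = gguf_header("YOUR_GGUF_MODEL_PATH")
```

A mismatched magic usually means the download was interrupted or the link returned an error page instead of the model.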

🍋 Lemonade Integration: You can also test these builds directly with Lemonade for a seamless AI application experience (coming soon!)


📦 Dependencies

This project relies on the following external software and tools:

Core Dependencies

  • llama.cpp - Efficient, cross-platform inference engine for running GGUF models locally.
  • ROCm SDK (TheRock) - AMD’s open-source platform for GPU-accelerated computing.
  • HIP - C++ API for writing portable GPU code within the ROCm ecosystem.

Build Tools & Compilers


🏗️ Code and Artifact Structure

Note

Active Development: This project is under active development. Code and artifact structure are subject to change as we continue to improve and expand functionality.

Key Components

  • docs/ - Contains build documentation and setup guides
  • utils/ - Houses utility scripts for build automation and dependency management
  • GitHub Actions Workflows - Located in .github/workflows/ (automated build pipeline)
  • Build Artifacts - Generated during CI/CD and published as releases

The build process is primarily handled through GitHub Actions, with the repository serving as the source for automated compilation and packaging of llama.cpp with ROCm™ 7 support.


📋 Manual Build Instructions

For detailed manual build instructions, please see: docs/manual_instructions.md

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.
