Replace custom OCI implementation with go-containerregistry library #40

Copilot · 2025-10-19T13:00:52Z

This PR replaces the custom OCI/Docker registry implementation in llama.cpp with the industry-standard go-containerregistry library from Google, as requested in the issue.

Background

Previously, llama.cpp had a custom implementation for pulling models from Docker registries (~120 lines of C++ code in common/arg.cpp). This implementation had limitations:

Manual HTTP request handling for registry authentication
Limited authentication support
Higher maintenance burden for OCI protocol changes

Changes

New Go Module Integration (`oci-go/`)

Created a Go module that wraps go-containerregistry with a C-compatible interface using CGO:

oci.go - Core implementation with exported PullOCIModel() function
go.mod/go.sum - Dependencies including go-containerregistry v0.20.6
Statically compiled into liboci.a for cross-platform support

C++ Wrapper Layer (`common/oci.h`, `common/oci.cpp`)

Added a clean C++ interface to the Go library:

struct oci_pull_result {
    std::string local_path;
    std::string digest;
    int error_code;
    std::string error_message;
};

oci_pull_result oci_pull_model(const std::string & imageRef, 
                                const std::string & cacheDir);

Updated Docker Registry Resolution (`common/arg.cpp`)

Simplified the common_docker_resolve_model() function from ~120 lines to ~30 lines:

// Before: Manual HTTP requests, token management, manifest parsing
// After: Single call to oci_pull_model()
auto result = oci_pull_model(image_ref, cache_dir);

Build System Integration (`common/CMakeLists.txt`)

Auto-detects Go compiler at build time
Builds Go library as static archive
Links statically with common library
Graceful fallback if Go is unavailable (with warning)

Documentation (`docs/oci-registry.md`)

Added comprehensive documentation covering:

Usage examples for public and private registries
Authentication via docker login
Custom registry support
Troubleshooting guide

Benefits

✅ Proper Docker Login Support: Uses Docker's credential helpers, supporting all authentication methods including credential stores and helpers
✅ Industry Standard: Leverages Google's well-maintained library used by many container tools
✅ Better OCI Compliance: Handles edge cases and protocol changes automatically
✅ Reduced Maintenance: ~90 fewer lines of custom networking code to maintain
✅ Security: No custom HTTP/auth code to audit

Usage

The --docker-repo flag works exactly as before:

# Public model
./llama-cli --docker-repo ai/smollm2:135M-Q4_0

# Private model (after docker login)
docker login
./llama-cli --docker-repo myregistry.com/private/model:tag

Compatibility

Backward Compatible: Existing --docker-repo usage unchanged
Build Requirements: Go 1.24+ required for building (produces statically linked binary)
Runtime: No Go runtime dependency (statically linked)
Platforms: Linux, macOS, Windows (wherever Go can build C archives)

Testing

✅ Full project builds successfully
✅ 93% test pass rate (failures are pre-existing network-related issues)
✅ CodeQL security scan: No vulnerabilities
✅ Code style checks pass

Technical Notes

The Go library is statically linked into the final binaries, so users don't need Go installed to run llama.cpp. Go is only required at build time to compile the OCI module.

The implementation uses authn.DefaultKeychain which automatically discovers Docker credentials from:

~/.docker/config.json
Docker credential helpers
Cloud provider metadata services (for cloud environments)

This ensures seamless compatibility with docker login and other standard Docker authentication methods.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

ggml.ai
- Triggering command: ./build/bin/test-arg-parser (dns block)
huggingface.co
- Triggering command: /usr/lib/git-core/git-remote-https origin REDACTED (dns block)
- Triggering command: /home/REDACTED/work/llama.cpp/llama.cpp/build/bin/test-thread-safety -hf ggml-org/models -hff tinyllamas/stories15M-q4_0.gguf -ngl 99 -p The meaning of life is -n 128 -c 256 -ub 32 -np 4 -t 2 (dns block)
- Triggering command: /home/REDACTED/work/llama.cpp/llama.cpp/build/bin/llama-eval-callback --hf-repo ggml-org/models --hf-file tinyllamas/stories260K.gguf --model stories260K.gguf --prompt hello --seed 42 -ngl 0 (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

Original prompt

Instead of a custom OCI implementation integrate this golang library into llama.cpp statically linking all the golang requried for all platforms. We want to support "docker login" authenticated pulls. Use this library:

@google/go-containerregistry

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: ericcurtin <[email protected]>

Initial plan

ca601ac

Copilot AI assigned Copilot and ericcurtin Oct 19, 2025

Copilot started work on behalf of ericcurtin October 19, 2025 13:03 View session

Copilot AI and others added 4 commits October 19, 2025 13:18

Add Go OCI library integration with go-containerregistry

8df6bd4

Co-authored-by: ericcurtin <[email protected]>

Add OCI registry documentation and fix null pointer handling

7a0fad6

Co-authored-by: ericcurtin <[email protected]>

Fix redundant null check in Go error handling

caf8e43

Co-authored-by: ericcurtin <[email protected]>

Add generated OCI header to .gitignore

0a88243

Co-authored-by: ericcurtin <[email protected]>

Copilot AI changed the title ~~[WIP] Integrate golang library for docker login support~~ Replace custom OCI implementation with go-containerregistry library Oct 19, 2025

Copilot AI requested a review from ericcurtin October 19, 2025 13:31

Copilot finished work on behalf of ericcurtin October 19, 2025 13:31

github-actions bot added the documentation Improvements or additions to documentation label Oct 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Replace custom OCI implementation with go-containerregistry library #40

Replace custom OCI implementation with go-containerregistry library #40

Copilot AI commented Oct 19, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Replace custom OCI implementation with go-containerregistry library #40

Are you sure you want to change the base?

Replace custom OCI implementation with go-containerregistry library #40

Conversation

Copilot AI commented Oct 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Background

Changes

New Go Module Integration (oci-go/)

C++ Wrapper Layer (common/oci.h, common/oci.cpp)

Updated Docker Registry Resolution (common/arg.cpp)

Build System Integration (common/CMakeLists.txt)

Documentation (docs/oci-registry.md)

Benefits

Usage

Compatibility

Testing

Technical Notes

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Copilot AI commented Oct 19, 2025 •

edited

Loading

New Go Module Integration (`oci-go/`)

C++ Wrapper Layer (`common/oci.h`, `common/oci.cpp`)

Updated Docker Registry Resolution (`common/arg.cpp`)

Build System Integration (`common/CMakeLists.txt`)

Documentation (`docs/oci-registry.md`)