Add llama-pull binary for downloading models from HuggingFace and Docker Registry #37
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR adds a new
llama-pullcommand-line tool that provides a dedicated interface for downloading AI models from HuggingFace and Docker Registry repositories.Features
The new tool supports downloading models from two sources:
Implementation
The tool leverages the existing robust download infrastructure from
common/arg.cpp, ensuring:Key Benefits
Usage Examples
Downloaded models are stored in the standard llama.cpp cache directory (
~/.cache/llama.cpp/) and can be immediately used with other tools likellama-cli,llama-server, etc.Files Added
tools/llama-pull/llama-pull.cpp- Main implementationtools/llama-pull/CMakeLists.txt- Build configurationtools/llama-pull/README.md- Documentationtools/CMakeLists.txtto include the new toolThe implementation is minimal (~100 lines) and integrates seamlessly with the existing codebase.
Warning
Firewall rules blocked me from connecting to one or more addresses (expand for details)
I tried to connect to the following addresses, but was blocked by firewall rules:
huggingface.co./build/bin/llama-pull -hf bartowski/Llama-3.2-1B-Instruct-GGUF:Q4_K_M(dns block)If you need me to access, download, or install something from one of these locations, you can either:
✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.