Commit 94bf3c6

Merge pull request #28 from mikepapadim/docs/add_docker_readme
Add a Docker section to README with available images and usage examples
2 parents 079df9a + 7ddbd60 commit 94bf3c6

1 file changed, +28 -0 lines changed

README.md

Lines changed: 28 additions & 0 deletions
@@ -3,6 +3,8 @@
 ![TornadoVM](https://img.shields.io/badge/TornadoVM-enabled-green?style=for-the-badge&logo=apache)
 ![OpenCL](https://img.shields.io/badge/OpenCL-supported-blue?style=for-the-badge&logo=khronos)
 ![CUDA](https://img.shields.io/badge/CUDA/PTX-supported-76B900?style=for-the-badge&logo=nvidia)
+[![Docker OpenCL](https://img.shields.io/badge/Docker-OpenCL-2496ED?style=for-the-badge&logo=docker&logoColor=white)](https://hub.docker.com/r/beehivelab/gpullama3.java-nvidia-openjdk-opencl)
+[![Docker PTX](https://img.shields.io/badge/Docker-PTX-2496ED?style=for-the-badge&logo=docker&logoColor=white)](https://hub.docker.com/r/beehivelab/gpullama3.java-nvidia-openjdk-ptx)
 [![GPULlama3.java DeepWiki](https://deepwiki.com/badge.svg)](https://deepwiki.com/beehive-lab/GPULlama3.java)
 
 -----------
@@ -226,6 +228,32 @@ Enable GPU acceleration with Q8_0 quantization:
 
 -----------
 
+## 🐳 Docker
+
+You can run `GPULlama3.java` fully containerized with GPU acceleration enabled via **OpenCL** or **PTX** using pre-built Docker images.
+More information as well as examples to run with the containers are available at [docker-gpullama3.java](https://github.com/beehive-lab/docker-gpullama3.java).
+
+### 📦 Available Docker Images
+
+| Backend | Docker Image | Pull Command |
+|--------|---------------|---------------|
+| **OpenCL** | [`beehivelab/gpullama3.java-nvidia-openjdk-opencl`](https://hub.docker.com/r/beehivelab/gpullama3.java-nvidia-openjdk-opencl) | `docker pull beehivelab/gpullama3.java-nvidia-openjdk-opencl` |
+| **PTX (CUDA)** | [`beehivelab/gpullama3.java-nvidia-openjdk-ptx`](https://hub.docker.com/r/beehivelab/gpullama3.java-nvidia-openjdk-ptx) | `docker pull beehivelab/gpullama3.java-nvidia-openjdk-ptx` |
+
+#### Example (OpenCL)
+
+```bash
+docker run --rm -it --gpus all \
+-v "$PWD":/data \
+beehivelab/gpullama3.java-nvidia-openjdk-opencl \
+/gpullama3/GPULlama3.java/llama-tornado \
+--gpu --verbose-init \
+--opencl \
+--model /data/Llama-3.2-1B-Instruct.FP16.gguf \
+--prompt "Tell me a joke"
+```
+-----------
+
 ## Troubleshooting GPU Memory Issues
 
 ### Out of Memory Error

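The added README section gives a run example only for the OpenCL image, although the table also lists a PTX image. As a rough sketch of how the two rows of that table could map onto one invocation, the helper below builds the `docker run` command for either backend. The function name `build_gpullama3_cmd` and the array `GPULLAMA3_CMD` are hypothetical and not part of the repository. It also assumes that the PTX image uses the same `llama-tornado` path as the OpenCL one and that `--ptx` is the counterpart of `--opencl`; neither assumption is confirmed by this commit.

```bash
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper (not part of the repository): fills the global array
# GPULLAMA3_CMD with a `docker run` command for the requested backend.
# Arguments: backend (opencl|ptx), model file name inside $PWD, prompt text.
build_gpullama3_cmd() {
  local backend="$1" model_file="$2" prompt="$3"
  local image backend_flag

  case "$backend" in
    opencl)
      image="beehivelab/gpullama3.java-nvidia-openjdk-opencl"
      backend_flag="--opencl"
      ;;
    ptx)
      image="beehivelab/gpullama3.java-nvidia-openjdk-ptx"
      backend_flag="--ptx"   # assumption: PTX counterpart of --opencl
      ;;
    *)
      echo "unknown backend: $backend (expected opencl or ptx)" >&2
      return 1
      ;;
  esac

  # Same shape as the README's OpenCL example; only image and flag vary.
  # Assumption: the llama-tornado path is identical in both images.
  GPULLAMA3_CMD=(
    docker run --rm -it --gpus all
    -v "$PWD":/data
    "$image"
    /gpullama3/GPULlama3.java/llama-tornado
    --gpu --verbose-init
    "$backend_flag"
    --model "/data/$model_file"
    --prompt "$prompt"
  )
}
```

After calling `build_gpullama3_cmd ptx Llama-3.2-1B-Instruct.FP16.gguf "Tell me a joke"`, the command can be inspected with `printf '%q ' "${GPULLAMA3_CMD[@]}"` or executed with `"${GPULLAMA3_CMD[@]}"`. Keeping the command in an array rather than a string preserves quoting for prompts and paths that contain spaces.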