- The default `compose.yml` file is configured to use the NVIDIA Container Toolkit, which means the Ollama container will be GPU-accelerated if you have a compatible NVIDIA GPU and the toolkit installed.
- To use GPU acceleration, you must install the NVIDIA Container Toolkit and have a compatible NVIDIA GPU on the system you deploy to. If you do not have a compatible NVIDIA GPU, you can use the CPU-only version of Ollama by commenting out the `deploy` section under the `ollama` service in the `compose.yml` file.
- Installing the NVIDIA Container Toolkit:
- Source
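  - For reference, the `deploy` section that grants the container GPU access typically looks like the sketch below; this is the standard Docker Compose GPU reservation syntax, and the exact contents of this repo's `compose.yml` may differ. Commenting out the `deploy` block yields CPU-only operation.

    ```yaml
    services:
      ollama:
        image: ollama/ollama
        # Comment out the `deploy` section below to run Ollama on CPU only.
        deploy:
          resources:
            reservations:
              devices:
                - driver: nvidia
                  count: all
                  capabilities: [gpu]
    ```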
- I keep a record of what I opted to do in a `setup.md` file in this repository.
- I'm running Linux with an NVIDIA 4080 Super on a desktop PC.
- To start the development context, run `docker compose --profile dev up --build` (add the `-d` flag for detached mode).
- Pull the llama3.1 model into the container:
  `docker exec -it ollama ollama run llama3.1`
- You can run the provided UI by changing into the `ui` directory and running `pnpm run dev`.
- The `/ui` package is set to proxy the backend in the `vite.config.ts` file.
- If you want Rust to serve the frontend `/ui` package, you must run:
  - `pnpm run build`: this will generate files in the `/core/static` folder.
  - You may need to rebuild the Docker image for Rust to see them: `docker compose --profile dev up --build`
  - Alternatively, run `pnpm build --watch` alongside `docker compose up` to pick up changes as you build.
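The backend proxy mentioned above lives in `vite.config.ts`; a minimal sketch of such a setup is shown below. The `/api` route prefix and backend port `3000` are assumptions for illustration, not taken from this repo's actual config.

```typescript
import { defineConfig } from "vite";

export default defineConfig({
  server: {
    proxy: {
      // Forward API calls from the Vite dev server to the Rust backend.
      // "/api" and port 3000 are assumptions; check this repo's vite.config.ts.
      "/api": {
        target: "http://localhost:3000",
        changeOrigin: true,
      },
    },
  },
});
```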
- To inspect the container: `docker compose --profile dev run --rm dev bash`
Reference examples from the `rig` repository:
- Agent routing: https://github.com/0xPlaygrounds/rig/blob/main/rig/rig-core/examples/agent_routing.rs
- Ollama stream pause/resume: https://github.com/0xPlaygrounds/rig/blob/main/rig/rig-core/examples/ollama_streaming_pause_control.rs
- Ollama tools: https://github.com/0xPlaygrounds/rig/blob/main/rig/rig-core/examples/ollama_streaming_with_tools.rs
- Ollama RAG: https://github.com/0xPlaygrounds/rig/blob/main/rig/rig-core/examples/rag_ollama.rs