Ollama GPU discovery failure after reboot (CUDA error 999) #5740

@nickheyer

✅ Have you read and understood the above guidelines?

yes

📜 What is the name of the script you are using?

Ollama (LXC)

📂 What was the exact command used to execute the script?

bash -c "$(curl -fsSL https://raw.githubusercontent.com/community-scripts/ProxmoxVE/main/ct/ollama.sh)"

βš™οΈ What settings are you using?

  • Default Settings
  • Advanced Settings

🖥️ Which Linux distribution are you using?

Debian 12

πŸ“ Provide a clear and concise description of the issue.

When start-on-boot brings up the Ollama LXC as Proxmox boots, GPU discovery fails and Ollama falls back to CPU-based inference, which is basically unusable for "production inference".

journalctl shows nvidia-persistenced service errors related to permissions, but manually changing the permissions of the files it mentions (under /etc/nvidia, iirc) inside the LXC does not seem to fix the issue.
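
For anyone comparing host and container, here is a minimal sketch that prints owner and mode for the NVIDIA device nodes. It does not touch the /etc/nvidia files mentioned above; looking at /dev/nvidia* is my own assumption about another place where a permissions mismatch could show up, not a confirmed cause. The function name list_nvidia_nodes is made up for illustration, and `stat -c` assumes GNU coreutils (as on Debian 12).

```shell
#!/usr/bin/env bash
# Print "path owner:group mode" for every NVIDIA device node in a directory.
# Run once on the Proxmox host and once inside the LXC, then compare.
# Assumption: nodes live under /dev and are named nvidia*.

list_nvidia_nodes() {
  local dir="${1:-/dev}"
  local node
  local found=0
  for node in "$dir"/nvidia*; do
    # If the glob matched nothing it stays literal, so skip non-existent paths.
    [ -e "$node" ] || continue
    found=1
    stat -c '%n %U:%G %a' "$node"
  done
  [ "$found" -eq 1 ] || echo "no nvidia device nodes under $dir"
}

list_nvidia_nodes /dev
```

If the container shows fewer nodes than the host, or different ownership, that difference is worth including in a report.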

🔄 Steps to reproduce the issue.

1. Create the Ollama LXC with default settings.
2. Install nvidia--open (the recommended drivers).
3. Confirm the driver works with nvidia-smi.
4. Test GPU usage with Ollama.
5. Restart the host machine.
6. Run journalctl inside the LXC and see errors about the GPU not being detected.
7. Run nvidia-smi and see that the GPU is still accessible.
8. Restart the Ollama service or the entire container; the problem still persists.
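
The post-reboot checks in the steps above can be sketched roughly as follows, run inside the LXC. The systemd unit name `ollama` and the search pattern are assumptions, so adjust both to your setup; filter_gpu_lines is a hypothetical helper, not part of Ollama or the install script.

```shell
#!/usr/bin/env bash
# Post-reboot check: is the GPU visible to the driver tools, and what did
# Ollama log about it since this boot?
# Assumptions: the service unit is called "ollama"; the pattern is a guess.

UNIT="${UNIT:-ollama}"
PATTERN="${PATTERN:-cuda|gpu|nvidia}"

# Keep only lines from stdin that mention the GPU (case-insensitive).
# "|| true" keeps the exit status 0 when nothing matches.
filter_gpu_lines() {
  grep -Ei -- "$PATTERN" || true
}

# The GPU should still be listed here even when Ollama fails to discover it.
if command -v nvidia-smi >/dev/null 2>&1; then
  nvidia-smi -L || echo "nvidia-smi failed"
else
  echo "nvidia-smi not found"
fi

# Journal entries for the unit from the current boot only.
if command -v journalctl >/dev/null 2>&1; then
  journalctl -u "$UNIT" -b --no-pager 2>/dev/null | filter_gpu_lines
else
  echo "journalctl not found"
fi
```

Saving the output of both commands right after a reboot would capture the full error text that is missing from this report.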

❌ Paste the full error output (if available).

CUDA error 999 (the generic, nondescript "unknown error" code). I do not have the output right now.

πŸ–ΌοΈ Additional context (optional).

nvidia-575-open
