Skip to content

Attempting to connect to upstream llama-server ui via llama-swap results in a 404 #306

@doctorjames

Description

@doctorjames

Describe the bug
I have just updated the llama-swap docker container to the latest cuda version (v159-cuda-b6503) and the upstream web UI no longer works. This is likely caused by the new svelte web UI in llama-server.

Looking at the logs, included below, and confirmed in the Firefox debugger, it looks like the new svelte app tried to load an absolute path from the server. So this might be an issue that needs coordinating with the upstream llama-server folks.

Seems related: ggml-org/llama.cpp#16079

Expected behaviour
Upstream web interface is available.

Operating system and version

  • OS: Linux/docker v159-cuda-b6503

Proxy Logs

[INFO] Request 172.25.250.1 "GET /upstream/qwen3-4b/ HTTP/1.1" 200 842295 "Mozilla/5.0 (X11; Linux x86_64; rv:142.0) Gecko/20100101 Firefox/142.0" 2.866833ms
[INFO] Request 172.25.250.1 "GET /_app/version.json HTTP/1.1" 404 0 "Mozilla/5.0 (X11; Linux x86_64; rv:142.0) Gecko/20100101 Firefox/142.0" 6.208µs
[INFO] Request 172.25.250.1 "GET /props HTTP/1.1" 404 0 "Mozilla/5.0 (X11; Linux x86_64; rv:142.0) Gecko/20100101 Firefox/142.0" 5.666µs

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't working

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions