-
Notifications
You must be signed in to change notification settings - Fork 113
Description
Describe the bug
I have just updated the llama-swap docker container to the latest cuda version (v159-cuda-b6503) and the upstream web UI no longer works. This is likely caused by the new svelte web UI in llama-server.
Looking at the logs, included below, and confirmed in the Firefox debugger, it looks like the new svelte app tried to load an absolute path from the server. So this might be an issue that needs coordinating with the upstream llama-server folks.
Seems related: ggml-org/llama.cpp#16079
Expected behaviour
Upstream web interface is available.
Operating system and version
- OS: Linux/docker v159-cuda-b6503
Proxy Logs
[INFO] Request 172.25.250.1 "GET /upstream/qwen3-4b/ HTTP/1.1" 200 842295 "Mozilla/5.0 (X11; Linux x86_64; rv:142.0) Gecko/20100101 Firefox/142.0" 2.866833ms
[INFO] Request 172.25.250.1 "GET /_app/version.json HTTP/1.1" 404 0 "Mozilla/5.0 (X11; Linux x86_64; rv:142.0) Gecko/20100101 Firefox/142.0" 6.208µs
[INFO] Request 172.25.250.1 "GET /props HTTP/1.1" 404 0 "Mozilla/5.0 (X11; Linux x86_64; rv:142.0) Gecko/20100101 Firefox/142.0" 5.666µs