Lemonade LLMs in docker container (Linux) #578
VladimirVLF
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
You can easily run Lemonade in docker if you are on Linux, e.g. Ubuntu.
Here Lemonade with llamacpp with Vulkan back-end is bundled with Open-WebUi into a docker compose service.
NB: It will likely not work on Windows host due to device sharing
/dev/dri:/dev/dri.You will need Docker installation and the two files below:
Dockerfileanddocker-compose.ymlDockerfile(change the URL to Lemonade installer and context window size as needed):docker-compose.yml(configure your path to models):To get it working follow these steps:
lemonade:latest:docker build -t lemonade:latest .Admin panel->Settings->Connections-> OpenAI API connection URL: http://lemonade:8000/api/v1Now you should be able to launch a chat in Open WebUI, select the downloaded Lemonade model and chat with it.
For reference, my laptop with Ryzen AI 7 Pro 360 with iGPU Radeon 880M and 64 GB vRAM (max 32GB for iGPU) runs gpt-oss-20b-mxfp4-GGUF model with 128k context window size at 27.8 tokens per second.
I also had amdgpu driver with xrt and xdna installed on the laptop native Ubuntu OS. Not sure if it matters.
Beta Was this translation helpful? Give feedback.
All reactions