-
Notifications
You must be signed in to change notification settings - Fork 209
Osaurus is slow and responds very late. #648
Description
Checks
- I have searched existing issues and discussions
- I can reproduce this with the latest
mainor release
Describe the bug
Osaurus's slowness and extremely slow response times are a real problem. I tested it on the qwen 3.5 9b 4bit MLX version as an LLM user. Using the XLSX plugin, I asked it to write the top 10 OWASP vulnerabilities to Excel. It took 30 minutes, but it still didn't respond. However, when I tried the same prompt and the same LLM model, LM Studio responded in less than a minute. Osaurus's interface is generally good, but it's really very slow. If this problem isn't fixed, I'll switch back to Ollama or LM Studio. This isn't just the case with this prompt; it's very slow with other prompts as well. My computer specifications are an M4 Air with 24GB of RAM. I don't think the RAM and CPU amount are the problem, because LM Studio works flawlessly.
Steps to reproduce
No response
Osaurus version / commit
Osaurus 0.14.16
macOS version
macOS Sequoia 15.7.1
Apple Silicon chip
m4
Xcode version
No response
Logs
Screenshots
No response