You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
llama-router: auto-rescan, admin endpoints, and fixes
Auto-rescan models on startup:
- Scan cache directory and add new .gguf files as 'auto' models
- Remove 'auto' models no longer present in cache
- Never touch 'manual' models (user-managed configuration)
- Preserve custom spawn/group settings for existing models
- New /admin/rescan endpoint for on-demand rescanning
Separate admin endpoints:
- Extract /admin routes to router-admin.cpp/h
- Clean separation: router-endpoints.cpp = public API only
- Add RouterApp::update_config() for live config updates
- Support both Bearer token and X-Admin-Token header auth
Fixes:
- Fix /model/(health|props|slots) path rewriting for backends
- Thread-safe streaming: eliminate parent scope captures
- Robust JSON parsing for 'stream' field detection
- Simplified signal handlers (remove redundant stop_all)
- Initialize logger before any LOG_* calls
0 commit comments