What's Changed
- Implement real streaming support in chat completions. #2
- Optimized for LM chat interfaces (like OpenWebUI) to render as reasoning tokens. #2
- Support for user provided system prompt. #2
- Improve API documentation #2
Full Changelog: 0.0.9...0.0.91
Reasoning effort set to
highwithGPT-4o-mini