Skip to content

Releases: radlab-dev-group/llm-router

v0.4.6

11 Jan 12:21
Immutable release. Only release title and notes can be modified.
3550a26

Choose a tag to compare

What's Changed

  • Add multi‑provider embedding support and converters

Full Changelog: v0.4.5...v0.4.6

v0.4.5

10 Jan 16:58
Immutable release. Only release title and notes can be modified.

Choose a tag to compare

What's Changed

  • Fix docstring formatting in OpenAIType
  • Update defaults, add deps, simplify endpoint logic
  • Add local Ollama config for gpt-oss-120b model
  • refactor/providers
  • Refactor streaming logic and clean up HttpRequestExecutor
  • Add native LMStudio streaming support and refactor streaming
  • Add native LMStudio streaming support and refactor handlers

Full Changelog: v0.4.4...v0.4.5

v0.4.4

31 Dec 04:47
Immutable release. Only release title and notes can be modified.
8d94fd2

Choose a tag to compare

What's Changed

  • Validate unique provider identifiers.
  • Store all hosts with keep‑alive configured in a Redis.
  • UtilsPlugin pipeline with LangChain based simple RAG plugin (extending context to GenAI with locally built databse).
  • Improve dev tooling and enforce request timeouts.
  • Refactor API to use unified /v1/responses endpoint.

Full Changelog: https://github.com/radlab-dev-group/llm-router/commits/v0.4.4

v0.4.3

23 Dec 16:05
9883043

Choose a tag to compare

What's Changed

  • Helm chart in #20
  • Add sample auditor log file to logs/auditor/ directory in #21
  • Add Prometheus metrics handler with multiprocess support in #22
  • Add configurable monitoring intervals to services monitor in #25
  • Skip FastMaskerPlugin in LLMRouterServicesMonitor host probing and clean up unused guard‑rail variables in #26
  • Add V0 chat handler and integer timestamps to models endpoint in #28
  • Add model config docs and tool calling support in #31
  • Refactor monitor log prefixes to explicit [*-monitor] tags and comment out Engine del cleanup in #32
  • Add fake model provider and fix guardrail streaming in #33

Full Changelog: v0.4.2...v0.4.3

v0.4.2

13 Dec 03:19

Choose a tag to compare

Full Changelog: v0.4.1...v0.4.2

v0.4.1

05 Dec 18:13

Choose a tag to compare

Full Changelog: v0.4.0...v0.4.1

v0.4.0

28 Nov 20:44

Choose a tag to compare

Full Changelog: v0.3.1...v0.4.0

v0.3.1

23 Nov 00:10

Choose a tag to compare

Full Changelog: v0.3.0...v0.3.1

v0.3.0

17 Nov 02:37

Choose a tag to compare

Full Changelog: v0.2.3...v0.3.0

v0.2.3

14 Nov 03:59

Choose a tag to compare

Full Changelog: v0.2.2...v0.2.3