Add support for KV Cache for /chat/completions API #180

@irar2

Description

Relevant for:

  • KV events
  • Prefill time calculations
  • Prometheus metrics
