Skip to content

support new api of chat/compilation api #367

@talregev

Description

@talregev

Is your feature request related to a problem? Please describe.
guidellm cannot connect with transformers serve.
transformers serve use new openai api. it found on
http://localhost:8000/v1/chat/completions
with new msg body:

{
    "model": "meta-llama/Llama-3.2-1B-Instruct",
    "messages": [{"role":"user","content":"Who is the SpaceX CEO?"}],
    "max_tokens": 20,
    "stream": false
 }

Describe the solution you'd like
guidellm will be implemented the new openai api such as v1/chat/completions.

Describe alternatives you've considered
I consider to translate the msgs between of the shelf programs, but it another layer and it give more complexity.

Additional context

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions