Documentation regarding the DS format to be fed to GuideLLM

# There is a need in writing down examples for all DSs formats that can be fed as an input to the GuideLLM benchmark.

Tried using the example in the current README - 
`guidellm benchmark --target "http://${IP}:${PORT}" --rate-type sweep --max-seconds 30 --data "prompt_tokens=256,output_tokens=128"` - 
Got an error for autoTokenizer not known. After debugging a little thought that it might be that tokenizers for Mistarl and Llama3 aren't supported yet (used VLLM instances with those loaded), and the error occurs since processor is required for synthetic data bencmarking.  

Then tried using a HF model ID with - 
`guidellm benchmark --target "http://${IP}:${PORT}" --rate-type synchronous --max-seconds 30 --data ${HF_DS_ID} --data-args '{"prompt_column": "prompt"}'` and the benchmark ran but got an error in openAIHTTPBackend for every request, which made me think that the data format I'm sending isn't correct.

then tried to used a local JSON file as a DS and ran with - 
`guidellm benchmark --target "http://${IP}:${PORT}" --rate-type synchronous --max-seconds 30 --data "prompts.json"`
ad got a `TypeError: 'PosixPath' object is not iterable` error

Would appreciate help in running GuideLLM benchmarks with different DSs configurations. 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Documentation regarding the DS format to be fed to GuideLLM #133

There is a need in writing down examples for all DSs formats that can be fed as an input to the GuideLLM benchmark.

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Documentation regarding the DS format to be fed to GuideLLM #133

Description

There is a need in writing down examples for all DSs formats that can be fed as an input to the GuideLLM benchmark.

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions