Description
Examples are needed for every dataset format that can be passed as input to the GuideLLM benchmark.
I tried the example from the current README:
guidellm benchmark --target "http://${IP}:${PORT}" --rate-type sweep --max-seconds 30 --data "prompt_tokens=256,output_tokens=128"
I got an error about AutoTokenizer not being known. After some debugging, I suspected that the tokenizers for Mistral and Llama3 aren't supported yet (I used vLLM instances with those models loaded), and that the error occurs because a processor is required for synthetic data benchmarking.
I then tried a Hugging Face dataset ID:
guidellm benchmark --target "http://${IP}:${PORT}" --rate-type synchronous --max-seconds 30 --data ${HF_DS_ID} --data-args '{"prompt_column": "prompt"}'
The benchmark ran, but every request failed with an error in OpenAIHTTPBackend, which makes me think the data format I'm sending isn't correct.
Finally, I tried using a local JSON file as the dataset:
guidellm benchmark --target "http://${IP}:${PORT}" --rate-type synchronous --max-seconds 30 --data "prompts.json"
and got the error TypeError: 'PosixPath' object is not iterable
I would appreciate help running GuideLLM benchmarks with different dataset configurations.
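For reference, here is a minimal sketch of how a local file like prompts.json could be generated and sanity-checked before it is passed to --data. The record layout (a JSON array of objects with a "prompt" key, mirroring the prompt_column value used above) is an assumption and not a confirmed GuideLLM format; the helper names write_prompt_file and load_prompt_file are hypothetical and are not part of GuideLLM.

```python
import json
from pathlib import Path


def write_prompt_file(path, prompts, column="prompt"):
    """Write prompts as a JSON array of objects, one object per prompt.

    ASSUMPTION: this layout is only a guess at what a JSON dataset might
    look like; it is not taken from the GuideLLM documentation.
    """
    records = [{column: text} for text in prompts]
    out = Path(path)
    out.write_text(json.dumps(records, indent=2), encoding="utf-8")
    return out


def load_prompt_file(path, column="prompt"):
    """Read the file back and check that every record has the column."""
    records = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(records, list):
        raise ValueError("expected a JSON array of objects")
    for index, record in enumerate(records):
        if not isinstance(record, dict) or column not in record:
            raise ValueError(f"record {index} has no '{column}' field")
    return [record[column] for record in records]


if __name__ == "__main__":
    target = write_prompt_file(
        "prompts.json",
        ["Summarize the plot of Hamlet.", "Explain what a mutex is."],
    )
    print(f"wrote {len(load_prompt_file(target))} prompts to {target}")
```

Validating the file this way at least separates problems in the file itself (malformed JSON, a missing column) from problems in how the benchmark loads it.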