Skip to content

Conversation

@tarukumar
Copy link
Contributor

SUMMARY:
"please provide a brief summary"

TEST PLAN:
"please outline how the changes were tested"

tarukumar and others added 14 commits September 16, 2025 21:34
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
* New models for validation

* update qwen3 metrics. add server for distil-whisper

* update value

* update whisper-large-v3 and Voxtral model acccuracy server settings

* more Voxtral server settings

* accuracy servers need to be in RedHatAI too

* update Qwen2.5-VL-7B-Instruct-FP8-Dynamic values

* try Kimi-K2 with quad

* metric value for gpt-oss-20b from a run on k8s-a100-duo

* remove empty file

Co-authored-by: Derek Kozikowski <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
@dtrifiro
Copy link

Is this still relevant?

Signed-off-by: Tarun Kumar <[email protected]>
Signed-off-by: Tarun Kumar <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants