Add multimodal benchmarking usage docs #568

markurtz · 2026-01-30T15:36:47Z

Summary

This PR adds documentation for benchmarking multimodal models (image, video, and audio) with GuideLLM using OpenAI-compatible endpoints, and links these guides into the broader docs navigation.

Details

Added dedicated guides for image, video, and audio benchmarking, covering setup, data loading, request formatting, metrics, and example guidellm benchmark commands.
Introduced a multimodal benchmarking index page that explains prerequisites and links to each modality-specific guide.
Updated the main Guides index to surface the new multimodal benchmarking docs.

Test Plan

Manual testing

"I certify that all code in this PR is my own, except as noted below."

Use of AI

Includes AI-assisted code completion
Includes code generated by an AI application
Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes ## WRITTEN BY AI ##)

Signed-off-by: Mark Kurtz <[email protected]>

Signed-off-by: michelia <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Copilot

Pull request overview

This PR adds documentation for benchmarking multimodal models (image, video, and audio) with GuideLLM using OpenAI-compatible endpoints, and links these guides into the broader docs navigation.

Changes:

Added dedicated guides for image, video, and audio benchmarking, covering setup, data loading, request formatting, metrics, and example guidellm benchmark commands.
Introduced a multimodal benchmarking index page that explains prerequisites and links to each modality-specific guide.
Updated the main Guides index to surface the new multimodal benchmarking docs.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
`docs/guides/multimodal/image.md`	New guide for benchmarking vision-language models with image inputs, including data-column mapping, request formatting, metrics, and VQA/captioning examples.
`docs/guides/multimodal/video.md`	New guide for benchmarking video-language models with video inputs, including data loading, video request formatting, metrics, and QA/captioning examples.
`docs/guides/multimodal/audio.md`	New guide for benchmarking audio models for ASR, translation, and audio chat, with detailed encoder options, metrics, and three example benchmark commands.
`docs/guides/multimodal/index.md`	New multimodal overview page describing prerequisites and linking to image, video, and audio benchmarking guides.
`docs/guides/index.md`	Adds a “Multimodal Benchmarking” card that links the main Guides index to the new multimodal documentation.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

docs/guides/multimodal/audio.md

docs/guides/multimodal/index.md

docs/guides/multimodal/image.md

docs/guides/multimodal/video.md

Co-authored-by: Copilot <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

docs/guides/multimodal/audio.md

sjmonson

Overall just a few nits around explaining arguments. There is also a lot of redundancy around explaining column mapping, output files, etc that would possible be better to have dedicated docs for with backlinks, but we can address that in a future PR.

docs/guides/multimodal/audio.md

docs/guides/multimodal/image.md

docs/guides/multimodal/audio.md

docs/guides/multimodal/video.md

Co-authored-by: Samuel Monson <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

markurtz requested review from Copilot and sjmonson January 30, 2026 15:36

markurtz self-assigned this Jan 30, 2026

markurtz and others added 3 commits January 30, 2026 10:37

Add docs for benchmarking multimodal models

16ad81c

Signed-off-by: Mark Kurtz <[email protected]>

style fixes

c42877a

Signed-off-by: Mark Kurtz <[email protected]>

chore: add license and readme

2d7503b

Signed-off-by: michelia <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

markurtz force-pushed the features/docs-multimodal branch from 125b659 to 2d7503b Compare January 30, 2026 15:37

Copilot started reviewing on behalf of markurtz January 30, 2026 15:37 View session

Merge branch 'main' into features/docs-multimodal

0bc9179

Copilot AI reviewed Jan 30, 2026

View reviewed changes

markurtz and others added 5 commits January 30, 2026 10:41

Update docs/guides/multimodal/audio.md

1b55ea1

Co-authored-by: Copilot <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Update docs/guides/multimodal/audio.md

2b2f57a

Co-authored-by: Copilot <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Update docs/guides/multimodal/index.md

4aed84f

Co-authored-by: Copilot <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Update docs/guides/multimodal/video.md

22b9da3

Co-authored-by: Copilot <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Update docs/guides/multimodal/image.md

df16974

Co-authored-by: Copilot <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

sjmonson reviewed Jan 30, 2026

View reviewed changes

docs/guides/multimodal/audio.md Show resolved Hide resolved

sjmonson requested changes Jan 30, 2026

View reviewed changes

markurtz and others added 5 commits January 30, 2026 12:09

Update docs/guides/multimodal/audio.md

2a2ac6e

Co-authored-by: Samuel Monson <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Update docs/guides/multimodal/audio.md

2d10048

Co-authored-by: Samuel Monson <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Update docs/guides/multimodal/image.md

ff1a97c

Co-authored-by: Samuel Monson <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Update docs/guides/multimodal/audio.md

edffd89

Co-authored-by: Samuel Monson <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

Update docs/guides/multimodal/video.md

89013a7

Co-authored-by: Samuel Monson <[email protected]> Signed-off-by: Mark Kurtz <[email protected]>

sjmonson approved these changes Jan 30, 2026

View reviewed changes

markurtz merged commit c082cb6 into main Jan 30, 2026
13 of 15 checks passed

markurtz deleted the features/docs-multimodal branch January 30, 2026 17:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add multimodal benchmarking usage docs #568

Add multimodal benchmarking usage docs #568

markurtz commented Jan 30, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sjmonson left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Add multimodal benchmarking usage docs #568

Add multimodal benchmarking usage docs #568

Conversation

markurtz commented Jan 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Details

Test Plan

Use of AI

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sjmonson left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

markurtz commented Jan 30, 2026 •

edited

Loading