
Model Performance Benchmarking with GuideLLM

Introduction

Course Title: Model Performance Benchmarking with GuideLLM

Description: This hands-on course teaches you how to quantitatively measure, analyze, and optimize the performance of Large Language Models (LLMs) deployed on Red Hat OpenShift AI. You will use GuideLLM, an industry-standard benchmarking toolkit, within an automated Tekton pipeline to simulate real-world workloads and capture critical performance data. The course focuses on translating technical metrics into actionable business insights related to user experience, scalability, and cost efficiency.

Duration: 2 hours
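
To give a concrete sense of the benchmarks you will build, the sketch below shows a representative GuideLLM invocation against a local vLLM endpoint, adapted from the GuideLLM documentation. The target URL, duration, and token counts are placeholders, and exact flags can differ between GuideLLM releases, so check the documentation for your version:

    guidellm benchmark \
      --target "http://localhost:8000" \
      --rate-type sweep \
      --max-seconds 30 \
      --data "prompt_tokens=256,output_tokens=128"

This sweeps request rates against the endpoint, benchmarking each rate for up to 30 seconds with synthetic prompts of roughly 256 input tokens and 128 output tokens, and reports latency and throughput statistics at each load level.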


Objectives

On completing this course, you should be able to:

  • Deploy and configure an automated benchmarking pipeline using GuideLLM and Tekton on OpenShift AI.
  • Execute various performance tests that simulate real-world use cases like chat, RAG, and code generation.
  • Analyze and interpret key performance metrics, including latency (Time to First Token, Inter-Token Latency), throughput, and their statistical distributions (mean, median, p99); a short sketch after this list shows how these statistics are computed.
  • Connect performance results to business outcomes, such as infrastructure sizing, cost estimation, and defining Service Level Objectives (SLOs).
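
The statistics in the third objective can be computed with nothing more than the Python standard library. The sketch below is illustrative only, with made-up Time to First Token (TTFT) samples standing in for the values you would extract from a GuideLLM report:

    import statistics

    # Illustrative TTFT samples in milliseconds (not real benchmark data).
    ttft_ms = [182, 195, 201, 210, 224, 238, 251, 274, 302, 480]

    mean_ttft = statistics.mean(ttft_ms)      # average experience
    median_ttft = statistics.median(ttft_ms)  # typical experience
    # quantiles(n=100) returns 99 cut points; index 98 is the 99th percentile,
    # the tail latency that SLOs are usually written against.
    p99_ttft = statistics.quantiles(ttft_ms, n=100)[98]

    print(f"mean={mean_ttft:.1f} ms, median={median_ttft:.1f} ms, p99={p99_ttft:.1f} ms")

Note how a single slow request pushes the p99 far above the median; that gap between typical and worst-case latency is what drives SLO definitions and capacity planning.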

Prerequisites

This course assumes that you have the following prior experience:

  • Foundational knowledge of Large Language Models and the basics of model serving.
  • Familiarity with using the OpenShift command-line interface (oc) to interact with a cluster.
  • Access to a Red Hat OpenShift AI cluster with an available GPU node and a deployed LLM inference service (e.g., vLLM); the snippet after this list shows one way to verify this access.
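
Before starting, you can sanity-check the last prerequisite with the oc commands below. The token, server URL, and project name are placeholders, and listing InferenceService resources assumes KServe-based model serving, the default on OpenShift AI:

    # Log in to the cluster (substitute your own token and API server URL).
    oc login --token=<token> --server=https://api.<cluster-domain>:6443

    # List deployed model endpoints in your data science project.
    oc get inferenceservice -n <your-project>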

About

Evaluating system performance, including latency, throughput, and resource utilization, for Generative AI inference.
