Add regression CI by mawad-amd · Pull Request #206 · ROCm/iris

mawad-amd · 2025-10-09T06:03:25Z

Motivation

Add regression CI to avoid performance bugs in examples with known high performance.

Technical Details

Checks if GEMM All-Scatter performance is greater than some threshold for 8 GPUs.

Test Plan

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

Copilot

Pull Request Overview

This PR adds a performance regression CI workflow to prevent performance degradation in the Iris multi-GPU framework. The workflow specifically tests GEMM All-Scatter performance using 8 GPUs to ensure it maintains at least 2000 TFLOPs.

Key changes:

Introduces automated performance testing for critical GPU operations
Sets up Apptainer-based containerized testing environment
Implements threshold-based validation with detailed error reporting

.github/workflows/iris-performance-regression-test.yml

Add regression CI

b631c73

Copilot AI review requested due to automatic review settings October 9, 2025 06:03

mawad-amd requested review from BKP and neoblizz as code owners October 9, 2025 06:03

github-actions bot added in-progress We are working on it iris Iris project issue labels Oct 9, 2025

Copilot AI reviewed Oct 9, 2025

View reviewed changes

.github/workflows/iris-performance-regression-test.yml Show resolved Hide resolved

.github/workflows/iris-performance-regression-test.yml Show resolved Hide resolved

.github/workflows/iris-performance-regression-test.yml Outdated Show resolved Hide resolved

mawad-amd and others added 6 commits October 9, 2025 01:07

Fix bad command line option

1d00333

Add set -e and fix test

900932a

Modify the threshold

9b350e3

Merge branch 'main' into muhaawad/regression-ci-1

25fef4c

Run all gemm + all scatters

bbee75c

Add missing script

6bf1f2f

mawad-amd merged commit cdc05dc into main Oct 9, 2025
17 of 18 checks passed

mawad-amd deleted the muhaawad/regression-ci-1 branch October 9, 2025 17:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Add regression CI#206

Add regression CI#206
mawad-amd merged 7 commits intomainfrom
muhaawad/regression-ci-1

mawad-amd commented Oct 9, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Comments

Conversation

mawad-amd commented Oct 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

mawad-amd commented Oct 9, 2025 •

edited

Loading