feat(primbench): support an extra backend by MyNameIsTrez · Pull Request #5512 · ROCm/rocm-libraries

MyNameIsTrez · 2026-03-17T13:55:32Z

Motivation

The PR feat(rocrand): use primbench requires primbench to support CUDA, since rocRAND contains cuRAND benchmarks.

Technical Details

I've spent a few hours cleaning up the commit history. I recommend reviewing this PR by clicking on the first commit in the Commits tab, and then clicking the Next button to step through the commits chronologically.

The first commit is feat(primbench): support CUDA and is by far the largest and most complex, so I recommend reviewing the other commits first since they're nice and small.
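For reviewers unfamiliar with dual-backend headers, here is a minimal sketch of one common way a single header can target both CUDA and HIP. It is not taken from primbench.hpp: the names `gpu_error_t`, `gpu_success`, `gpu_device_synchronize`, `gpu_check` and `backend_name` are hypothetical, and the host fallback exists only so the sketch compiles without a GPU toolchain. It assumes the compiler-defined macros `__CUDACC__` (nvcc) and `__HIPCC__` (hipcc).

```cpp
#include <cassert>

// Hypothetical backend shim; primbench's real macros and names may differ.
#if defined(__CUDACC__)
    // Compiled by nvcc: forward to the CUDA runtime.
    #include <cuda_runtime.h>
using gpu_error_t                   = cudaError_t;
constexpr gpu_error_t gpu_success   = cudaSuccess;
constexpr const char* backend_name  = "cuda";
inline gpu_error_t gpu_device_synchronize() { return cudaDeviceSynchronize(); }
#elif defined(__HIPCC__)
    // Compiled by hipcc: forward to the HIP runtime.
    #include <hip/hip_runtime.h>
using gpu_error_t                   = hipError_t;
constexpr gpu_error_t gpu_success   = hipSuccess;
constexpr const char* backend_name  = "hip";
inline gpu_error_t gpu_device_synchronize() { return hipDeviceSynchronize(); }
#else
    // Host-only fallback so the sketch builds with a plain C++ compiler.
using gpu_error_t                   = int;
constexpr gpu_error_t gpu_success   = 0;
constexpr const char* backend_name  = "host";
inline gpu_error_t gpu_device_synchronize() { return gpu_success; }
#endif

// Backend-agnostic error check: code above this line is the only part
// that needs to know which runtime is in use.
inline void gpu_check(gpu_error_t error)
{
    assert(error == gpu_success);
}
```

Benchmark code would then call `gpu_check(gpu_device_synchronize());` without mentioning either runtime directly.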

The feat(primbench): support vector CLI args commit was also required for rocRAND, since its benchmarks accept an array of values: --lambda 1 2 3.
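To illustrate what accepting `--lambda 1 2 3` involves, here is a minimal sketch of collecting every value that follows a flag. It is an assumption about the general technique, not primbench's actual parser: `parse_vector_arg` is a hypothetical helper, and it treats any token starting with `--` as the start of the next flag.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, not primbench's real API.
// Returns all values following the first occurrence of `flag`, stopping at
// the next token that starts with "--" or at the end of the argument list.
// Returns an empty vector if the flag is absent.
inline std::vector<double> parse_vector_arg(const std::vector<std::string>& args,
                                            const std::string&              flag)
{
    assert(!flag.empty());

    std::vector<double> values;
    for(std::size_t i = 0; i < args.size(); ++i)
    {
        if(args[i] != flag)
            continue;

        // Consume values until the next "--" token. A single leading "-"
        // is allowed, so negative numbers such as "-1" are parsed as values.
        for(std::size_t j = i + 1; j < args.size() && args[j].rfind("--", 0) != 0; ++j)
            values.push_back(std::stod(args[j])); // throws on non-numeric input

        break;
    }
    return values;
}
```

With the arguments `--lambda 1 2 3 --size 1024`, calling `parse_vector_arg(args, "--lambda")` yields three values and leaves `--size` for a separate lookup.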

Two flags were removed

Even though this PR adds CUDA support to primbench.hpp, the file shrank from 4075 lines to 3593 lines. This is mostly because I removed the flags --output-hip-device-properties-context and --output-amdsmi-context.

These flags were used to dump an extreme amount of extra context to the JSON output, but I have not actually found the extra context useful even once while updating all 53 rocPRIM benchmarks to use primbench.

Porting these flags to CUDA would have been a nightmare, and removing these two flags has made the library significantly more maintainable. In the future we might decide to output a few of the most useful context fields to the JSON output again.

Test Plan

VS Code dev containers

CUDA Dockerfile + devcontainer.json

Dockerfile:

FROM nvidia/cuda:12.9.1-devel-ubuntu24.04

RUN apt update && apt install -y git cmake ninja-build wget

RUN wget -qO- https://repo.radeon.com/rocm/rocm.gpg.key | gpg --dearmor > /etc/apt/keyrings/rocm.gpg

RUN echo "deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/7.2 noble main" > /etc/apt/sources.list.d/rocm.list

RUN tee /etc/apt/preferences.d/rocm-pin-600 <<'EOF'
Package: *
Pin: origin repo.radeon.com
Pin-Priority: 600
EOF

RUN apt update && apt install -y hip-base

ENV HIP_PLATFORM=nvidia

devcontainer.json:

{
    "build": {
        "dockerfile": "Dockerfile"
    },
    "name": "cuda-minimal",
    "privileged": true,
    "runArgs": [
        "--gpus=all"
    ]
}

HIP Dockerfile + devcontainer.json

Dockerfile:

FROM rocm/rocm-terminal:latest

devcontainer.json:

{
    "build": {
        "dockerfile": "Dockerfile"
    },
    "name": "hip-minimal",
    "privileged": true
}

Setup

git checkout users/mynameistrez/add-cuda-support-to-primbench && \
cd shared/primbench

Running the CUDA example benchmark

nvcc -o copy_benchmark examples/cuda/copy_benchmark.cu -I. -lnvidia-ml && ./copy_benchmark

Running the HIP example benchmark

hipcc -o copy_benchmark examples/hip/copy_benchmark.cpp -I. -lamd_smi && ./copy_benchmark

Test Result

The above commands work for me, and the JSON and CSV output looks correct.

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

codecov-commenter · 2026-03-17T21:45:05Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

❌ Your project status has failed because the head coverage (77.21%) is below the target coverage (80.00%). You can increase the head coverage or adjust the target coverage.

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #5512      +/-   ##
===========================================
- Coverage    66.57%   64.27%   -2.29%     
===========================================
  Files         1836     1988     +152     
  Lines       282885   308555   +25670     
  Branches     39734    40299     +565     
===========================================
+ Hits        188308   198317   +10009     
- Misses       78211    93495   +15284     
- Partials     16366    16743     +377

| Flag | Coverage Δ | | *Carryforward flag |
|---|---|---|---|
| hipBLAS | `90.67% <ø> (ø)` | | Carriedforward from 1f1c69c |
| hipBLASLt | `43.55% <ø> (ø)` | | Carriedforward from 1f1c69c |
| hipCUB | `82.38% <ø> (ø)` | | Carriedforward from 1f1c69c |
| hipDNN | `84.98% <ø> (-0.01%)` | ⬇️ | Carriedforward from 1f1c69c |
| hipFFT | `56.36% <ø> (ø)` | | Carriedforward from 1f1c69c |
| hipRAND | `76.12% <ø> (ø)` | | Carriedforward from 1f1c69c |
| hipSOLVER | `68.81% <ø> (ø)` | | Carriedforward from 1f1c69c |
| hipSPARSE | `84.70% <ø> (ø)` | | Carriedforward from 1f1c69c |
| rocBLAS | `47.97% <ø> (ø)` | | Carriedforward from 1f1c69c |
| rocFFT | `47.37% <ø> (ø)` | | Carriedforward from 1f1c69c |
| rocPRIM | `39.04% <ø> (?)` | | |
| rocRAND | `57.07% <ø> (ø)` | | Carriedforward from 1f1c69c |
| rocSOLVER | `77.21% <ø> (ø)` | | Carriedforward from 1f1c69c |
| rocSPARSE | `71.48% <ø> (ø)` | | Carriedforward from 1f1c69c |

*This pull request uses carry forward flags.

152 files have indirect coverage changes.


github-actions bot added the documentation, project: rocprim, and shared: primbench labels Mar 17, 2026

assistant-librarian bot added the organization: ROCm label Mar 17, 2026

MyNameIsTrez force-pushed the users/mynameistrez/add-cuda-support-to-primbench branch 6 times, most recently from 58d791c to 4743f41 Compare March 17, 2026 16:35

MyNameIsTrez force-pushed the users/mynameistrez/add-cuda-support-to-primbench branch from 8a8c064 to 51b7779 Compare March 18, 2026 17:20

MyNameIsTrez added 16 commits March 18, 2026 18:30

feat(primbench): support CUDA

e950001

refactor(primbench): move code to filter_specializations() and sort_specializations()

e454df9

refactor(primbench): move code to get_common_algorithm()

63dd1d8

refactor(primbench): move code to get_header()

d1583e8

refactor(primbench): move code to run_all_specializations()

ef2503f

refactor(primbench): move code to ensure_specializations_exist() and compute_max_specialization_width()

0c9c2bb

docs(primbench): use consistent comment style

f6fadbc

chore(primbench): remove unused <optional> header

831f570

feat(primbench): support vector CLI args

76265e6

refactor(primbench): split state.run() into state.run_iteration()

8bfe839

docs(primbench): update example results.json

0eeb7fa

fix(primbench): escape filter in JSON

0f432c4

fix(primbench): fix mi300x outputting extra data in arch

a4d3621

feat(primbench): improve backend JSON format

c440887

fix(rocprim): remove old primbench args

c32ded0

chore(primbench): apply clang-format

3e9ed1b

MyNameIsTrez force-pushed the users/mynameistrez/add-cuda-support-to-primbench branch from 51b7779 to 3e9ed1b Compare March 18, 2026 18:33

MyNameIsTrez wants to merge 16 commits into develop from users/mynameistrez/add-cuda-support-to-primbench

MyNameIsTrez commented Mar 17, 2026 •

edited

Loading

Uh oh!

codecov-commenter commented Mar 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

MyNameIsTrez commented Mar 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation

Technical Details

Two flags were removed

Test Plan

VS Code dev containers

Setup

Running the CUDA example benchmark

Running the HIP example benchmark

Test Result

Submission Checklist

Uh oh!

codecov-commenter commented Mar 17, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

MyNameIsTrez commented Mar 17, 2026 •

edited

Loading