Update benchmark_step.jl for CUDA benchmarking with useful kernel names
#4055
base: main
Conversation
Review thread on the Buildkite pipeline change:

```yaml
- group: "Reproducibility infrastructure"
  steps:
```
These changes were made by a YAML auto-formatter in VS Code. Is there a style guide I might be breaking here?
I'm not sure... this is something I have been wondering as well. I considered following this example, which is used in Buildkite's docs.
@dennisYatunin @imreddyTeja thoughts on the timeouts here? Should I increase the limit or disable kernel renaming?
What is the advantage of using kernel renaming in climaatmos-ci for Buildkite steps that don't profile? Is the idea to run the profiler at the end of each simulation? Comparing the Buildkite build for this PR to main's last run shows a ~40% slowdown, but everything after the first step doesn't seem to be affected. I'm not sure that cost is worth it at the moment.
I believe that was Dennis's vision, and then we'd process and summarize all of the profiling results together in a later step.
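To make that two-phase idea concrete, here is a rough sketch: each simulation job dumps a short kernel-timing report at the end, and a later pipeline step collects and prints all of them together. Everything below (the helper names, the output directory, the use of `CUDA.@profile` for the report) is a hypothetical illustration of the workflow, not code from this PR or from ClimaCore.jl#2376.

```julia
using CUDA

# Hypothetical: called once at the end of a simulation job. Runs a single
# profiled step and writes a human-readable kernel report for a later
# summary step to pick up.
function write_profile_report(step!, integrator; outdir = "profile_reports", jobname = "job")
    mkpath(outdir)
    step!(integrator)                       # warm up so kernels are compiled
    prof = CUDA.@profile step!(integrator)  # integrated CUDA.jl profiler
    open(joinpath(outdir, "$(jobname).txt"), "w") do io
        show(io, MIME("text/plain"), prof)
    end
end

# Hypothetical later pipeline step: concatenate all per-job reports into
# one summary that can be inspected in a single Buildkite log.
function summarize_profiles(outdir = "profile_reports")
    for file in sort(readdir(outdir; join = true))
        println("==== ", basename(file), " ====")
        println(read(file, String))
    end
end
```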
This uses CliMA/ClimaCore.jl#2376 to provide more useful CUDA kernel names in benchmarks.
TODO
benchmark.jl, not benchmark_step.jl?
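For context on where the renamed kernels would show up, here is a generic sketch of a GPU step benchmark: time one `step!` call, then run the integrated profiler so the per-kernel rows (now carrying the descriptive names from CliMA/ClimaCore.jl#2376) can be attributed to specific operators. The `step!`/`simulation` arguments are placeholders, not the actual contents of benchmark_step.jl.

```julia
using CUDA
using BenchmarkTools

# Sketch of a step benchmark; `step!` and `simulation` stand in for
# whatever benchmark_step.jl actually sets up.
function benchmark_step(step!, simulation)
    step!(simulation)  # compile everything before timing
    trial = @benchmark CUDA.@sync $step!($simulation)
    display(trial)
    # With descriptive kernel names, the per-kernel rows in this report
    # become attributable to the operators that launched them.
    prof = CUDA.@profile step!(simulation)
    display(prof)
    return trial
end
```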