Split core test group to minicore, mxfp, scaled-dot #3993

pbchekin · 2025-04-23T19:09:08Z

Adds 3 new command line options to scripts/test-triton.sh: --minicore, --mxfp, --scaled_dot.

The semantic of --core is not changed: it should execute the same tests as before.
Additionally, --minicore should be much faster than --core, and the remaining tests from core group can be executed separately with --mxfp --scaled_dot.

Required for #3976.

This caching seems to be responsible for some CUDA OOMs we encountered in Meta-internal builds. I haven't got a reduced repro, but this change does seem to fix things. My hypothesis is that the cached stream is causing the memory allocated for the graph to be retained.

pbchekin requested review from anmyachev, gshimansky, kwasd and whitneywhtsang April 23, 2025 19:09

kwasd approved these changes Apr 23, 2025

View reviewed changes

gshimansky approved these changes Apr 23, 2025

View reviewed changes

anmyachev approved these changes Apr 23, 2025

View reviewed changes

pbchekin force-pushed the split-core branch from c734039 to c690c94 Compare April 23, 2025 20:42

whitneywhtsang approved these changes Apr 23, 2025

View reviewed changes

pbchekin added 2 commits April 23, 2025 18:20

Improve help, default mode

bc697fc

Split core to minicore, mxfp, scaled_dot

f623845

pbchekin force-pushed the split-core branch from ec2d154 to f623845 Compare April 24, 2025 01:20

pbchekin merged commit 0cf724f into main Apr 24, 2025
4 of 5 checks passed

pbchekin deleted the split-core branch April 24, 2025 02:32

pbchekin mentioned this pull request Apr 26, 2025

[CI] Ideas to reduce PR build and test time #3820

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Split core test group to minicore, mxfp, scaled-dot #3993

Split core test group to minicore, mxfp, scaled-dot #3993

Uh oh!

pbchekin commented Apr 23, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Split core test group to minicore, mxfp, scaled-dot #3993

Split core test group to minicore, mxfp, scaled-dot #3993

Uh oh!

Conversation

pbchekin commented Apr 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

pbchekin commented Apr 23, 2025 •

edited

Loading