performance variation when measuring the perf with "do_bench()" in triton.testing #807
Unanswered
stephen-youn asked this question in Q&A
Replies: 0 comments
Hi All,
I have a question about what could cause the variation we see in measured performance when we run do_bench and time the kernel through torch.cuda.Event (link).
I also wonder whether the code needs an extra torch.cuda.synchronize() after every end_event record (link); see the sketch below for what I mean.
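To make that concrete, here is a minimal sketch of the timing pattern as I understand it (simplified from do_bench; `fn` stands in for the benchmarked kernel, and the L2-cache clearing between runs is omitted):

```python
import torch

def time_kernel(fn, n_repeat=100):
    # one (start, end) event pair per measured iteration
    starts = [torch.cuda.Event(enable_timing=True) for _ in range(n_repeat)]
    ends = [torch.cuda.Event(enable_timing=True) for _ in range(n_repeat)]
    for i in range(n_repeat):
        starts[i].record()
        fn()
        ends[i].record()
        # would a torch.cuda.synchronize() be needed here, after each
        # end-event record, or is the single sync below sufficient?
    torch.cuda.synchronize()  # one sync at the end drains the whole stream
    return [s.elapsed_time(e) for s, e in zip(starts, ends)]
```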
The other question is how the "5" was chosen as the initial number of runs used to estimate n_warmup and n_repeat, and what the reasoning behind the formula is.
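For reference, this is roughly how I read that part of the code (a sketch, not the exact source; the variable names follow do_bench):

```python
import torch

def estimate_counts(fn, warmup=25, rep=100):
    # time 5 back-to-back runs to get a rough per-run estimate
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(5):  # the hard-coded 5 in question
        fn()
    end.record()
    torch.cuda.synchronize()
    estimate_ms = start.elapsed_time(end) / 5
    # convert the warmup/rep time budgets (in ms) into iteration counts
    n_warmup = max(1, int(warmup / estimate_ms))
    n_repeat = max(1, int(rep / estimate_ms))
    return n_warmup, n_repeat
```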
thanks