Verify that benchmark tests meet expected performance thresholds #8574

kenzieschmoll · 2024-11-28T01:03:27Z

kenzieschmoll · 2024-12-02T21:04:57Z

The WASM results are running much slower on the bots than locally. Locally, the benchmark tests pass and meet the expected thresholds, which are set to 16666 micros (16.6 ms is what is required for performant rendering on a 60 FPS device). On the CI the test is failing:

  [WASM Benchmarks] The following benchmark scores exceeded their expected thresholds:
  
  [devtools_offlineCpuProfilerScreen.wasm] flutter_frame.total_time.average was 35744.85030395137 μs, which exceeded the expected threshold, 16666.0 μs.
  [devtools_offlineCpuProfilerScreen.wasm] flutter_frame.total_time.p50 was 54035.0 μs, which exceeded the expected threshold, 16666.0 μs.
  [devtools_offlineCpuProfilerScreen.wasm] flutter_frame.total_time.p90 was 59095.0 μs, which exceeded the expected threshold, 16666.0 μs.
  [devtools_offlinePerformanceScreen.wasm] flutter_frame.total_time.p90 was 28900.0 μs, which exceeded the expected threshold, 16666.0 μs.

@eyebrowsoffire @yjbanov any idea what would cause WASM to be significantly slower on the CI? The JS benchmarks seem to be consistent, or at least they are not exceeding the 16666 micros threshold for any of the metrics (on the CI and locally).

yjbanov · 2024-12-02T23:44:09Z

Does the CI use the same hardware+OS combination as the local device you use for benchmarking? A lot depends on the hardware the benchmark runs on. As for why the JS is consistent and Wasm is not, I'm not sure. Someone explained to me in the past that V8 runs in different modes for different situations. For example, when you have DevTools open, V8 switches into a mode that supports debugging, which changes the performance characteristics of the code.

kenzieschmoll · 2024-12-05T20:56:52Z

@yjbanov looks like switching the action to run on macos-latest instead of ubuntu-latest did the trick. Thanks!

kenzieschmoll added 3 commits November 27, 2024 16:22

Verify that benchmark tests meet expected performance thresholds

6db1db0

formatting

d9dbbdb

add todo to address before landing

41e4aee

kenzieschmoll requested a review from a team as a code owner November 28, 2024 01:03

kenzieschmoll requested review from elliette and removed request for a team November 28, 2024 01:03

kenzieschmoll added the release-notes-not-required label Nov 28, 2024

This comment was marked as off-topic.

Sign in to view

add 'wasm' or 'js' to identifier.

be02ddc

elliette approved these changes Dec 2, 2024

View reviewed changes

kenzieschmoll added 3 commits December 2, 2024 12:26

Merge branch 'master' of github.com:flutter/devtools into benchmarks

d7fc774

polish

e10f8a6

retry once

242fd4b

kenzieschmoll added 2 commits December 2, 2024 13:08

formatting

99a4ce1

use a const

4df7b47

try to run the performance benchmarks on macos

d9d52d3

remove TODO

868d8d2

kenzieschmoll merged commit aad5c0c into flutter:master Dec 5, 2024
24 checks passed

kenzieschmoll deleted the benchmarks branch December 5, 2024 21:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Verify that benchmark tests meet expected performance thresholds #8574

Verify that benchmark tests meet expected performance thresholds #8574

Uh oh!

kenzieschmoll commented Nov 28, 2024 •

edited

Loading

Uh oh!

This comment was marked as off-topic.

kenzieschmoll commented Dec 2, 2024 •

edited

Loading

Uh oh!

yjbanov commented Dec 2, 2024

Uh oh!

kenzieschmoll commented Dec 5, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Verify that benchmark tests meet expected performance thresholds #8574

Verify that benchmark tests meet expected performance thresholds #8574

Uh oh!

Conversation

kenzieschmoll commented Nov 28, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

This comment was marked as off-topic.

kenzieschmoll commented Dec 2, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yjbanov commented Dec 2, 2024

Uh oh!

kenzieschmoll commented Dec 5, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

kenzieschmoll commented Nov 28, 2024 •

edited

Loading

kenzieschmoll commented Dec 2, 2024 •

edited

Loading