Fix reporting backends and dtype to benchmark results #6023
Conversation
          
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/6023

Note: links to docs will display an error until the docs builds have completed.

❌ 2 new failures as of commit 22b823e with merge base b118d8e.

This comment was automatically generated by Dr. CI and updates every 15 minutes.
    
@guangy10 has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.
    
I tested this out in #5982. Its regex takes the first word as the model name and the last as the dtype, with everything in between treated as the backend. It's not as good as having proper JSON output from the device, but I guess this will do for now.
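The parsing scheme described above can be sketched with a small regex. This is a hypothetical illustration, assuming an underscore-delimited name; it is not the actual implementation from #5982:

```python
import re

# Hypothetical "_"-delimited benchmark name: the first token is the model,
# the last token is the dtype, and everything in between is the backend.
PATTERN = re.compile(r"^(?P<model>[^_]+)_(?P<backend>.+)_(?P<dtype>[^_]+)$")

def parse_benchmark_name(name: str) -> dict:
    """Split a benchmark name into model, backend, and dtype fields."""
    m = PATTERN.match(name)
    if m is None:
        raise ValueError(f"unrecognized benchmark name: {name!r}")
    return m.groupdict()

print(parse_benchmark_name("tinyllama_xnnpack_fp32"))
# {'model': 'tinyllama', 'backend': 'xnnpack', 'dtype': 'fp32'}
```

Because the backend group stretches greedily up to the final underscore, a multi-token backend such as `xnnpack_q8` still lands entirely in the backend field.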
    
Also, I learned from https://github.com/pytorch/executorch/pull/5710/files#r1788458509 that changing the export name might cause unexpected failures because some names are hardcoded in the repo. It's a good idea to double-check them.
    
          
Yeah, I reverted the changes that renamed the exported artifacts directly and instead appended the dtype suffix in the test script. It works for now; we can clean it up later.
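The workaround described above, leaving the exported artifact name untouched and appending the dtype suffix only at reporting time, could be sketched like this. The function and names are hypothetical, not the actual test script:

```python
def reported_model_name(exported_name: str, dtype: str) -> str:
    # Append the dtype suffix only for reporting; the exported artifact
    # keeps its original (possibly hardcoded) name on disk, so nothing
    # else in the repo that refers to that name breaks.
    return f"{exported_name}_{dtype}"

print(reported_model_name("tinyllama_xnnpack", "fp32"))  # tinyllama_xnnpack_fp32
```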
    
@pytorchbot cherry-pick --onto release/0.4 -c fixnewfeature
    
Summary: A couple of minor fixes for reporting the benchmarking results:
- QNN models are not reporting "backend" and "dtype" info in benchmark_results.json (Android)
- The tinyllama model is not reporting "backend" and "dtype" info in benchmark_results.json (Android)
- Include the compute precision in the exported coreml model name
- Rename "llama2" to "tinyllama" to eliminate confusion (many people thought it was llama2-7b)

Pull Request resolved: #6023
Reviewed By: huydhn
Differential Revision: D64074262
Pulled By: guangy10
fbshipit-source-id: c6c53d004c4fb3ad410a792639af2c22a6978b67
(cherry picked from commit 012cba9)
Cherry picking #6023: the cherry-pick PR is at #6073, and it is recommended to link a fixnewfeature cherry-pick PR with an issue. The following tracker issues are updated. (Details for Dev Infra team: raised by workflow job.)
    
Fix reporting backends and dtype to benchmark results (#6023)
(cherry picked from commit 012cba9)
Co-authored-by: Guang Yang <[email protected]>