Qualcomm AI Engine Direct - Refactor calibration flow #13150

shewu-quic · 2025-08-06T07:07:51Z

Summary:

Update calibration flow to enhance the speed of wikitext calibration

cc: @haowhsu-quic , @winskuo-quic

pytorch-bot · 2025-08-06T07:07:55Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13150

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure

As of commit c7700e6 with merge base c8a0706:

NEW FAILURE - The following job has failed:

Build documentation / build (buck2) / Build doc (gh)
At least one of the pre-conditions you specified did not hold

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2025-08-06T07:08:31Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

shewu-quic · 2025-08-06T07:09:18Z

Hi @cccclai,
This PR speeds up calibration with wikitext by processing the data one AR-N chunk at a time instead of token by token, as the original flow did.
Could you please help take a look?
Thanks
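To illustrate the idea of chunk-by-chunk calibration, here is a minimal sketch. It is not the code from this PR: `iter_chunks`, `calibrate`, and the `forward` callback are hypothetical names, and the token IDs are placeholder data. It only shows how feeding `ar_len` tokens per call reduces the number of forward passes compared with feeding one token per call.

```python
from typing import Callable, Iterator, List


def iter_chunks(token_ids: List[int], ar_len: int) -> Iterator[List[int]]:
    """Yield consecutive chunks of at most ar_len tokens (hypothetical helper)."""
    if ar_len < 1:
        raise ValueError("ar_len must be >= 1")
    for start in range(0, len(token_ids), ar_len):
        yield token_ids[start:start + ar_len]


def calibrate(
    token_ids: List[int],
    ar_len: int,
    forward: Callable[[List[int]], None],
) -> int:
    """Call forward once per chunk and return the number of calls made."""
    calls = 0
    for chunk in iter_chunks(token_ids, ar_len):
        forward(chunk)  # stand-in for one model forward pass
        calls += 1
    return calls


tokens = list(range(10))  # placeholder token IDs, not real wikitext data
seen: List[List[int]] = []
n_calls = calibrate(tokens, ar_len=4, forward=seen.append)
print(n_calls, seen)  # 3 [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]
```

With `ar_len=1` the same loop degenerates to one call per token, which corresponds to the token-by-token behavior described above.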

facebook-github-bot · 2025-08-10T04:56:28Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this in D79958818.

cccclai · 2025-08-10T04:56:48Z

There seems to be a merge conflict. Also, does the flow break evaluate_qnn_llama?


shewu-quic · 2025-08-11T05:57:49Z

> There seem to be merge conflict. Also, does the flow break evaluate_qnn_llama?

Done, thanks. No, it should speed up evaluation for the AR-N model. For AR-1 (KV mode), the speed should stay the same.
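As a rough back-of-the-envelope check on that claim, the number of forward passes over a sequence can be estimated as the token count divided by the AR length, rounded up. The function name and the sequence length and AR length values below are illustrative assumptions, not figures from this PR.

```python
import math


def forward_passes(num_tokens: int, ar_len: int) -> int:
    """Estimate forward passes needed to consume num_tokens, ar_len tokens at a time."""
    if num_tokens < 0 or ar_len < 1:
        raise ValueError("num_tokens must be >= 0 and ar_len must be >= 1")
    return math.ceil(num_tokens / ar_len)


# Illustrative values only: a 2048-token sequence.
passes_ar128 = forward_passes(2048, ar_len=128)  # AR-N with N = 128
passes_ar1 = forward_passes(2048, ar_len=1)      # AR-1 (KV mode)
print(passes_ar128, passes_ar1)  # 16 2048
```

For AR-1 the count equals the number of tokens under either flow, which is consistent with the statement that AR-1 should see no change.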

facebook-github-bot · 2025-08-11T17:19:52Z

@cccclai has imported this pull request. If you are a Meta employee, you can view this in D79958818.


shewu-quic requested a review from cccclai as a code owner August 6, 2025 07:07

meta-cla bot added the CLA Signed label Aug 6, 2025

Qualcomm AI Engine Direct - Refactor calibration flow

c7700e6

Summary: - Update calibration flow to enhance the speed of wikitext calibration

shewu-quic force-pushed the dev1/hutton/refactor_calibration branch from 829bcbc to c7700e6 Compare August 11, 2025 05:56

cccclai approved these changes Aug 11, 2025


cccclai merged commit 18030f9 into pytorch:main Aug 11, 2025
101 of 103 checks passed

agrima1304 pushed a commit to agrima1304/executorch that referenced this pull request Aug 26, 2025

Qualcomm AI Engine Direct - Refactor calibration flow (pytorch#13150)

83d13c2

Summary: - Update calibration flow to enhance the speed of wikitext calibration cc: @haowhsu-quic , @winskuo-quic
