Qualcomm AI Engine Direct - fix suite op #15162
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/15162
Note: Links to docs will display an error until the docs builds have been completed.
❗ 1 Active SEV
❌ 1 New Failure
As of commit 9e6d72d with merge base 72b1fa1, 1 job has a new failure.
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Hi @cccclai, if the framework change makes sense to you, I still need some time to test all the decoder models thoroughly, since the annotation change for concat will somewhat affect the accuracy of kv_cache. I will ping you again once all of them are well tested.
993b9cb to 3632239
It seems like some tests are failing.
- fix annotation logic for non-arithmetic op
- partition out unsupported embedding_bag op
- use calibration input in suite test when verifying quantized op
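The last item, verifying a quantized op with the calibration input, can be motivated with a toy sketch in plain Python. This is not ExecuTorch or QNN code: `calibrate`, `fake_quantize`, the unsigned 8-bit range, and the two input ranges are all hypothetical and chosen only for illustration. It assumes simple min/max affine quantization and shows that an input outside the calibrated range is clipped, which inflates the error for reasons unrelated to the op under test.

```python
import random


def calibrate(values, num_bits=8):
    """Derive affine quantization parameters from observed values (min/max)."""
    qmin, qmax = 0, 2 ** num_bits - 1
    # Make sure zero is representable, as affine quantizers usually require.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    scale = (hi - lo) / (qmax - qmin) or 1.0  # fall back to 1.0 for all-zero data
    zero_point = round(qmin - lo / scale)
    return scale, zero_point, qmin, qmax


def fake_quantize(values, scale, zero_point, qmin, qmax):
    """Quantize then dequantize; values outside the calibrated range are clipped."""
    out = []
    for v in values:
        q = max(qmin, min(qmax, round(v / scale) + zero_point))
        out.append((q - zero_point) * scale)
    return out


def max_abs_error(reference, actual):
    return max(abs(r - a) for r, a in zip(reference, actual))


rng = random.Random(0)
# Hypothetical data: the quantizer only ever observes values in [-1, 1].
calibration_input = [rng.uniform(-1.0, 1.0) for _ in range(256)]
# A freshly generated test input with a wider range than what was calibrated.
unrelated_input = [rng.uniform(-4.0, 4.0) for _ in range(256)]

params = calibrate(calibration_input)
err_calibrated = max_abs_error(
    calibration_input, fake_quantize(calibration_input, *params)
)
err_unrelated = max_abs_error(
    unrelated_input, fake_quantize(unrelated_input, *params)
)

print("max error on calibration input:", err_calibrated)
print("max error on unrelated input:  ", err_unrelated)
```

On the calibration input the error stays within one quantization step, while the wider input is dominated by clipping error. Reusing the calibration input for verification keeps the comparison focused on the quantized op rather than on range mismatch.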
05f1e41 to bb806e6
Summary
Test plan
e.g. pytest --disable-warnings -c /dev/null backends/test/suite/ -k "test_split_size_dimensions[qnn_16a16w]"
cc @cccclai @cbilgin