-
Notifications
You must be signed in to change notification settings - Fork 741
QNN Llama Runner implement IRunner #13171
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
🔗 Helpful Links🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13171
Note: Links to docs will display an error until the docs builds have been completed. ❌ 2 New Failures, 1 PendingAs of commit 484a9f0 with merge base 6a875f9 ( NEW FAILURES - The following jobs have failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes. |
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
This PR needs a
|
ed495fc to
0fc31e2
Compare
Summary: This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Differential Revision: D79759817
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
shewu-quic
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. Thank you for the efforts. It is really awesome 👍
cccclai
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me, also stamp on behalf of qcom team
0fc31e2 to
07fce7f
Compare
Summary: This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Reviewed By: cccclai Differential Revision: D79759817
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
Summary: This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Reviewed By: cccclai Differential Revision: D79759817
07fce7f to
991dd98
Compare
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
991dd98 to
ce2893c
Compare
Summary: This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Reviewed By: cccclai Differential Revision: D79759817
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
Summary: Pull Request resolved: pytorch#13171 This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Reviewed By: cccclai Differential Revision: D79759817
ce2893c to
e0cfcbd
Compare
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
e0cfcbd to
533247c
Compare
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
533247c to
ebb9c3a
Compare
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
Summary: Pull Request resolved: pytorch#13171 This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models with QNN backend inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Reviewed By: cccclai Differential Revision: D79759817
ebb9c3a to
938b552
Compare
Summary: This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Reviewed By: cccclai Differential Revision: D79759817
938b552 to
5b377a2
Compare
|
This pull request was exported from Phabricator. Differential Revision: D79759817 |
Summary: This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Reviewed By: cccclai Differential Revision: D79759817
Reviewed By: cccclai
5cabbd1 to
484a9f0
Compare
|
@rohansjoshi has imported this pull request. If you are a Meta employee, you can view this in D79759817. |
|
@rohansjoshi has imported this pull request. If you are a Meta employee, you can view this in D79759817. |
| // A llama 3.2 runner that includes preprocessing and post processing | ||
| // logic. The module takes in a string as input and emits a string as output. | ||
|
|
||
| #include <executorch/examples/models/llama/runner/runner.h> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why do you need this particular runner?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm only using the function load_llama_tokenizer from executorch/examples/models/llama/runner/runner, not the runner there. I'm trying to reuse code from examples/models
Summary: This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface Using this, enable running static Llama models inside LlamaDemo Android app Switched default eval mode to hybrid everywhere Differential Revision: D79759817
Summary:
This PR makes the Runner for running Qualcomm LlamaModels implement the IRunner interface
Using this, enable running static Llama models inside LlamaDemo Android app
Switched default eval mode to hybrid everywhere
Differential Revision: D79759817