Integrate PowerInfer2 #8184
BinaryQuantumSoul
started this conversation in
Ideas
Replies: 1 comment 1 reply
-
@BinaryQuantumSoul are you aware of an implementation? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
This is the new paper for fast llm inference on smartphones
https://arxiv.org/abs/2406.06282
Beta Was this translation helpful? Give feedback.
All reactions