install bitnet (or other cpu models) on a fresh termux aarch64 #401
Replies: 5 comments 23 replies
-
What is Termux?
-
Using the built-in llama-server standard template and pasting that into the prompt template field to get the correct chat format: {{history}}
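For anyone unsure what a `{{history}}` placeholder does, here is a minimal sketch of how such a placeholder could be filled in before the text reaches the model. This is not llama-server's actual code. The template string, the speaker names, and the `fill_template` helper are all hypothetical and only there for illustration.

```python
import re


def fill_template(template: str, values: dict[str, str]) -> str:
    """Replace each {{name}} placeholder with values[name].

    Placeholders with no matching key are left untouched.
    """
    def substitute(match: re.Match) -> str:
        key = match.group(1)
        return values.get(key, match.group(0))

    return re.sub(r"\{\{(\w+)\}\}", substitute, template)


# Hypothetical template and conversation turns (assumptions, not the thread's template).
template = "{{prompt}}\n\n{{history}}\nAssistant:"
turns = [("User", "Hello"), ("Assistant", "Hi there"), ("User", "What is Termux?")]

# Flatten the turns into one "Name: message" line each.
history = "\n".join(f"{name}: {message}" for name, message in turns)

filled = fill_template(
    template,
    {"prompt": "You are a helpful assistant.", "history": history},
)
print(filled)
```

If the model was trained on a different chat format, the line layout built for `history` would need to match that format, which is likely why a wrong template can lead to poor output.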
-
Didn't work in my case. It stayed hung at compilation forever.
-
You can now disable building the templated flash attention (FA) kernels. Disabling FA should massively improve build times. See PR #429.
-
There is now PR #435, which significantly reduces build time. I cannot test on Android myself, so I would appreciate it if someone did and reported back.
-
Just for convenience, here are all the commands, in order, to install bitnet (or other CPU models) on a fresh Termux aarch64:
The prompt template for the model in the chat UI at 127.0.0.1:8080 should be:
Thanks for the help @ikawrakow @RobertAgee @saood06
Edit: it sometimes produces nonsense output, so I reverted to the old prompt template.