Commit ab028cb
Migrate inference to llama_batch and llama_decode api (#795)
* Add low-level batching notebook
* fix: tokenization of special characters: (#850)
It should behave like llama.cpp, where most out of the box usages
treat special characters accordingly
* Update CHANGELOG
* Cleanup
* Fix runner label
* Update notebook
* Use llama_decode and batch api
* Support logits_all parameter
---------
Co-authored-by: Antoine Lizee <[email protected]>1 parent f436e0c commit ab028cb
File tree
3 files changed
+753
-8
lines changed- examples/notebooks
- llama_cpp
- tests
3 files changed
+753
-8
lines changed
0 commit comments