v0.5.0 #72
benbrandt
announced in
Announcements
v0.5.0
#72
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
What's New
tokenizers
ortiktoken-rs
crates by applying binary search when attempting to find the next matching chunk size. @benbrandt and @bradfier in Use binary search in TextChunks #71Breaking Changes
tokenizers
is now0.15.0
tiktoken-rs
is now0.5.6
with_trim_chunks(true)
. It is a tradeoff, but would have made the binary search code much more complicated to keep the exact same behavior.chunk_size
method onChunkSizer
now needs to accept aChunkCapacity
argument, and return aChunkSize
struct instead of ausize
. This was to help support the new binary search method in chunking, and should only affect users who implemented customChunkSizer
s and weren't using one of the provided ones.fn chunk_size(&self, chunk: &str, capacity: &impl ChunkCapacity) -> ChunkSize;
Full Changelog: v0.4.5...v0.5.0
This discussion was created from the release v0.5.0.
Beta Was this translation helpful? Give feedback.
All reactions