Releases: isaacus-dev/semchunk
Releases · isaacus-dev/semchunk
v1.0.1
Fixed
- Documented the
progressargument in the docstring forchunkerify()and its type hint in the README.
v1.0.0
Added
- Added a
progressargument to the chunker returned bychunkerify()that, when set toTrueand multiple texts are passed, displays a progress bar.
v0.3.2
v0.3.1
Fixed
- Fixed typo in error messages in
chunkerify()where it was referred to asmake_chunker().
v0.3.0
Added
- Introduced the
chunkerify()function, which constructs a chunker from a tokenizer or token counter that can be reused and can also chunk multiple texts in a single call. The resulting chunker speeds up chunking by 40.4% thanks, in large part, to a token counter that avoid having to count the number of tokens in a text when the number of characters in the text exceed a certain threshold, courtesy of @R0bk (#3) (337a186).
v0.2.4
v0.2.3
Fixed
- Ensured that memoization does not overwrite
chunk()'s function signature.
v0.2.2
Fixed
- Ensured that the
memoizeargument is passed back tochunk()in recursive calls.
v0.2.1
Added
- Memoized
chunk().
Fixed
- Fixed typos in README.
v0.2.0
Added
- Added the
memoizeargument tochunk(), which memoizes token counters by default to significantly improve performance.
Changed
- Improved chunking performance.