✨ skrub version 0.7.2 has been released ✨
In this release we squashed more bugs, improved the API reference, and added a new example.
Main changes:
- The StringEncoder now exposes the vocabulary parameter, so it can be passed to the underlying TfidfVectorizer.
- The function compute_ngram_distance was made private to reduce clutter.
- The repository wheel has been made smaller by removing some material used for benchmarking.
What's Changed
- Rename compute_ngram_distance to _compute_ngram_distance by @siddharthbaleja7 in #1838
- Expose "vocabulary" parameter to "StringEncoder" by @emassoulie in #1819
- Included
size on diskon the california housing dataset by @Faith-Nchifor in #1854 - DOC - new example for using PyTorch w/ DataOps via skorch by @e-strauss in #1880
- Moved benchmarks to a separate repo by @rcap107 in #1893
New Contributors
- @Soham-47 made their first contribution in #1808
- @Faith-Nchifor made their first contribution in #1811
- @siddharthbaleja7 made their first contribution in #1838
- @Eshaan-byte made their first contribution in #1839
- @techy4shri made their first contribution in #1820
- @PredictiveManish made their first contribution in #1858
- @josselinonduty made their first contribution in #1826
- @sabasiddique1 made their first contribution in #1888
- @e-strauss made their first contribution in #1880
Full Changelog: 0.7.1...0.7.2