You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
<h3>Major changes<aclass="headerlink" href="#id10" title="Link to this heading">#</a></h3>
827
820
<ulclass="simple">
828
821
<li><p><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">TargetEncoder</span></code> has been removed in favor of
829
-
<aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.TargetEncoder.html#sklearn.preprocessing.TargetEncoder" title="(in scikit-learn v1.5)"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">sklearn.preprocessing.TargetEncoder</span></code></a>, available since scikit-learn 1.3.</p></li>
822
+
<aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.TargetEncoder.html#sklearn.preprocessing.TargetEncoder" title="(in scikit-learn v1.6)"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">sklearn.preprocessing.TargetEncoder</span></code></a>, available since scikit-learn 1.3.</p></li>
830
823
<li><p><aclass="reference internal" href="reference/generated/skrub.Joiner.html#skrub.Joiner" title="skrub.Joiner"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">Joiner</span></code></a> and <aclass="reference internal" href="reference/generated/skrub.fuzzy_join.html#skrub.fuzzy_join" title="skrub.fuzzy_join"><codeclass="xref py py-func docutils literal notranslate"><spanclass="pre">fuzzy_join()</span></code></a> support several ways of rescaling
831
824
distances; <codeclass="docutils literal notranslate"><spanclass="pre">match_score</span></code> has been replaced by <codeclass="docutils literal notranslate"><spanclass="pre">max_dist</span></code>; bugs which
832
825
prevented the Joiner to consistently vectorize inputs and accept or reject
@@ -1077,8 +1070,8 @@ <h3>Major changes<a class="headerlink" href="#id18" title="Link to this heading"
1077
1070
<li><p>The <aclass="reference internal" href="reference/generated/skrub.TableVectorizer.html#skrub.TableVectorizer" title="skrub.TableVectorizer"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">TableVectorizer</span></code></a> has seen some major improvements and bug fixes:</p>
1078
1071
<ulclass="simple">
1079
1072
<li><p>Fixes the automatic casting logic in <codeclass="docutils literal notranslate"><spanclass="pre">transform</span></code>.</p></li>
1080
-
<li><p>To avoid dimensionality explosion when a feature has two unique values, the default encoder (<aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html#sklearn.preprocessing.OneHotEncoder" title="(in scikit-learn v1.5)"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">OneHotEncoder</span></code></a>) now drops one of the two vectors (see parameter <cite>drop=”if_binary”</cite>).</p></li>
1081
-
<li><p><codeclass="docutils literal notranslate"><spanclass="pre">fit_transform</span></code> and <codeclass="docutils literal notranslate"><spanclass="pre">transform</span></code> can now return unencoded features, like the <aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer" title="(in scikit-learn v1.5)"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">ColumnTransformer</span></code></a>’s behavior. Previously, a <codeclass="docutils literal notranslate"><spanclass="pre">RuntimeError</span></code> was raised.</p></li>
1073
+
<li><p>To avoid dimensionality explosion when a feature has two unique values, the default encoder (<aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html#sklearn.preprocessing.OneHotEncoder" title="(in scikit-learn v1.6)"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">OneHotEncoder</span></code></a>) now drops one of the two vectors (see parameter <cite>drop=”if_binary”</cite>).</p></li>
1074
+
<li><p><codeclass="docutils literal notranslate"><spanclass="pre">fit_transform</span></code> and <codeclass="docutils literal notranslate"><spanclass="pre">transform</span></code> can now return unencoded features, like the <aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer" title="(in scikit-learn v1.6)"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">ColumnTransformer</span></code></a>’s behavior. Previously, a <codeclass="docutils literal notranslate"><spanclass="pre">RuntimeError</span></code> was raised.</p></li>
1082
1075
</ul>
1083
1076
<p><aclass="reference external" href="https://github.com/skrub-data/skrub/pull/300">#300</a> by <aclass="reference external" href="https://github.com/LilianBoulard">Lilian Boulard</a></p>
1084
1077
</li>
@@ -1139,7 +1132,7 @@ <h3>Major changes<a class="headerlink" href="#id20" title="Link to this heading"
1139
1132
<li><dlclass="simple">
1140
1133
<dt>Improvements to the <aclass="reference internal" href="reference/generated/skrub.MinHashEncoder.html#skrub.MinHashEncoder" title="skrub.MinHashEncoder"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">MinHashEncoder</span></code></a></dt><dd><ulclass="simple">
1141
1134
<li><p>It is now possible to fit multiple columns simultaneously with the <aclass="reference internal" href="reference/generated/skrub.MinHashEncoder.html#skrub.MinHashEncoder" title="skrub.MinHashEncoder"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">MinHashEncoder</span></code></a>.
1142
-
Very useful when using for instance the <aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.make_column_transformer.html#sklearn.compose.make_column_transformer" title="(in scikit-learn v1.5)"><codeclass="xref py py-func docutils literal notranslate"><spanclass="pre">make_column_transformer()</span></code></a> function,
1135
+
Very useful when using for instance the <aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.make_column_transformer.html#sklearn.compose.make_column_transformer" title="(in scikit-learn v1.6)"><codeclass="xref py py-func docutils literal notranslate"><spanclass="pre">make_column_transformer()</span></code></a> function,
1143
1136
on multiple columns.</p></li>
1144
1137
</ul>
1145
1138
</dd>
@@ -1243,7 +1236,7 @@ <h3>Major changes<a class="headerlink" href="#id25" title="Link to this heading"
1243
1236
<li><p><aclass="reference internal" href="reference/generated/skrub.TableVectorizer.html#skrub.TableVectorizer" title="skrub.TableVectorizer"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">TableVectorizer</span></code></a>: Added automatic transform through the
columns automatically based on their type. It provides a replacement
1246
-
for scikit-learn’s <aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer" title="(in scikit-learn v1.5)"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">ColumnTransformer</span></code></a> simpler to use on heterogeneous
1239
+
for scikit-learn’s <aclass="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer" title="(in scikit-learn v1.6)"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">ColumnTransformer</span></code></a> simpler to use on heterogeneous
1247
1240
pandas DataFrame. <aclass="reference external" href="https://github.com/skrub-data/skrub/pull/167">#167</a> by <aclass="reference external" href="https://github.com/LilianBoulard">Lilian Boulard</a></p></li>
1248
1241
<li><p><strong>Backward incompatible change to</strong><aclass="reference internal" href="reference/generated/skrub.GapEncoder.html#skrub.GapEncoder" title="skrub.GapEncoder"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">GapEncoder</span></code></a>: The <aclass="reference internal" href="reference/generated/skrub.GapEncoder.html#skrub.GapEncoder" title="skrub.GapEncoder"><codeclass="xref py py-class docutils literal notranslate"><spanclass="pre">GapEncoder</span></code></a> now only
1249
1242
supports two-dimensional inputs of shape (n_samples, n_features).
0 commit comments