Skip to content

Commit 537e0bb

Browse files
committed
Pushing the docs to dev/ for branch: main, commit ad825d46ebeba0105cc5d1e319409d7bdf2ea550
1 parent 42773f0 commit 537e0bb

File tree

96 files changed

+50549
-50456
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

96 files changed

+50549
-50456
lines changed

dev/CHANGES.html

Lines changed: 5 additions & 13 deletions
Original file line numberDiff line numberDiff line change
@@ -560,13 +560,6 @@ <h3>Bug fixes<a class="headerlink" href="#bug-fixes" title="Link to this heading
560560
[here](<a class="github reference external" href="https://github.com/matplotlib/matplotlib/issues/25041">matplotlib/matplotlib#25041</a>).</p></li>
561561
</ul>
562562
</section>
563-
<section id="maintenance">
564-
<h3>Maintenance<a class="headerlink" href="#maintenance" title="Link to this heading">#</a></h3>
565-
<ul class="simple">
566-
<li><p>Make <cite>skrub</cite> compatible with scikit-learn 1.6.
567-
<a class="reference external" href="https://github.com/skrub-data/skrub/pull/1135">#1135</a> by <a class="reference external" href="https://github.com/glemaitre">Guillaume Lemaitre</a>.</p></li>
568-
</ul>
569-
</section>
570563
</section>
571564
<section id="release-0-4-0">
572565
<h2>Release 0.4.0<a class="headerlink" href="#release-0-4-0" title="Link to this heading">#</a></h2>
@@ -826,7 +819,7 @@ <h2>skrub release 0.1.0<a class="headerlink" href="#skrub-release-0-1-0" title="
826819
<h3>Major changes<a class="headerlink" href="#id10" title="Link to this heading">#</a></h3>
827820
<ul class="simple">
828821
<li><p><code class="xref py py-class docutils literal notranslate"><span class="pre">TargetEncoder</span></code> has been removed in favor of
829-
<a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.TargetEncoder.html#sklearn.preprocessing.TargetEncoder" title="(in scikit-learn v1.5)"><code class="xref py py-class docutils literal notranslate"><span class="pre">sklearn.preprocessing.TargetEncoder</span></code></a>, available since scikit-learn 1.3.</p></li>
822+
<a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.TargetEncoder.html#sklearn.preprocessing.TargetEncoder" title="(in scikit-learn v1.6)"><code class="xref py py-class docutils literal notranslate"><span class="pre">sklearn.preprocessing.TargetEncoder</span></code></a>, available since scikit-learn 1.3.</p></li>
830823
<li><p><a class="reference internal" href="reference/generated/skrub.Joiner.html#skrub.Joiner" title="skrub.Joiner"><code class="xref py py-class docutils literal notranslate"><span class="pre">Joiner</span></code></a> and <a class="reference internal" href="reference/generated/skrub.fuzzy_join.html#skrub.fuzzy_join" title="skrub.fuzzy_join"><code class="xref py py-func docutils literal notranslate"><span class="pre">fuzzy_join()</span></code></a> support several ways of rescaling
831824
distances; <code class="docutils literal notranslate"><span class="pre">match_score</span></code> has been replaced by <code class="docutils literal notranslate"><span class="pre">max_dist</span></code>; bugs which
832825
prevented the Joiner to consistently vectorize inputs and accept or reject
@@ -1077,8 +1070,8 @@ <h3>Major changes<a class="headerlink" href="#id18" title="Link to this heading"
10771070
<li><p>The <a class="reference internal" href="reference/generated/skrub.TableVectorizer.html#skrub.TableVectorizer" title="skrub.TableVectorizer"><code class="xref py py-class docutils literal notranslate"><span class="pre">TableVectorizer</span></code></a> has seen some major improvements and bug fixes:</p>
10781071
<ul class="simple">
10791072
<li><p>Fixes the automatic casting logic in <code class="docutils literal notranslate"><span class="pre">transform</span></code>.</p></li>
1080-
<li><p>To avoid dimensionality explosion when a feature has two unique values, the default encoder (<a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html#sklearn.preprocessing.OneHotEncoder" title="(in scikit-learn v1.5)"><code class="xref py py-class docutils literal notranslate"><span class="pre">OneHotEncoder</span></code></a>) now drops one of the two vectors (see parameter <cite>drop=”if_binary”</cite>).</p></li>
1081-
<li><p><code class="docutils literal notranslate"><span class="pre">fit_transform</span></code> and <code class="docutils literal notranslate"><span class="pre">transform</span></code> can now return unencoded features, like the <a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer" title="(in scikit-learn v1.5)"><code class="xref py py-class docutils literal notranslate"><span class="pre">ColumnTransformer</span></code></a>’s behavior. Previously, a <code class="docutils literal notranslate"><span class="pre">RuntimeError</span></code> was raised.</p></li>
1073+
<li><p>To avoid dimensionality explosion when a feature has two unique values, the default encoder (<a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.OneHotEncoder.html#sklearn.preprocessing.OneHotEncoder" title="(in scikit-learn v1.6)"><code class="xref py py-class docutils literal notranslate"><span class="pre">OneHotEncoder</span></code></a>) now drops one of the two vectors (see parameter <cite>drop=”if_binary”</cite>).</p></li>
1074+
<li><p><code class="docutils literal notranslate"><span class="pre">fit_transform</span></code> and <code class="docutils literal notranslate"><span class="pre">transform</span></code> can now return unencoded features, like the <a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer" title="(in scikit-learn v1.6)"><code class="xref py py-class docutils literal notranslate"><span class="pre">ColumnTransformer</span></code></a>’s behavior. Previously, a <code class="docutils literal notranslate"><span class="pre">RuntimeError</span></code> was raised.</p></li>
10821075
</ul>
10831076
<p><a class="reference external" href="https://github.com/skrub-data/skrub/pull/300">#300</a> by <a class="reference external" href="https://github.com/LilianBoulard">Lilian Boulard</a></p>
10841077
</li>
@@ -1139,7 +1132,7 @@ <h3>Major changes<a class="headerlink" href="#id20" title="Link to this heading"
11391132
<li><dl class="simple">
11401133
<dt>Improvements to the <a class="reference internal" href="reference/generated/skrub.MinHashEncoder.html#skrub.MinHashEncoder" title="skrub.MinHashEncoder"><code class="xref py py-class docutils literal notranslate"><span class="pre">MinHashEncoder</span></code></a></dt><dd><ul class="simple">
11411134
<li><p>It is now possible to fit multiple columns simultaneously with the <a class="reference internal" href="reference/generated/skrub.MinHashEncoder.html#skrub.MinHashEncoder" title="skrub.MinHashEncoder"><code class="xref py py-class docutils literal notranslate"><span class="pre">MinHashEncoder</span></code></a>.
1142-
Very useful when using for instance the <a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.make_column_transformer.html#sklearn.compose.make_column_transformer" title="(in scikit-learn v1.5)"><code class="xref py py-func docutils literal notranslate"><span class="pre">make_column_transformer()</span></code></a> function,
1135+
Very useful when using for instance the <a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.make_column_transformer.html#sklearn.compose.make_column_transformer" title="(in scikit-learn v1.6)"><code class="xref py py-func docutils literal notranslate"><span class="pre">make_column_transformer()</span></code></a> function,
11431136
on multiple columns.</p></li>
11441137
</ul>
11451138
</dd>
@@ -1243,7 +1236,7 @@ <h3>Major changes<a class="headerlink" href="#id25" title="Link to this heading"
12431236
<li><p><a class="reference internal" href="reference/generated/skrub.TableVectorizer.html#skrub.TableVectorizer" title="skrub.TableVectorizer"><code class="xref py py-class docutils literal notranslate"><span class="pre">TableVectorizer</span></code></a>: Added automatic transform through the
12441237
<a class="reference internal" href="reference/generated/skrub.TableVectorizer.html#skrub.TableVectorizer" title="skrub.TableVectorizer"><code class="xref py py-class docutils literal notranslate"><span class="pre">TableVectorizer</span></code></a> class. It transforms
12451238
columns automatically based on their type. It provides a replacement
1246-
for scikit-learn’s <a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer" title="(in scikit-learn v1.5)"><code class="xref py py-class docutils literal notranslate"><span class="pre">ColumnTransformer</span></code></a> simpler to use on heterogeneous
1239+
for scikit-learn’s <a class="reference external" href="https://scikit-learn.org/stable/modules/generated/sklearn.compose.ColumnTransformer.html#sklearn.compose.ColumnTransformer" title="(in scikit-learn v1.6)"><code class="xref py py-class docutils literal notranslate"><span class="pre">ColumnTransformer</span></code></a> simpler to use on heterogeneous
12471240
pandas DataFrame. <a class="reference external" href="https://github.com/skrub-data/skrub/pull/167">#167</a> by <a class="reference external" href="https://github.com/LilianBoulard">Lilian Boulard</a></p></li>
12481241
<li><p><strong>Backward incompatible change to</strong> <a class="reference internal" href="reference/generated/skrub.GapEncoder.html#skrub.GapEncoder" title="skrub.GapEncoder"><code class="xref py py-class docutils literal notranslate"><span class="pre">GapEncoder</span></code></a>: The <a class="reference internal" href="reference/generated/skrub.GapEncoder.html#skrub.GapEncoder" title="skrub.GapEncoder"><code class="xref py py-class docutils literal notranslate"><span class="pre">GapEncoder</span></code></a> now only
12491242
supports two-dimensional inputs of shape (n_samples, n_features).
@@ -1398,7 +1391,6 @@ <h2>Dirty-cat Release 0.0.5<a class="headerlink" href="#dirty-cat-release-0-0-5"
13981391
<li class="toc-h3 nav-item toc-entry"><a class="reference internal nav-link" href="#new-features">New features</a></li>
13991392
<li class="toc-h3 nav-item toc-entry"><a class="reference internal nav-link" href="#id1">Changes</a></li>
14001393
<li class="toc-h3 nav-item toc-entry"><a class="reference internal nav-link" href="#bug-fixes">Bug fixes</a></li>
1401-
<li class="toc-h3 nav-item toc-entry"><a class="reference internal nav-link" href="#maintenance">Maintenance</a></li>
14021394
</ul>
14031395
</li>
14041396
<li class="toc-h2 nav-item toc-entry"><a class="reference internal nav-link" href="#release-0-4-0">Release 0.4.0</a><ul class="nav section-nav flex-column">
Binary file not shown.
Binary file not shown.
0 Bytes
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.
Binary file not shown.

0 commit comments

Comments
 (0)