Skip to content

Commit 866335a

Browse files
authored
small changes
1 parent 7b68ec2 commit 866335a

File tree

1 file changed

+3
-7
lines changed

1 file changed

+3
-7
lines changed

category_encoders/cat_boost.py

Lines changed: 3 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -17,15 +17,11 @@ class CatBoostEncoder(util.BaseEncoder, util.SupervisedTransformerMixin):
1717
time-aware encoding, regularization and online learning.
1818
1919
This implementation is time-aware (similar to CatBoos 'has_time=True'),
20-
so no random permutations are used. This makes this encoder sensitive to
20+
so no random permutations are used. It makes this encoder sensitive to
2121
ordering of the data and suitable for time series problems. If your data
2222
does not have time dependency it should still work just fine assuming
23-
sorting of the data won't leak any information.
24-
25-
Regularization (parameter a) is achieved by adding this imcrement to running counts
26-
(in this case so called pseudocounts).
27-
Regularization (parameter sigma) is achieved by adding gaussian noise to
28-
the output.
23+
sorting of the data won't leak any information outside training scope
24+
(i.e. no data leakage).
2925
3026
NOTE: behavior of the transformer would differ in transform and fit_transform
3127
methods depending if y values are passed. If no target is passed then

0 commit comments

Comments
 (0)