Releases: scikit-learn-contrib/imbalanced-learn
0.4.2
Version 0.4.2
Bug fixes
- Fix a bug in imblearn.over_sampling.SMOTENC in which the the median of the standard deviation instead of half of the median of the standard deviation. By Guillaume Lemaitre in #491.
- Raise an error when passing target which is not supported, i.e. regression target or multilabel targets. Imbalanced-learn does not support this case. By Guillaume Lemaitre in #490.
0.4.1
Version 0.4
October, 2018
Version 0.4 is the last version of imbalanced-learn to support Python 2.7
and Python 3.4. Imbalanced-learn 0.5 will require Python 3.5 or higher.
Highlights
This release brings its set of new feature as well as some API changes to
strengthen the foundation of imbalanced-learn.
As new feature, 2 new modules imblearn.keras and
imblearn.tensorflow have been added in which imbalanced-learn samplers
can be used to generate balanced mini-batches.
The module imblearn.ensemble has been consolidated with new classifier:
imblearn.ensemble.BalancedRandomForestClassifier,
imblearn.ensemble.EasyEnsembleClassifier,
imblearn.ensemble.RUSBoostClassifier.
Support for string has been added in
imblearn.over_sampling.RandomOverSampler and
imblearn.under_sampling.RandomUnderSampler. In addition, a new class
imblearn.over_sampling.SMOTENC allows to generate sample with data
sets containing both continuous and categorical features.
The imblearn.over_sampling.SMOTE has been simplified and break down
to 2 additional classes:
imblearn.over_sampling.SVMSMOTE and
imblearn.over_sampling.BorderlineSMOTE.
There is also some changes regarding the API:
the parameter sampling_strategy has been introduced to replace the
ratio parameter. In addition, the return_indices argument has been
deprecated and all samplers will exposed a sample_indices_ whenever this is
possible.
0.4.0
Version 0.4
October, 2018
.. warning::
Version 0.4 is the last version of imbalanced-learn to support Python 2.7
and Python 3.4. Imbalanced-learn 0.5 will require Python 3.5 or higher.
Highlights
This release brings its set of new feature as well as some API changes to
strengthen the foundation of imbalanced-learn.
As new feature, 2 new modules imblearn.keras and
imblearn.tensorflow have been added in which imbalanced-learn samplers
can be used to generate balanced mini-batches.
The module imblearn.ensemble has been consolidated with new classifier:
imblearn.ensemble.BalancedRandomForestClassifier,
imblearn.ensemble.EasyEnsembleClassifier,
imblearn.ensemble.RUSBoostClassifier.
Support for string has been added in
imblearn.over_sampling.RandomOverSampler and
imblearn.under_sampling.RandomUnderSampler. In addition, a new class
imblearn.over_sampling.SMOTENC allows to generate sample with data
sets containing both continuous and categorical features.
The imblearn.over_sampling.SMOTE has been simplified and break down
to 2 additional classes:
imblearn.over_sampling.SVMSMOTE and
imblearn.over_sampling.BorderlineSMOTE.
There is also some changes regarding the API:
the parameter sampling_strategy has been introduced to replace the
ratio parameter. In addition, the return_indices argument has been
deprecated and all samplers will exposed a sample_indices_ whenever this is
possible.
0.3.4
0.3.3
0.3.2
0.3.1
0.3.0
What's new in version 0.3.0
Testing
- Pytest is used instead of nosetests. :issue:
321byJoan Massich_.
Documentation
- Added a User Guide and extended some examples. :issue:
295byGuillaume Lemaitre_.
Bug fixes
-
Fixed a bug in :func:
utils.check_ratiosuch that an error is raised when
the number of samples required is negative. :issue:312byGuillaume Lemaitre_. -
Fixed a bug in :class:
under_sampling.NearMissversion 3. The
indices returned were wrong. :issue:312byGuillaume Lemaitre_. -
Fixed bug for :class:
ensemble.BalanceCascadeand :class:combine.SMOTEENN
and :class:SMOTETomek. :issue:295byGuillaume Lemaitre_.` -
Fixed bug for
check_ratioto be able to pass arguments whenratiois a
callable. :issue:307byGuillaume Lemaitre_.`
New features
-
Turn off steps in :class:
pipeline.Pipelineusing theNone
object. ByChristos Aridas_. -
Add a fetching function :func:
datasets.fetch_datasetsin order to get some
imbalanced datasets useful for benchmarking. :issue:249byGuillaume Lemaitre_.
Enhancement
-
All samplers accepts sparse matrices with defaulting on CSR type. :issue:
316by
Guillaume Lemaitre_. -
:func:
datasets.make_imbalancetake a ratio similarly to other samplers. It
supports multiclass. :issue:312byGuillaume Lemaitre_. -
All the unit tests have been factorized and a :func:
utils.check_estimators
has been derived from scikit-learn. ByGuillaume Lemaitre_. -
Script for automatic build of conda packages and uploading. :issue:
242by
Guillaume Lemaitre_ -
Remove seaborn dependence and improve the examples. :issue:
264byGuillaume Lemaitre_. -
adapt all classes to multi-class resampling. :issue:
290byGuillaume Lemaitre_
API changes summary
-
__init__has been removed from the :class:base.SamplerMixinto
create a real mixin class. :issue:242byGuillaume Lemaitre_. -
creation of a module :mod:
exceptionsto handle consistant raising of
errors. :issue:242byGuillaume Lemaitre_. -
creation of a module
utils.validationto make checking of
recurrent patterns. :issue:242byGuillaume Lemaitre_. -
move the under-sampling methods in
prototype_selectionand
prototype_generationsubmodule to make a clearer dinstinction. :issue:277by
Guillaume Lemaitre_. -
change
ratiosuch that it can adapt to multiple class problems. :issue:290by
Guillaume Lemaitre_.
Deprecation
-
Deprecation of the use of
min_c_in :func:datasets.make_imbalance. :issue:312by
Guillaume Lemaitre_ -
Deprecation of the use of float in :func:
datasets.make_imbalancefor the
ratio parameter. :issue:290byGuillaume Lemaitre_. -
deprecate the use of float as ratio in favor of dictionary, string, or
callable. :issue:290byGuillaume Lemaitre_.