Commit 68cc377

update files for release X1.0.0 (#202)
* updated changelog
* update readme
* remove print statement in smart corr
* change error for warning, fix selection based on r2
* fix style check
* modify threshold functionality, adjust tests, include example
* fix stylecheck
* fix indexing and add example
* fix absolute threshold for r2, mean target selector
* change threshold definition, adjust test, fix docstrings
* add template feature selection examples
* fix test error due to random float
* add links to tutorials
* update circleci with deployment branch
* add summary table to selection docs
* added contributor to release
1 parent 7278035 commit 68cc377

31 files changed (+2666 additions, −103 deletions)

.circleci/config.yml

Lines changed: 1 addition & 1 deletion
@@ -138,4 +138,4 @@ workflows:
       filters:
         branches:
           only:
-            - 1.1.X
+            - 1.0.X

.gitignore

Lines changed: 2 additions & 2 deletions
@@ -109,6 +109,6 @@ venv.bak/
 .idea
 .vscode
 *.csv
-
 *.DS_Store
-untitled9.py
+*.db
+*.pptx

README.md

Lines changed: 17 additions & 17 deletions
@@ -8,7 +8,9 @@
 ![Documentation Status](https://readthedocs.org/projects/feature-engine/badge/?version=latest)
 
-Feature-engine is a Python library with multiple transformers to engineer features for use in machine learning models. Feature-engine's transformers follow scikit-learn's functionality with fit() and transform() methods to first learn the transforming parameters from data and then transform the data.
+Feature-engine is a Python library with multiple transformers to engineer features for use in machine learning models.
+Feature-engine's transformers follow scikit-learn's functionality with fit() and transform() methods to first learn the
+transforming parameters from data and then transform the data.
 
 
 ## Feature-engine features in the following resources:

@@ -38,33 +40,32 @@ More resources will be added as they appear online!
 
 
 ## Current Feature-engine's transformers include functionality for:
-
 * Missing Data Imputation
 * Categorical Variable Encoding
 * Outlier Capping or Removal
 * Discretisation
 * Numerical Variable Transformation
 * Scikit-learn Wrappers
-* Variables Combination
+* Variable Combination
 * Variable Selection
 
 ### Imputing Methods
-
 * MeanMedianImputer
 * RandomSampleImputer
 * EndTailImputer
 * AddNaNBinaryImputer
-* CategoricalVariableImputer
-* FrequentCategoryImputer
+* CategoricalImputer
 * ArbitraryNumberImputer
 
 ### Encoding Methods
-* CountFrequencyCategoricalEncoder
-* OrdinalCategoricalEncoder
-* MeanCategoricalEncoder
-* WoERatioCategoricalEncoder
-* OneHotCategoricalEncoder
-* RareLabelCategoricalEncoder
+* OneHotEncoder
+* OrdinalEncoder
+* CountFrequencyEncoder
+* MeanEncoder
+* WoEEncoder
+* PRatioEncoder
+* RareLabelEncoder
+* DecisionTreeEncoder
 
 ### Outlier Handling methods
 * Winsorizer

@@ -85,23 +86,22 @@ More resources will be added as they appear online!
 * YeoJohnsonTransformer
 
 ### Scikit-learn Wrapper:
-
 * SklearnTransformerWrapper
 
 ### Variable Combinations:
-
-* MathematicalCombinator
+* MathematicalCombination
 
 ### Feature Selection:
-
 * DropFeatures
 * DropConstantFeatures
 * DropDuplicateFeatures
 * DropCorrelatedFeatures
+* SmartCorrelationSelection
 * ShuffleFeaturesSelector
 * SelectBySingleFeaturePerformance
 * SelectByTargetMeanPerformance
 * RecursiveFeatureElimination
+* RecursiveFeatureAddition
 
 
 ## Installing

@@ -127,8 +127,8 @@ git clone https://github.com/solegalli/feature_engine.git
 ### Usage
 
 ```python
->>> from feature_engine.encoding import RareLabelEncoder
 >>> import pandas as pd
+>>> from feature_engine.encoding import RareLabelEncoder
 
 >>> data = {'var_A': ['A'] * 10 + ['B'] * 10 + ['C'] * 2 + ['D'] * 1}
 >>> data = pd.DataFrame(data)
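The README snippet above groups infrequent categories with RareLabelEncoder, following the fit()/transform() split described earlier. The underlying idea can be sketched in plain Python (a minimal stdlib illustration, not feature-engine's actual implementation; the `tol` threshold and the `"Rare"` placeholder are assumptions for this sketch):

```python
from collections import Counter

def fit_rare_labels(values, tol=0.05):
    # "fit" step: learn which categories are frequent enough to keep.
    # Categories with relative frequency below `tol` will later be
    # grouped into a single "Rare" label.
    counts = Counter(values)
    n = len(values)
    return {cat for cat, c in counts.items() if c / n >= tol}

def transform_rare_labels(values, frequent):
    # "transform" step: replace infrequent categories with "Rare".
    return [v if v in frequent else "Rare" for v in values]

# Same toy data as the README snippet: A x10, B x10, C x2, D x1 (n = 23).
data = ["A"] * 10 + ["B"] * 10 + ["C"] * 2 + ["D"] * 1
frequent = fit_rare_labels(data, tol=0.1)        # keeps A and B (freq >= 10%)
encoded = transform_rare_labels(data, frequent)
print(sorted(set(encoded)))                      # ['A', 'B', 'Rare']
```

Learning the frequent categories once during fit and reusing them in transform is what lets the same grouping be applied consistently to training and test data.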

docs/blogs.rst

Lines changed: 1 addition & 1 deletion
@@ -16,7 +16,7 @@ Blogs
 Videos
 ------
 
-- Coming soon!
+- `Optimising Feature Engineering Pipelines with Feature-engine <https://www.youtube.com/watch?v=qT-3KUaFYmk/>`_, PyData Cambridge 2020, from minute 51:43.
 
 En Español
 ----------

docs/howto.rst

Lines changed: 3 additions & 1 deletion
@@ -3,4 +3,6 @@
 How To
 ======
 
-Coming Soon!
+Find `jupyter notebooks with examples <https://nbviewer.jupyter.org/github/solegalli/feature_engine/tree/master/examples/>`_
+of each transformer's functionality. Within each folder, you will find a jupyter
+notebook showcasing the functionality of each transformer.

docs/images/Thumbs.db

Binary file removed (−71.5 KB)

docs/images/selectionSummary.png

Binary file added (+91.2 KB)

docs/selection/index.rst

Lines changed: 5 additions & 0 deletions
@@ -6,6 +6,11 @@ Feature Selection
 Feature-engine's feature selection transformers are used to drop subsets of variables.
 Or in other words to select subsets of variables.
 
+.. figure:: ../images/selectionSummary.png
+   :align: center
+
+   Summary of the main characteristics of Feature-engine's selectors
+
 .. toctree::
    :maxdepth: 2
 
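The selection docs above describe transformers that drop subsets of variables; one family (DropCorrelatedFeatures, SmartCorrelationSelection) does so by examining pairwise correlations. A greedy keep-the-first-seen strategy can be sketched in plain Python (an illustrative sketch only, not the library's algorithm; the 0.8 threshold, the column names, and the keep-first policy are assumptions):

```python
def pearson(x, y):
    # Pearson correlation coefficient of two equal-length sequences.
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)

def drop_correlated(table, threshold=0.8):
    # Greedily keep each column unless it is strongly correlated
    # (|r| >= threshold) with a column that was already kept.
    kept = {}
    for name, col in table.items():
        if all(abs(pearson(col, other)) < threshold for other in kept.values()):
            kept[name] = col
    return kept

table = {
    "x1": [1.0, 2.0, 3.0, 4.0, 5.0],
    "x2": [2.1, 4.2, 6.0, 8.1, 9.9],   # nearly 2 * x1, so highly correlated
    "x3": [5.0, 1.0, 4.0, 2.0, 3.0],   # weakly correlated with x1
}
print(list(drop_correlated(table)))    # ['x1', 'x3']
```

The keep-first policy is arbitrary; the point of a "smart" correlation selector is precisely to choose which member of each correlated group to retain by a better criterion, such as model performance.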

docs/whats_new/v1.rst

Lines changed: 49 additions & 22 deletions
@@ -3,21 +3,41 @@ Version 1.0.0
 
 Deployed: TBD
 
-Contributors:
+Contributors
+------------
+- Ashok Kumar
 - Christopher Samiullah
 - Nicolas Galli
 - Nodar Okroshiashvili
+- Pradumna Suryawanshi
 - Sana Ben Driss
 - Tejash Shah
 - Tung Lee
 - Soledad Galli
 
 
 In this version, we made a major overhaul of the package, with code quality improvement
-throughout the code base, unification of attributes and methods when possible, addition
-of new transformers and extended documentation. Read below for more details.
+throughout the code base, unification of attributes and methods, addition of new
+transformers and extended documentation. Read below for more details.
 
-**Renaming of Modules within Feature-engine**:
+New transformers for Feature Selection
+--------------------------------------
+
+We included a whole new module with multiple transformers to select features.
+
+- **DropConstantFeatures**: removes constant and quasi-constant features from a dataframe (**by Tejash Shah**)
+- **DropDuplicateFeatures**: removes duplicated features from a dataset (**by Tejash Shah and Soledad Galli**)
+- **DropCorrelatedFeatures**: removes features that are correlated (**by Nicolas Galli**)
+- **SmartCorrelationSelection**: selects one feature from each group of correlated features, based on given criteria (**by Soledad Galli**)
+- **ShuffleFeaturesSelector**: selects features by the drop in machine learning model performance after the feature's values are randomly shuffled (**by Sana Ben Driss**)
+- **SelectBySingleFeaturePerformance**: selects features based on the performance of an ML model trained on each individual feature (**by Nicolas Galli**)
+- **SelectByTargetMeanPerformance**: selects features by encoding the categories or intervals with the target mean and using that as a proxy for performance (**by Tung Lee and Soledad Galli**)
+- **RecursiveFeatureElimination**: selects features recursively, evaluating the drop in ML performance, from the least to the most important feature (**by Sana Ben Driss**)
+- **RecursiveFeatureAddition**: selects features recursively, evaluating the increase in ML performance, from the most to the least important feature (**by Sana Ben Driss**)
+
+
+Renaming of Modules
+-------------------
 
 Feature-engine transformers have been sorted into submodules to smooth the development
 of the package and shorten import syntax for users.

@@ -30,50 +50,57 @@ of the package and shorten import syntax for users.
 - **Module selection**: new module hosts transformers to select or remove variables from a dataset.
 - **Module creation**: new module hosts transformers that combine variables into new features using mathematical or other operations.
 
-**Renaming of Classes**:
+Renaming of Classes
+-------------------
 
-In this release, we have shortened the name of categorical encoders, and also renamed
-other classes of Feature-engine to simplify import syntax.
+We shortened the names of the categorical encoders, and also renamed other classes to
+simplify import syntax.
 
 - **Encoders**: the word ``Categorical`` was removed from the class names. Now, instead of ``MeanCategoricalEncoder``, the class is called ``MeanEncoder``. Instead of ``RareLabelCategoricalEncoder`` it is ``RareLabelEncoder``, and so on. Please check the encoders documentation for more details.
 - **Imputers**: the ``CategoricalVariableImputer`` is now called ``CategoricalImputer``.
 - **Discretisers**: the ``UserInputDiscretiser`` is now called ``ArbitraryDiscretiser``.
 - **Creation**: the ``MathematicalCombinator`` is now called ``MathematicalCombination``.
 - **WoEEncoder and PRatioEncoder**: the ``WoEEncoder`` now applies only encoding with the weight of evidence. To apply encoding by probability ratios, use a different transformer: the ``PRatioEncoder`` (**by Nicolas Galli**).
 
-**Renaming of class init Parameters**:
+Renaming of Parameters
+----------------------
 
 We renamed a few parameters to unify the nomenclature across the package.
 
 - **EndTailImputer**: the parameter ``distribution`` is now called ``imputation_method`` to unify conventions among the imputers. To impute using the IQR, we now need to pass ``imputation_method="iqr"`` instead of ``imputation_method="skewed"``.
 - **AddMissingIndicator**: the parameter ``missing_only`` now takes the boolean values ``True`` or ``False``.
 - **Winsorizer and OutlierTrimmer**: the parameter ``distribution`` is now called ``capping_method`` to unify names across Feature-engine transformers.
 
-**New transformers and classes**:
-
-We included a whole new module with multiple transformers to select features.
+Tutorials
+---------
 
-- **DropConstantFeatures**: finds and removes constant and quasi-constant features from a dataframe (**by Tejash Shah**)
-- **DropDuplicateFeatures**: finds and removes duplicated features from a dataset (**by Tejash Shah and Soledad Galli**)
-- **DropCorrelatedFeatures**: finds and removes features that are correlated (**by Nicolas Galli**)
-- **ShuffleFeaturesSelector**: selects features by determining the drop in machine learning model performance when each feature's values are randomly shuffled (**by Sana Ben Driss**)
-- **SelectBySingleFeaturePerformance**: trains a model on each individual feature, and derives performance (**by Nicolas Galli**)
-- **SelectByTargetMeanPerformance**: selects features encoding the categories with the target mean and using that as a proxy for performance (**by Tung Lee and Soledad Galli**)
-- **RecursiveFeatureElimination**: selects features recursively, evaluating the drop in ML performance, from the least to the most important feature (**by Sana Ben Driss**)
-- **RecursiveFeatureAddition**: selects features recursively, evaluating the increase in ML performance after adding a new feature, from the most to the least important feature (**by Sana Ben Driss**)
+- **Imputation**: updated "how to" examples of missing data imputation (**by Pradumna Suryawanshi**)
+- **Encoders**: new and updated "how to" examples of categorical encoding (**by Ashok Kumar**)
+- **Discretisation**: new and updated "how to" examples of discretisation (**by Ashok Kumar**)
+
+
+For Contributors and Developers
+-------------------------------
+
+Code Architecture
+~~~~~~~~~~~~~~~~~
 
-**Code Architecture - Important for Contributors and Developers**:
 - **Submodules**: transformers have been grouped within relevant submodules and modules.
 - **Individual tests**: testing classes have been subdivided into individual tests.
 - **Code Style**: we adopted flake8 for linting and PEP8 style checks, and black for automatic re-styling of code.
-- **Type hints**: we rolled out the use of type hints throughout Feature-engine classes and functions (**by Nodar Okroshiashvili, Soledad Galli and Chris Samiullah**)
+- **Type hints**: we rolled out the use of type hints throughout classes and functions (**by Nodar Okroshiashvili, Soledad Galli and Chris Samiullah**)
+
+Documentation
+~~~~~~~~~~~~~
 
-**Documentation**:
 - Switched fully to numpydoc and away from Napoleon
 - Included more detail about methods, parameters, returns and raises, as per numpydoc docstring style (**by Nodar Okroshiashvili, Soledad Galli**)
 - Linked documentation to the GitHub repository
 - Improved layout
 
-**Other Changes**:
+Other Changes
+-------------
+
 - **Updated documentation**: documentation reflects the current use of Feature-engine transformers
 - **Typo fixes**: Thank you to all who contributed to typo fixes (Tim Vink, GitHub user @piecot)
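The changelog above describes DropConstantFeatures and DropDuplicateFeatures as removing constant and duplicated columns. The core idea can be sketched in a few lines of plain Python over a dict-of-lists "dataframe" (an illustrative sketch under that simplified data model, not feature-engine's implementation; column names are made up, and quasi-constant handling via a tolerance is omitted):

```python
def drop_constant_features(table):
    # Remove columns whose values are all identical (constant).
    return {name: col for name, col in table.items() if len(set(col)) > 1}

def drop_duplicate_features(table):
    # Keep only the first of any group of identical columns.
    seen, kept = set(), {}
    for name, col in table.items():
        key = tuple(col)
        if key not in seen:
            seen.add(key)
            kept[name] = col
    return kept

# Hypothetical toy data: var_b is constant, var_c duplicates var_a.
table = {
    "var_a": [1, 2, 3, 4],
    "var_b": [0, 0, 0, 0],
    "var_c": [1, 2, 3, 4],
}
selected = drop_duplicate_features(drop_constant_features(table))
print(list(selected))  # ['var_a']
```

Constant and duplicated columns carry no extra information for a model, which is why these are typically the first, cheapest selection steps in a pipeline.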

examples/feature-engine-with-sklearn-pipeline.ipynb renamed to examples/Pipelines/feature-engine-with-sklearn-pipeline.ipynb

File renamed without changes.
