You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* replaces readthedocs by pydata doc template
* reorganizes files to tidy navigation tab
* adds packt attribution link
* edits startpage and quickstart
* major rearrangement of files to make docs work
* fixes wording, reorganizes more files
* update arbitrary number imputer
* update mean median imputer
* update end tail imputer
* update categorical imputer
* update random sample imputer
* update missing indicator
* update drop na imputer
* adds class link to user guide imputation
* update user guide and api creation
* updates index of creation modules
* updates arbitrary and tree discretizers
* update equal frequency disc
* updates equal width discretiser
* update count encoding
* update tree encoder
* update one hot encoder
* update ordinal encoder
* updates remaining encoders
* revises intro and updates tab names
* reviews and updates quickstart files
* reviews and updates imputation api docs
* add table undertitle
* adds '' to string param values
* reviews and updates encoding api docs
* adds markdown tables
* reviews and updates discretisation api
* review and update creation api
* reorganises selection files into user guide and api
* reorganises files into api and user guide for outliers
* reorganises files into user guide and api for transformation
* rewords return in inverse_transform method
* reorganises preprocessing into user gudie and api
* reorganises files into user guide and api for wrapper
* updates and reorganises the learning resources
* updates and improves about
* reviews and edits transformation and outlier api docs
* review and update wrappers and preprocessors api files
* reviews and edits selection api files
* final update about and learning resources
* updates user guide wrappers
* final edits preprocessors user guide
* reviews and edits imputation user guide
* reviews and edits user guide encoding
* final edit to encoding user guide index
* edits main transformations in user guide
* edits final transformation user guide files
* final edits to discretisation user guide files
* final review user guide outliers and creation
* changes capitalization of figure name
* review some of selection user guide files
* edits user guide correlation transformers
* updates more selection user guide files
* final edits to user guide selection
* reorganises contribute files
* update contribute index
* updates contribute code guidelines
* edits contribute code file
* final update contribute section
* reorganises whats new
* adds smoothing info to target encoder user guide
* re-inserts paths to doc config file
* add functionality to drop data based on percentage of NA in DropMissingData (#322)
* DropMissingData thresh_pct param
* adding thresh_pct & test_thresh_pct to DropMissingData
* renamed test to correct thresh
* add fit_transform to test_thresh_pct & pass tox
* update doc naming
* rename thresh_pct to drop_pct for clarity, added tests, rebase from pydata_template
* Refactoring to row_drop_pct, still under discussion.
* pass black, isort, & flake8
* tox.ini had incorrect indentation, resolving
* remove non-relevant files from PR
* update init docstrings
* adds final edits to logic plus todo list
* threshold tests with new logic & return_na_data
* changes wording in test comments
@kevinkurek I forgot that I could edit files directly from the PR
* updating comments, at least 1% must be available
* re word docstrings for param threshold
@kevinkurek could you double check that this wording is correct?
Co-authored-by: Soledad Galli <[email protected]>
* update readme
* add logo to readme
* final update readme
Co-authored-by: Kevin <[email protected]>
[](https://feature-engine.readthedocs.io/en/latest/index.html)
8
9
[](https://gitter.im/feature_engine/community)
*[Feature-engine: A new open-source Python package for feature engineering](https://trainindata.medium.com/feature-engine-a-new-open-source-python-package-for-feature-engineering-29a0ab88ea7c)
35
38
36
39
*[Practical Code Implementations of Feature Engineering for Machine Learning with Python](https://towardsdatascience.com/practical-code-implementations-of-feature-engineering-for-machine-learning-with-python-f13b953d4bcd)
*[Ingeniería de variables para machine learning, Curso Online](https://www.udemy.com/course/ingenieria-de-variables-para-machine-learning/?referralCode=CE398C784F17BD87482C)
46
45
47
46
*[Ingeniería de variables, MachinLenin, charla online](https://www.youtube.com/watch?v=NhCxOOoFXds)
48
47
49
-
More resources will be added as they appear online!
See more usage examples in the Jupyter Notebooks in the **example** folder of this repository, or in the [documentation](http://feature-engine.readthedocs.io).
181
+
Find more examples in our [Jupyter Notebook Gallery](https://nbviewer.org/github/feature-engine/feature-engine-examples/tree/main/)
182
+
or in the [documentation](http://feature-engine.readthedocs.io).
179
183
180
-
## Contributing
184
+
## Contribute
181
185
182
-
Details about how to contribute can be found in the [Contributing Page](https://feature-engine.readthedocs.io/en/latest/contribute/index.html)
186
+
Details about how to contribute can be found in the [Contribute Page](https://feature-engine.readthedocs.io/en/latest/contribute/index.html)
183
187
184
-
In short:
188
+
Briefly:
185
189
186
-
### Local Setup Steps
187
190
- Fork the repo
188
191
- Clone your fork into your local computer: ``git clone https://github.com/<YOURUSERNAME>/feature_engine.git``
189
-
-cd into the repo ``cd feature_engine``
190
-
- Install as a developer: ``pip install -e .``
191
-
- Create and activate a virtual environment with any tool of choice
192
-
- Install the dependencies as explained in the [Contributing Page](https://feature-engine.readthedocs.io/en/latest/contribute/index.html)
192
+
-navigate into the repo folder``cd feature_engine``
193
+
- Install Feature-engine as a developer: ``pip install -e .``
194
+
-Optional: Create and activate a virtual environment with any tool of choice
- Create a feature branch with a meaningful name for your feature: ``git checkout -b myfeaturebranch``
194
197
- Develop your feature, tests and documentation
195
198
- Make sure the tests pass
196
199
- Make a PR
197
200
198
201
Thank you!!
199
202
200
-
### Opening Pull Requests
201
-
PR's are welcome! Please make sure the CI tests pass on your branch.
202
-
203
-
### Tests
204
-
205
-
We prefer tox. In your environment:
206
-
207
-
- Run `pip install tox`
208
-
- cd into the root directory of the repo: ``cd feature_engine``
209
-
- Run `tox`
210
-
211
-
If the tests pass, the code is functional.
212
-
213
-
You can also run the tests in your environment (without tox). For guidelines on how to do so, check the [Contributing Page](https://feature-engine.readthedocs.io/en/latest/contribute/index.html).
214
-
215
203
216
204
### Documentation
217
205
218
206
Feature-engine documentation is built using [Sphinx](https://www.sphinx-doc.org) and is hosted on [Read the Docs](https://readthedocs.org/).
219
207
220
-
To build the documentation make sure you have the dependencies installed. From the root directory: ``pip install -r docs/requirements.txt``.
208
+
To build the documentation make sure you have the dependencies installed: from the root directory: ``pip install -r docs/requirements.txt``.
221
209
222
-
Now you can build the docs: ``sphinx-build -b html docs build``
210
+
Now you can build the docs using: ``sphinx-build -b html docs build``
223
211
224
212
225
213
## License
226
214
227
215
BSD 3-Clause
228
-
229
-
230
-
## References
231
-
232
-
Many of the engineering and encoding functionalities are inspired by this [series of articles from the 2009 KDD Competition](http://www.mtome.com/Publications/CiML/CiML-v3-book.pdf).
If you use Feature-engine in a scientific publication, you can cite the following paper:
51
54
Galli, S., (2021). `Feature-engine: A Python package for feature engineering for machine learning. <https://joss.theoj.org/papers/10.21105/joss.03642>`_
52
55
Journal of Open Source Software, 6(65), 3642.
@@ -81,19 +84,54 @@ Artwork
81
84
High quality PNG and SVG logos are available in the `docs/images/ <https://github.com/feature-engine/feature_engine/tree/main/docs/images/logo>`_
82
85
source directory of the repository.
83
86
84
-
.. figure:: images/logo/FeatureEngine.png
85
-
:align:center
87
+
.. figure:: ../images/logo/FeatureEngine.png
88
+
:width:200
89
+
:figclass:align-center
90
+
:align:center
86
91
87
92
88
93
Sponsors
89
94
--------
90
95
91
96
Feature-engine is a community driven project, however institutional and private grants
92
-
help to assure its sustainability.
97
+
help to assure its sustainability. The project would like to thank the following
98
+
sponsors:
99
+
100
+
|
101
+
|
102
+
103
+
.. raw:: html
104
+
105
+
</div>
106
+
</div>
107
+
108
+
........
109
+
110
+
.. raw:: html
111
+
112
+
<divclass="sk-sponsor-div">
113
+
<divclass="sk-sponsor-div-box">
114
+
115
+
Soledad Galli spends a big part of her time at `Train in Data <https://www.trainindata.com/>`_
116
+
maintaining the project.
117
+
118
+
.. raw:: html
119
+
120
+
</div>
121
+
122
+
<divclass="sk-sponsor-div-box">
123
+
124
+
.. image:: ../images/sponsors/trainindata.png
125
+
:width:150pt
126
+
:align:center
127
+
:target:https://www.trainindata.com/
128
+
129
+
.. raw:: html
93
130
94
-
The project would like to thank the following sponsors:
131
+
</div>
132
+
</div>
95
133
96
-
Soledad Galli spends a big part of her time at Train in Data maintaining the project.
0 commit comments