Skip to content

Commit 0ec4f2b

Browse files
committed
2 parents 914e527 + eef3f9a commit 0ec4f2b

File tree

2 files changed

+25
-19
lines changed

2 files changed

+25
-19
lines changed

README.md

Lines changed: 24 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22

33
[![CircleCI](https://circleci.com/gh/FeatureLabs/autonormalize.svg?style=shield&circle-token=b890443ca669d7e88d62ad2fd712f92951550c4a)](https://circleci.com/gh/FeatureLabs/autonormalize)
44

5-
AutoNormalize is a Python library for automated datatable normalization, intended for use with [Feature Tools](https://github.com/Featuretools/featuretools). AutoNormalize allows you to build an `EntitySet` from a single denormalized table and generate features for machine learning.
5+
AutoNormalize is a Python library for automated datatable normalization, intended for use with [Featuretools](https://github.com/Featuretools/featuretools). AutoNormalize allows you to build an `EntitySet` from a single denormalized table and generate features for machine learning.
66

77
Before AutoNormalize:
88

@@ -14,7 +14,7 @@ After AutoNormalize:
1414
<br />
1515
### Install
1616
```shell
17-
pip install autonormalize
17+
pip install featuretools[autonormalize]
1818
```
1919
### Uninstall
2020
```shell
@@ -23,40 +23,46 @@ pip uninstall autonormalize
2323
<br />
2424

2525
### API Reference
26+
27+
#### `auto_entityset`
2628
```shell
2729
auto_entityset(df, accuracy=0.98, index=None, name=None, time_index=None)
2830
```
2931
Creates a normalized entityset from a dataframe.
3032

31-
Arguments:
33+
**Arguments:**
3234

33-
`df` (pd.Dataframe) : the dataframe containing data
35+
* `df` (pd.Dataframe) : the dataframe containing data
3436

35-
`accuracy` (0 < float <= 1.00; default = 0.98) : the accuracy threshold required in order to conclude a dependency (i.e. with accuracy = 0.98, 0.98 of the rows must hold true the dependency LHS --> RHS)
37+
* `accuracy` (0 < float <= 1.00; default = 0.98) : the accuracy threshold required in order to conclude a dependency (i.e. with accuracy = 0.98, 0.98 of the rows must hold true the dependency LHS --> RHS)
3638

37-
`index` (str, optional) : name of column that is intended index of df
39+
* `index` (str, optional) : name of column that is intended index of df
3840

39-
`name` (str, optional) : the name of created EntitySet
41+
* `name` (str, optional) : the name of created EntitySet
4042

41-
`time_index` (str, optional) : name of time column in the dataframe.
43+
* `time_index` (str, optional) : name of time column in the dataframe.
4244

43-
Returns:
45+
**Returns:**
4446

45-
`entityset` (ft.EntitySet) : created entity set
47+
* `entityset` (ft.EntitySet) : created entity set
4648

4749
<br />
4850

51+
#### `find_dependencies`
52+
4953
```shell
5054
find_dependencies(df, accuracy=0.98, index=None)
5155
```
5256
Finds dependencies within dataframe with the DFD search algorithm.
5357

54-
Returns:
58+
**Returns:**
5559

56-
`dependencies` (Dependencies) : the dependencies found in the data within the contraints provided
60+
* `dependencies` (Dependencies) : the dependencies found in the data within the contraints provided
5761

5862
<br />
5963

64+
#### `normalize_dataframe`
65+
6066
```shell
6167
normalize_dataframe(df, dependencies)
6268
```
@@ -65,20 +71,20 @@ Normalizes dataframe based on the dependencies given. Keys for the newly created
6571
2) has "id" in some form in the name of an attribute
6672
3) has attribute furthest to left in the table
6773

68-
Returns:x
69-
70-
`new_dfs` (list[pd.DataFrame]) : list of new dataframes
74+
**Returns:**
75+
* `new_dfs` (list[pd.DataFrame]) : list of new dataframes
7176

7277
<br />
7378

79+
#### `make_entityset`
80+
7481
```shell
7582
make_entityset(df, dependencies, name=None, time_index=None):
7683
```
7784
Creates a normalized EntitySet from dataframe based on the dependencies given. Keys are chosen in the same fashion as for `normalize_dataframe`and a new index will be created if any key has more than a single attribute.
7885

79-
Returns:
80-
81-
`entityset` (ft.EntitySet) : created EntitySet
86+
**Returns:**
87+
* `entityset` (ft.EntitySet) : created EntitySet
8288

8389
<br />
8490

setup.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
name='autonormalize',
1111
version='0.1.2',
1212
description='a library for automated table normalization',
13-
url='http://featuretools.com',
13+
url='https://github.com/FeatureLabs/autonormalize',
1414
license='BSD 3-clause',
1515
author='Feature Labs, Inc.',
1616
author_email='[email protected]',

0 commit comments

Comments
 (0)