Skip to content

Commit 6310072

Browse files
committed
Bumping version to 0.0b16.
1 parent 1552122 commit 6310072

File tree

4 files changed

+24
-29
lines changed

4 files changed

+24
-29
lines changed

README.md

Lines changed: 20 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -1,38 +1,37 @@
1-
# AWS Data Wrangler (BETA)
1+
# AWS Data Wrangler (beta)
22

3-
[![Code style: black](https://img.shields.io/badge/code%20style-black-000000.svg)](https://github.com/ambv/black)
4-
5-
> Utilities for Pandas and Apache Spark on AWS
6-
7-
AWS Data Wrangler aims to fill a gap between AWS Analytics Services (Glue, Athena, EMR, Redshift, S3) and the most popular Python data libraries ([Pandas](https://pandas.pydata.org/), [Apache Spark](https://spark.apache.org/)).
3+
> Utility belt to handle data on AWS.
84
95
---
106

11-
*Contents:* **[Use Cases](#Use-Cases)** | **[Installation](#Installation)** | **[Examples](#Examples)** | **[License](#License)**
7+
*Contents:* **[Use Cases](#Use-Cases)** | **[Installation](#Installation)** | **[Examples](#Examples)**
128

139
---
1410

1511
## Use Cases
1612

17-
* Pandas Dataframe -> Parquet (S3)
18-
* Pandas Dataframe -> CSV (S3)
19-
* Pandas Dataframe -> Glue Catalog
20-
* Pandas Dataframe -> Redshift
21-
* Pandas Dataframe -> Athena
22-
* CSV (S3) -> Pandas Dataframe
23-
* Athena -> Pandas Dataframe
24-
* Spark Dataframe -> Redshift
13+
* Pandas -> Parquet (S3)
14+
* Pandas -> CSV (S3)
15+
* Pandas -> Glue Catalog
16+
* Pandas -> Athena
17+
* Pandas -> Redshift
18+
* CSV (S3) -> Pandas
19+
* Athena -> Pandas
20+
* PySpark -> Redshift
2521

2622
## Installation
2723

2824
`pip install awswrangler`
2925

30-
AWS Data Wrangler runs only Python 3.6 and beyond.
31-
And runs on AWS Lambda, AWS Glue, EC2, on-premises, local, etc.
26+
Runs only with Python 3.6 and beyond.
27+
28+
Runs anywhere (AWS Lambda, AWS Glue, EMR, EC2, on-premises, local, etc).
29+
30+
*P.S.* Lambda Layer bundle and Glue egg are available to [download](https://github.com/awslabs/aws-data-wrangler/releases). It's just upload to your account and run! :rocket:
3231

3332
## Examples
3433

35-
### Writing Pandas Dataframe to Data Lake
34+
### Writing Pandas Dataframe to S3 + Glue Catalog
3635

3736
```py3
3837
session = awswrangler.Session()
@@ -46,7 +45,7 @@ session.pandas.to_parquet(
4645

4746
If a Glue Database name is passed, all the metadata will be created in the Glue Catalog. If not, only the s3 data write will be done.
4847

49-
### Reading from Data Lake to Pandas Dataframe
48+
### Reading from AWS Athena to Pandas
5049

5150
```py3
5251
session = awswrangler.Session()
@@ -56,7 +55,7 @@ dataframe = session.pandas.read_sql_athena(
5655
)
5756
```
5857

59-
### Reading from S3 file to Pandas Dataframe
58+
### Reading from S3 (CSV) to Pandas
6059

6160
```py3
6261
session = awswrangler.Session()
@@ -82,7 +81,7 @@ session.pandas.to_parquet( # Storing the data and metadata to Data Lake
8281
)
8382
```
8483

85-
### Loading Spark Dataframe to Redshift
84+
### Loading Pyspark Dataframe to Redshift
8685

8786
```py3
8887
session = awswrangler.Session(spark_session=spark)
@@ -96,7 +95,3 @@ session.spark.to_redshift(
9695
mode="append",
9796
)
9897
```
99-
100-
## License
101-
102-
This library is licensed under the Apache 2.0 License.

awswrangler/__version__.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
11
__title__ = "awswrangler"
2-
__description__ = "Utilities for Pandas and Apache Spark on AWS."
2+
__description__ = "Utility belt to handle data on AWS."
33
__version__ = "0.0b16"
44
__license__ = "Apache License 2.0"

requirements-dev.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
yapf>=0.27.0
22
flake8>=3.7.7
33
pytest>=4.3.1
4-
twine>=1.13.0
54
cfn-lint>=0.22.0
5+
twine>=1.13.0
66
pyspark>=2.4.3

testing/run-tests.sh

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,9 +2,9 @@
22

33
cd ..
44
rm -rf *.pytest_cache
5-
yapf --in-place --recursive setup.py awswrangler testing/test_awswrangler
5+
yapf --in-place --recursive setup.py awswrangler testing/test_awswrangler
66
flake8 setup.py awswrangler testing/test_awswrangler
77
pip install -e .
88
pytest testing/test_awswrangler awswrangler
99
rm -rf *.pytest_cache
10-
cd tests
10+
cd testing

0 commit comments

Comments
 (0)