You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: AI-and-Analytics/Getting-Started-Samples/IntelModin_Vs_Pandas/README.md
+23-17Lines changed: 23 additions & 17 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,16 +1,16 @@
1
-
# `Intel® Modin* Vs. Pandas Performance` Sample
1
+
# Modin Vs. Pandas Performance Sample
2
2
3
-
The `Intel® Modin* Vs. Pandas Performance` code illustrates how to use Modin* to replace the Pandas API. The sample compares the performance of Intel® Distribution of Modin* and the performance of Pandas for specific dataframe operations.
3
+
The `Modin Vs. Pandas Performance` code illustrates how to use Modin* to replace the Pandas API. The sample compares the performance of Modin and the performance of Pandas for specific dataframe operations.
4
4
5
5
| Area | Description
6
6
|:--- |:---
7
-
| What you will learn | How to accelerate the Pandas API using Intel® Distribution of Modin*.
8
-
| Time to complete | Less than 10 minutes
9
7
| Category | Concepts and Functionality
8
+
| What you will learn | How to accelerate the Pandas API using Modin.
9
+
| Time to complete | Less than 10 minutes
10
10
11
11
## Purpose
12
12
13
-
Intel® Distribution of Modin* accelerates Pandas operations using Ray or Dask execution engine. The distribution provides compatibility and integration with the existing Pandas code. The sample code demonstrates how to perform some basic dataframe operations using Pandas and Intel® Distribution of Modin*. You will be able to compare the performance difference between the two methods.
13
+
Modin accelerates Pandas operations using Ray or Dask execution engine. The distribution provides compatibility and integration with the existing Pandas code. The sample code demonstrates how to perform some basic dataframe operations using Pandas and Modin. You will be able to compare the performance difference between the two methods.
14
14
You can run the sample locally or in Google Colaboratory (Colab).
15
15
16
16
## Prerequisites
@@ -25,25 +25,25 @@ You can run the sample locally or in Google Colaboratory (Colab).
25
25
26
26
This code sample is implemented for CPU using Python programming language. The sample requires NumPy, Pandas, Modin libraries, and the time module in Python.
27
27
28
-
## Run the `Intel® Modin Vs Pandas Performance` Sample Locally
28
+
## Environment Setup
29
29
30
-
If you want to run the sample on a local system using a command-line interface (CLI), you must install the Intel® Distribution of Modin* in a new Conda* environment first.
30
+
If you want to run the sample on a local system using a command-line interface (CLI), you must install the Modin in a new Conda* environment first.
31
31
32
-
### Install the Intel® Distribution of Modin*
32
+
### Install Modin
33
33
34
34
1. Create a Conda environment.
35
35
```
36
-
conda create --name aikit-modin
36
+
conda create --name modin
37
37
```
38
38
2. Activate the Conda environment.
39
39
```
40
-
source activate aikit-modin
40
+
source activate modin
41
41
```
42
42
3. Remove existing versions of Modin* (if any exist).
43
43
```
44
44
conda remove modin --y
45
45
```
46
-
4. Install Intel® Distribution of Modin* (v0.12.1 or newer).
46
+
4. Install Modin (v0.12.1 or newer).
47
47
```
48
48
pip install modin[all]==0.12.1
49
49
```
@@ -58,24 +58,24 @@ If you want to run the sample on a local system using a command-line interface (
58
58
```
59
59
### Run the Sample
60
60
61
-
1. Change to the directory containing the `IntelModin_Vs_Pandas.ipynb` notebook file on your local system.
61
+
1. Change to the directory containing the `Modin_Vs_Pandas.ipynb` notebook file on your local system.
62
62
63
63
2. Run the sample notebook.
64
64
```
65
-
ipython IntelModin_Vs_Pandas.ipynb
65
+
ipython Modin_Vs_Pandas.ipynb
66
66
```
67
67
68
-
## Run the `Intel® Modin Vs Pandas Performance` Sample in Google Colaboratory
68
+
## Run the `Modin Vs Pandas Performance` Sample in Google Colaboratory
69
69
70
-
1. Change to the directory containing the `IntelModin_Vs_Pandas.ipynb` notebook file on your local system.
70
+
1. Change to the directory containing the `Modin_Vs_Pandas.ipynb` notebook file on your local system.
71
71
72
72
2. Open the notebook file, and remove the prepended number sign (#) symbol from the following lines:
73
73
```
74
74
#!pip install modin[all]==0.12.1
75
75
#!pip install numpy
76
76
#!pip install pandas
77
77
```
78
-
These changes will install the Intel® Distribution of Modin* and the NumPy and Pandas libraries when run in the Colab notebook.
78
+
These changes will install the Modin and the NumPy and Pandas libraries when run in the Colab notebook.
79
79
80
80
3. Save your changes.
81
81
@@ -100,11 +100,17 @@ CPU times: user 8.47 s, sys: 132 ms, total: 8.6 s
100
100
Wall time: 8.57 s
101
101
```
102
102
103
-
Example expected cell output is included in `IntelModin_Vs_Pandas.ipynb`.
103
+
Example expected cell output is included in `Modin_Vs_Pandas.ipynb`.
104
+
105
+
## Related Samples
106
+
107
+
*[Modin Get Started Sample](https://github.com/oneapi-src/oneAPI-samples/tree/master/AI-and-Analytics/Getting-Started-Samples/Modin_GettingStarted)
104
108
105
109
## License
106
110
107
111
Code samples are licensed under the MIT license. See
108
112
[License.txt](https://github.com/oneapi-src/oneAPI-samples/blob/master/License.txt) for details.
109
113
110
114
Third party program licenses are at [third-party-programs.txt](https://github.com/oneapi-src/oneAPI-samples/blob/master/third-party-programs.txt).
115
+
116
+
*Other names and brands may be claimed as the property of others. [Trademarks](https://www.intel.com/content/www/us/en/legal/trademarks.html)
0 commit comments