Commit 5c0c536

Vianney Taquet committed
Take 2nd set of TMO comments
1 parent 947ce69 commit 5c0c536

2 files changed: 6 additions, 6 deletions

notebooks/regression/exoplanets.ipynb

Lines changed: 3 additions & 3 deletions
@@ -83,7 +83,7 @@
 "id": "961728c7-fe0b-4a47-9744-43518a88c5d2",
 "metadata": {},
 "source": [
-"Let's start by loading the `exoplanets` dataset and seeing the main information."
+"Let's start by loading the `exoplanets` dataset and looking at the main information."
 ]
 },
 {
@@ -148,7 +148,7 @@
 "id": "33dfa63b-eb4b-45ac-9956-afac4906e3e7",
 "metadata": {},
 "source": [
-"The dataset contains 21 variables giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous."
+"The dataset contains 21 features giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous."
 ]
 },
 {
@@ -565,7 +565,7 @@
 "id": "9d03ae07-9775-4c15-a7b6-e466ea16ab9e",
 "metadata": {},
 "source": [
-"In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical variables."
+"In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical features."
 ]
 },
 {

notebooks/regression/exoplanets.md

Lines changed: 3 additions & 3 deletions
@@ -59,7 +59,7 @@ warnings.filterwarnings("ignore")
 ## 1. Data Loading
 
 
-Let's start by loading the `exoplanets` dataset and seeing the main information.
+Let's start by loading the `exoplanets` dataset and looking at the main information.
 
 ```python
 url_file = "https://raw.githubusercontent.com/scikit-learn-contrib/MAPIE/master/notebooks/regression/exoplanets_mass.csv"
@@ -70,7 +70,7 @@ exo_df = pd.read_csv(url_file, index_col=0)
 exo_df.info()
 ```
 
-The dataset contains 21 variables giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous.
+The dataset contains 21 features giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous.
 
 
 Some properties show high variance among exoplanets and stars due to the astronomical nature of such systems. We therefore decide to use a log transformation for the following features to approach a normal distribution.
@@ -130,7 +130,7 @@ sns.pairplot(exo_df[star_cols])
 ## 3. Data preprocessing
 
 
-In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical variables.
+In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical features.
 
 ```python
 endos = list(set(exo_df.columns) - set([target]))
