Take 2nd set of TMO comments

Vianney Taquet · Vianney Taquet · commit 5c0c536ac8ae · 2022-06-20T18:15:38.000+02:00
diff --git a/notebooks/regression/exoplanets.ipynb b/notebooks/regression/exoplanets.ipynb
@@ -83,7 +83,7 @@
    "id": "961728c7-fe0b-4a47-9744-43518a88c5d2",
    "metadata": {},
    "source": [
-    "Let's start by loading the `exoplanets` dataset and seeing the main information."
+    "Let's start by loading the `exoplanets` dataset and looking at the main information."
    ]
   },
   {
@@ -148,7 +148,7 @@
    "id": "33dfa63b-eb4b-45ac-9956-afac4906e3e7",
    "metadata": {},
    "source": [
-    "The dataset contains 21 variables giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous."
+    "The dataset contains 21 features giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous."
    ]
   },
   {
@@ -565,7 +565,7 @@
    "id": "9d03ae07-9775-4c15-a7b6-e466ea16ab9e",
    "metadata": {},
    "source": [
-    "In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical variables."
+    "In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical features."
    ]
   },
   {
diff --git a/notebooks/regression/exoplanets.md b/notebooks/regression/exoplanets.md
@@ -59,7 +59,7 @@ warnings.filterwarnings("ignore")
 ## 1. Data Loading
 
 
-Let's start by loading the `exoplanets` dataset and seeing the main information.
+Let's start by loading the `exoplanets` dataset and looking at the main information.
 
 ```python
 url_file = "https://raw.githubusercontent.com/scikit-learn-contrib/MAPIE/master/notebooks/regression/exoplanets_mass.csv"
@@ -70,7 +70,7 @@ exo_df = pd.read_csv(url_file, index_col=0)
 exo_df.info()
 ```
 
-The dataset contains 21 variables giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous.
+The dataset contains 21 features giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous.
 
 
 Some properties show high variance among exoplanets and stars due to the astronomical nature of such systems. We therefore decide to use a log transformation for the following features to approach a normal distribution.
@@ -130,7 +130,7 @@ sns.pairplot(exo_df[star_cols])
 ## 3. Data preprocessing
 
 
-In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical variables.
+In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical features.
 
 ```python
 endos = list(set(exo_df.columns) - set([target]))

Original file line number	Diff line number	Diff line change
`@@ -83,7 +83,7 @@`
`83`	`83`	`"id": "961728c7-fe0b-4a47-9744-43518a88c5d2",`
`84`	`84`	`"metadata": {},`
`85`	`85`	`"source": [`
`86`		- "Let's start by loading the `exoplanets` dataset and seeing the main information."
	`86`	+ "Let's start by loading the `exoplanets` dataset and looking at the main information."
`87`	`87`	`]`
`88`	`88`	`},`
`89`	`89`	`{`
`@@ -148,7 +148,7 @@`
`148`	`148`	`"id": "33dfa63b-eb4b-45ac-9956-afac4906e3e7",`
`149`	`149`	`"metadata": {},`
`150`	`150`	`"source": [`
`151`		`- "The dataset contains 21 variables giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous."`
	`151`	`+ "The dataset contains 21 features giving complementary information about the properties of the discovered planet, the star around which the planet revolves, together with the type of discovery method. 7 features are categorical, and 14 are continuous."`
`152`	`152`	`]`
`153`	`153`	`},`
`154`	`154`	`{`
`@@ -565,7 +565,7 @@`
`565`	`565`	`"id": "9d03ae07-9775-4c15-a7b6-e466ea16ab9e",`
`566`	`566`	`"metadata": {},`
`567`	`567`	`"source": [`
`568`		`- "In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical variables."`
	`568`	`+ "In this section, we perform a simple preprocessing of the dataset in order to impute the missing values and encode the categorical features."`
`569`	`569`	`]`
`570`	`570`	`},`
`571`	`571`	`{`