Skip to content

Commit 055eb03

Browse files
Proof reading: python for data analysis scikitlearn
1 parent d476c86 commit 055eb03

File tree

1 file changed

+26
-8
lines changed

1 file changed

+26
-8
lines changed

individual_modules/python_for_data_analysis/Python_ScikitLearn.ipynb

Lines changed: 26 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@
1717
"- Understand the basic concepts and purpose of Scikit-Learn in machine learning tasks\n",
1818
"- Implement linear regression, k-means clustering, and decision tree models using Scikit-Learn\n",
1919
"- Understand the applications and limitations of each model\n",
20-
"- Learn how to clean and preprocess data using pandas before feeding it into machine-learning models\n",
20+
"- Learn how to clean and preprocess data using Pandas before feeding it into machine-learning models\n",
2121
"- Understand the importance of handling missing values and feature engineering\n",
2222
"- Learn how to evaluate the performance of different machine-learning models\n",
2323
"- Interpret the output of models to make informed decisions based on the data \n"
@@ -57,13 +57,19 @@
5757
{
5858
"cell_type": "markdown",
5959
"id": "601a8215-a645-4fe6-8334-1dc7434a589b",
60-
"metadata": {},
60+
"metadata": {
61+
"editable": true,
62+
"slideshow": {
63+
"slide_type": ""
64+
},
65+
"tags": []
66+
},
6167
"source": [
6268
"## What is Scikit-Learn?\n",
6369
"\n",
6470
"Scikit-Learn is a popular Python package that provides a set of algorithms and tools for machine learning that are both easy to use and effective. The package includes support for various tasks, including classification, regression, clustering, dimensionality reduction and model selection and normalization.\n",
6571
"\n",
66-
"In scikit-learn the models that are avaliable include a massive number of possible arguements, and so for the purpose of this course the default arguements have been used. \n",
72+
"In Scikit-Learn, the models that are available include a massive number of possible arguments, and so for the purpose of this course, the default arguments have been used. \n",
6773
"\n",
6874
"\n"
6975
]
@@ -1294,10 +1300,16 @@
12941300
{
12951301
"cell_type": "markdown",
12961302
"id": "06b1be83-6562-4c23-8313-9677f0814b8e",
1297-
"metadata": {},
1303+
"metadata": {
1304+
"editable": true,
1305+
"slideshow": {
1306+
"slide_type": ""
1307+
},
1308+
"tags": []
1309+
},
12981310
"source": [
12991311
"## K-Means Clustering \n",
1300-
"K-means clustering is a method for parritioning data into a specified number of clusters by minimizing the variance within each cluster."
1312+
"K-means clustering is a method for partitioning data into a specified number of clusters by minimizing the variance within each cluster."
13011313
]
13021314
},
13031315
{
@@ -2373,10 +2385,16 @@
23732385
{
23742386
"cell_type": "markdown",
23752387
"id": "ff85e826-ccea-439b-ac60-555059067ed9",
2376-
"metadata": {},
2388+
"metadata": {
2389+
"editable": true,
2390+
"slideshow": {
2391+
"slide_type": ""
2392+
},
2393+
"tags": []
2394+
},
23772395
"source": [
23782396
"## Decision Tree Models\n",
2379-
"Decision Trees are a model framework that splits data into branches based on the some input data, in out case wind speed. The model resembles a tree structure with decision points and leaf nodes where each decision leads to further splits, eventually ending on a final prediction based on the input features."
2397+
"Decision Trees are a model framework that splits data into branches based on the some input data - in our case wind speed. The model resembles a tree structure with decision points and leaf nodes where each decision leads to further splits, eventually ending on a final prediction based on the input features."
23802398
]
23812399
},
23822400
{
@@ -3429,7 +3447,7 @@
34293447
},
34303448
"source": [
34313449
"## Further Models\n",
3432-
"This section of the course has only introducded a very small subset of models that are avaliable, alongside not diving into the different arguements that are avaliable. If you are interested in learning more about the models themselves, and key practical aspects of deploying and ensuring the validity of models you develop then please attend this couses next course \"Introduction to Machine Learning\""
3450+
"This section of the course has only introduced a very small subset of models that are avaliable, alongside not diving into the different arguments that are available. If you are interested in learning more about the models themselves, and key practical aspects of deploying and ensuring the validity of models you develop, then please attend e course \"Introduction to Machine Learning\"."
34333451
]
34343452
},
34353453
{

0 commit comments

Comments
 (0)