33 {
44 "cell_type" : " markdown" ,
55 "source" : [
6- " ## Introduction to Probability and Statistics\r\ n " ,
7- " ## Assignment\r\ n " ,
8- " \r\ n " ,
6+ " ## Introduction to Probability and Statistics\n " ,
7+ " ## Assignment\n " ,
8+ " \n " ,
99 " In this assignment, we will use the dataset of diabetes patients taken [from here](https://www4.stat.ncsu.edu/~boos/var.select/diabetes.html)."
1010 ],
1111 "metadata" : {}
1414 "cell_type" : " code" ,
1515 "execution_count" : 13 ,
1616 "source" : [
17- " import pandas as pd\r\ n " ,
18- " import numpy as np\r\ n " ,
19- " \r\ n " ,
20- " df = pd.read_csv(\" ../../data/diabetes.tsv\" ,sep='\\ t')\r\ n " ,
17+ " import pandas as pd\n " ,
18+ " import numpy as np\n " ,
19+ " \n " ,
20+ " df = pd.read_csv(\" ../../data/diabetes.tsv\" ,sep='\\ t')\n " ,
2121 " df.head()"
2222 ],
2323 "outputs" : [
149149 {
150150 "cell_type" : " markdown" ,
151151 "source" : [
152- " \r\ n " ,
153- " In this dataset, columns as the following:\r\ n " ,
154- " * Age and sex are self-explanatory\r\ n " ,
155- " * BMI is body mass index\r\ n " ,
156- " * BP is average blood pressure\r\ n " ,
157- " * S1 through S6 are different blood measurements\r\ n " ,
158- " * Y is the qualitative measure of disease progression over one year\r\ n " ,
159- " \r\ n " ,
160- " Let's study this dataset using methods of probability and statistics.\r\ n " ,
161- " \r\ n " ,
152+ " \n " ,
153+ " In this dataset, columns as the following:\n " ,
154+ " * Age and sex are self-explanatory\n " ,
155+ " * BMI is body mass index\n " ,
156+ " * BP is average blood pressure\n " ,
157+ " * S1 through S6 are different blood measurements\n " ,
158+ " * Y is the qualitative measure of disease progression over one year\n " ,
159+ " \n " ,
160+ " Let's study this dataset using methods of probability and statistics.\n " ,
161+ " \n " ,
162162 " ### Task 1: Compute mean values and variance for all values"
163163 ],
164164 "metadata" : {}
201201 {
202202 "cell_type" : " markdown" ,
203203 "source" : [
204- " ### Task 4: Test the correlation between different variables and disease progression (Y)\r\ n " ,
205- " \r\ n " ,
204+ " ### Task 4: Test the correlation between different variables and disease progression (Y)\n " ,
205+ " \n " ,
206206 " > **Hint** Correlation matrix would give you the most useful information on which values are dependent."
207207 ],
208208 "metadata" : {}
249249 },
250250 "nbformat" : 4 ,
251251 "nbformat_minor" : 2
252- }
252+ }
0 commit comments