You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: projects/analyze-us-census-data-with-scipy/analyze-us-census-data-with-scipy.mdx
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -128,7 +128,7 @@ When conducting an exploratory analysis, we first want to make sure that our dat
128
128
129
129
Generally speaking, most data science models abide by what we call parametric assumptions, which refer to normal distribution of a fixed set of parameters. In our particular case, those parameters include, but are not limited to, the columns we listed above. The three parametric assumptions are independence, normality, and homogeneity of variances.
130
130
131
-
Additionally, traditional A/B testing typically utilizes one of two methods: either a chi-squared (which looks for dependence between two categorical variables) or a t-test (which looks for a statistically significant difference between the averages of two groups) to validate what we refer to as the null hypothesis (which is the assumption that there is no relationship or comparison between two patterns of behavior).
131
+
Additionally, traditional **A/B testing** typically utilizes one of two methods: either a **chi-squared** (which looks for dependence between two categorical variables) or a **t-test** (which looks for a statistically significant difference between the averages of two groups) to validate what we refer to as the null hypothesis (which is the assumption that there is no relationship or comparison between two patterns of behavior).
132
132
133
133
For this tutorial, we'll be running t-tests.
134
134
@@ -163,8 +163,8 @@ v = ("/content/moved_between_states.csv")
163
163
control = pd.read_csv(c)
164
164
variant = pd.read_csv(v)
165
165
166
-
#control.head()
167
-
#variant.head()
166
+
#control.head()
167
+
#variant.head()
168
168
```
169
169
170
170
@@ -266,7 +266,7 @@ region["High School Graduate (or its Equivalency)"] = control.groupby("Region")[
0 commit comments