You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/mlpaths/A1_Intro_to_DataScience_and_ML.md
+21-11Lines changed: 21 additions & 11 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -180,6 +180,7 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
180
180
d) The development of software that can reason and solve complex problems like humans.<br>
181
181
*(Answer: (1) :eyes:)*
182
182
{ .annotate }
183
+
183
184
1. Answer is: <b>b</b>
184
185
185
186
2. Consider the following Python code using Pandas:
@@ -195,9 +196,10 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
195
196
b) 0<br>
196
197
c) Approximately 2.67<br>
197
198
d) 6<br>
198
-
*(Answer: (a) :eyes:)*
199
+
*(Answer: (2) :eyes:)*
199
200
{ .annotate }
200
-
a. Answer is: <b>c</b>
201
+
202
+
2. Answer is: <b>c</b>
201
203
202
204
203
205
3. Which Python library is primarily used for creating statistical visualizations like heatmaps and pair plots with concise syntax?<br>
@@ -207,6 +209,7 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
207
209
d) Scikit-learn<br>
208
210
*(Answer: (1) :eyes:)*
209
211
{ .annotate }
212
+
210
213
1. Answer is: <b>b</b>
211
214
212
215
@@ -217,34 +220,38 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
217
220
d) It's a technique for reducing the number of features.<br>
218
221
*(Answer: (1) :eyes:)*
219
222
{ .annotate }
223
+
220
224
1. Answer is: <b>b</b>
221
225
222
226
5. What is the primary purpose of `train_test_split`in Scikit-learn?<br>
223
227
a) To combine two different datasets into one.<br>
224
228
b) To separate features from the target variable within a single dataset.<br>
225
229
c) To divide a dataset into one part for training the model and another, unseen part for evaluating its performance.<br>
226
230
d) To visualize the distribution of data.<br>
227
-
*(Answer: (a) :eyes:)*
231
+
*(Answer: (2) :eyes:)*
228
232
{ .annotate }
229
-
a. Answer is: <b>c</b>
233
+
234
+
2. Answer is: <b>c</b>
230
235
231
236
6. If you want to create a scatter plot in Python to visualize the relationship between 'Height'and'Weight' columns in a Pandas DataFrame `df`, which line of code is most appropriate using Seaborn?<br>
232
237
a) `sns.histplot(data=df, x='Height', y='Weight')`<br>
233
238
b) `sns.boxplot(data=df, x='Height', y='Weight')`<br>
234
239
c) `sns.scatterplot(data=df, x='Height', y='Weight')`<br>
235
240
d) `df.plot(kind='scatter', x='Height', y='Weight')` (This is Pandas plotting, not Seaborn directly)<br>
236
-
*(Answer: (a) :eyes:)*
241
+
*(Answer: (2) :eyes:)*
237
242
{ .annotate }
238
-
a. Answer is: <b>c</b>
243
+
244
+
2. Answer is: <b>c</b>
239
245
240
246
7. You have loaded a dataset into a Pandas DataFrame called `sales_df`. How would you display the first 10 rows of this DataFrame?<br>
241
247
a) `sales_df.show(10)`<br>
242
248
b) `sales_df.display_head(10)`<br>
243
249
c) `sales_df.head(10)`<br>
244
250
d) `sales_df.first(10)`<br>
245
-
*(Answer: (a) :eyes:)*
251
+
*(Answer: (2) :eyes:)*
246
252
{ .annotate }
247
-
a. Answer is: <b>c</b>
253
+
254
+
2. Answer is: <b>c</b>
248
255
249
256
8. When you encounter a Python error message that you don't understand while working in a Jupyter Notebook, how can an LLM assist you most effectively?<br>
250
257
a) By automatically fixing the code in your notebook.<br>
@@ -253,6 +260,7 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
253
260
d) By advising you to restart your computer.<br>
254
261
*(Answer: (1) :eyes:)*
255
262
{ .annotate }
263
+
256
264
1. Answer is: <b>b</b>
257
265
258
266
9. What does the `.info()` method in Pandas primarily provide for a DataFrame?<br>
@@ -262,16 +270,18 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
262
270
d) The correlation matrix of numerical columns.<br>
263
271
*(Answer: (a) :eyes:)*
264
272
{ .annotate }
273
+
265
274
a. Answer is: <b>c</b>
266
275
267
276
10. Which of these tasks falls under the 'Data Cleaning/Preparation' stage of the data science workflow?<br>
268
277
a) Defining business objectives.<br>
269
278
b) Training a machine learning model.<br>
270
279
c) Handling missing values and transforming variables.<br>
0 commit comments