Skip to content

Commit 5e08fb0

Browse files
Update A1_Intro_to_DataScience_and_ML.md
1 parent 1048b76 commit 5e08fb0

File tree

1 file changed

+21
-21
lines changed

1 file changed

+21
-21
lines changed

docs/mlpaths/A1_Intro_to_DataScience_and_ML.md

Lines changed: 21 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
[[**Back to ML Paths Home!**](https://ua-datalab.github.io/mlpaths_grids/)]
1+
[[:arrow_left: **Back to ML Paths Home!**](https://ua-datalab.github.io/mlpaths_grids/)]
22

33
## Introduction to Data Science and Machine Learning 🧭🤖
44

@@ -178,9 +178,9 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
178178
b) A field of study that gives computers the ability to learn without being explicitly programmed.<br>
179179
c) The process of using computers to analyze large datasets only.<br>
180180
d) The development of software that can reason and solve complex problems like humans.<br>
181-
*(Answer:(1) :eyes:*
181+
*(Answer:(1) :eyes:)*
182182
{ .annotate }
183-
1. Answer is: <b>b</>
183+
1. Answer is: <b>b</b>
184184

185185

186186
2. Consider the following Python code using Pandas:
@@ -196,9 +196,9 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
196196
b) 0<br>
197197
c) Approximately 2.67<br>
198198
d) 6<br>
199-
*(Answer:(2) :eyes:*
199+
*(Answer:(2) :eyes:)*
200200
{ .annotate }
201-
2. Answer is: <b>c</>
201+
2. Answer is: <b>c</b>
202202

203203

204204
3. Which Python library is primarily used for creating statistical visualizations like heatmaps and pair plots with concise syntax?<br>
@@ -207,9 +207,9 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
207207
c) Pandas<br>
208208
d) Scikit-learn<br>
209209
*(Answer: b)*
210-
*(Answer:(3) :eyes:*
210+
*(Answer:(3) :eyes:)*
211211
{ .annotate }
212-
3. Answer is: <b>b</>
212+
3. Answer is: <b>b</b>
213213

214214

215215
4. In a typical classification problem, what is the role of the 'target variable'?<br>
@@ -218,69 +218,69 @@ This quiz helps you self-assess your understanding. Answers can be verified by r
218218
c) It's a numerical value the model tries to estimate.<br>
219219
d) It's a technique for reducing the number of features.<br>
220220
*(Answer: b)*
221-
*(Answer:(4) :eyes:*
221+
*(Answer:(4) :eyes:)*
222222
{ .annotate }
223-
4. Answer is: <b>b</>
223+
4. Answer is: <b>b</b>
224224

225225
5. What is the primary purpose of `train_test_split` in Scikit-learn?<br>
226226
a) To combine two different datasets into one.<br>
227227
b) To separate features from the target variable within a single dataset.<br>
228228
c) To divide a dataset into one part for training the model and another, unseen part for evaluating its performance.<br>
229229
d) To visualize the distribution of data.<br>
230230
*(Answer: c)*
231-
*(Answer:(5) :eyes:*
231+
*(Answer:(5) :eyes:)*
232232
{ .annotate }
233-
5. Answer is: <b>c</>
233+
5. Answer is: <b>c</b>
234234

235235
6. If you want to create a scatter plot in Python to visualize the relationship between 'Height' and 'Weight' columns in a Pandas DataFrame `df`, which line of code is most appropriate using Seaborn?<br>
236236
a) `sns.histplot(data=df, x='Height', y='Weight')`<br>
237237
b) `sns.boxplot(data=df, x='Height', y='Weight')`<br>
238238
c) `sns.scatterplot(data=df, x='Height', y='Weight')`<br>
239239
d) `df.plot(kind='scatter', x='Height', y='Weight')` (This is Pandas plotting, not Seaborn directly)<br>
240240
*(Answer: c)*
241-
*(Answer:(6) :eyes:*
241+
*(Answer:(6) :eyes:)*
242242
{ .annotate }
243-
6. Answer is: <b>c</>
243+
6. Answer is: <b>c</b>
244244

245245
7. You have loaded a dataset into a Pandas DataFrame called `sales_df`. How would you display the first 10 rows of this DataFrame?<br>
246246
a) `sales_df.show(10)`<br>
247247
b) `sales_df.display_head(10)`<br>
248248
c) `sales_df.head(10)`<br>
249249
d) `sales_df.first(10)`<br>
250250
*(Answer: c)*
251-
*(Answer:(7) :eyes:*
251+
*(Answer:(7) :eyes:)*
252252
{ .annotate }
253-
7. Answer is: <b>c</>
253+
7. Answer is: <b>c</b>
254254

255255
8. When you encounter a Python error message that you don't understand while working in a Jupyter Notebook, how can an LLM assist you most effectively?<br>
256256
a) By automatically fixing the code in your notebook.<br>
257257
b) By explaining what the error message typically means, suggesting possible causes, and providing examples of how to fix similar errors.<br>
258258
c) By providing a link to the full Python documentation without context.<br>
259259
d) By advising you to restart your computer.<br>
260260
*(Answer: b)*
261-
*(Answer:(8) :eyes:*
261+
*(Answer:(8) :eyes:)*
262262
{ .annotate }
263-
8. Answer is: <b>b</>
263+
8. Answer is: <b>b</b>
264264

265265
9. What does the `.info()` method in Pandas primarily provide for a DataFrame?<br>
266266
a) A statistical summary of numerical columns (mean, std, min, max).<br>
267267
b) The first five rows of the DataFrame.<br>
268268
c) A concise summary of the DataFrame, including data types of columns and non-null counts.<br>
269269
d) The correlation matrix of numerical columns.<br>
270270
*(Answer: c)*
271-
*(Answer:(8) :eyes:*
271+
*(Answer:(8) :eyes:)*
272272
{ .annotate }
273-
8. Answer is: <b>b</>
273+
8. Answer is: <b>b</b>
274274

275275
10. Which of these tasks falls under the 'Data Cleaning/Preparation' stage of the data science workflow?<br>
276276
a) Defining business objectives.<br>
277277
b) Training a machine learning model.<br>
278278
c) Handling missing values and transforming variables.<br>
279279
d) Presenting results to stakeholders.<br>
280280
*(Answer: c)*
281-
*(Answer:(10) :eyes:*
281+
*(Answer:(10) :eyes:)*
282282
{ .annotate }
283-
1. Answer is: <b>c</>
283+
1. Answer is: <b>c</b>
284284

285285
---
286286

0 commit comments

Comments
 (0)