You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
\subsubsection*{Estimating the ``smoothness'' of knowledge}\label{subsec:smoothness}
1344
1343
1345
-
In the analysis reported in Figure~\ref{fig:smoothness}A, we show how participants' ability to correctly answer quiz questions changes as a function of distance from a given correctly or incorrectly answered reference question.
1346
-
We used a bootstrap-based approach to estimate the maximum distances over which these proportions of correctly answered questions could be reliably distinguished from participants' overall average proportion of correctly answered questions.
1347
-
1348
-
For each of 10,000 iterations, we drew a random subsample (with replacement) of 50 participants from our dataset full dataset.
1349
-
Within each iteration, we first computed the 95\% confidence interval (CI) of the across-subsample-participants mean proportion correct on each of the three quizzes, separately.
1350
-
To compute this interval for each quiz, we repeatedly (1,000 times) subsampled participants (with replacement, from the outer subsample for the current iteration) and computed the mean proportion correct each of these inner subsamples.
1351
-
We then identified the 2.5\textsuperscript{th} and 97.5\textsuperscript{th} percentiles of the resulting distributions of 1,000 means.
1352
-
These three intervals (one for each quiz) served as our thresholds for confidence that the proportion correct within a given distance from a reference question was reliably different (at the $p < 0.05$ significance level) from the average proportion correct across all questions on the given quiz.
1353
-
1354
-
Next, for each participant in the current subsample, and for each of the three quizzes they completed (separately), we iteratively treated each of the 15 questions appearing on the given quiz as the ``reference'' question.
1355
-
We constructed a series of concentric 15-dimensional ``spheres'' centered on the reference question's embedding space coordinate, where each successive sphere's radius increased by 0.01 (correlation distance) between 0 and 2, inclusive (i.e., tiling the range of possible correlation distances with 201 spheres in total).
1356
-
We then computed the proportion of questions enclosed within each sphere that the participant answered correctly, and averaged these per-radius proportion correct scores across reference questions that were answered correctly, and those that were answered incorrectly.
1357
-
This resulted in two number-of-spheres sequences of proportion-correct scores for each subsample participant and quiz: one derived from correctly answered reference questions, and one derived from incorrectly answered reference questions.
1358
-
1359
-
We computed the across-subsample-participants mean proportion correct for each radius value (i.e., sphere) and ``correctness'' of reference question.
1360
-
This yielded two sequences of proportion-correct scores for each quiz, analogous to the blue and red lines displayed in Figure~\ref{fig:smoothness}A, but for the present subsample.
1361
-
For each quiz, we then found the minimum distance from the reference question (i.e., sphere radius) at which each of these two sequences of per-radius proportion correct scores intersected the 95\% confidence interval for the overall proportion correct (i.e., analogous to the black error bands in Fig.~\ref{fig:smoothness}A).
1362
-
1363
-
This resulted in two ``intersection'' distances for each quiz (for correctly answered and incorrectly answered reference questions).
1364
-
Repeating this full process for each of the 10,000 bootstrap iterations output two distributions of intersection distances for each of the three quizzes.
1365
-
The means and 95\% confidence intervals for these distributions are plotted in Figure~\ref{fig:smoothness}B.
1344
+
In the analysis reported in Figure~\ref{fig:smoothness}A, we show how
1345
+
participants' ability to correctly answer quiz questions changes as a function
1346
+
of distance from a given correctly or incorrectly answered reference question.
1347
+
We used a bootstrap-based approach to estimate the maximum distances over which
1348
+
these proportions of correctly answered questions could be reliably
1349
+
distinguished from participants' overall average proportion of correctly
1350
+
answered questions.
1351
+
1352
+
For each of 10,000 iterations, we drew a random subsample (with replacement) of
1353
+
50 participants from our dataset. Within each iteration, we first computed the
1354
+
95\% confidence interval (CI) of the across-subsample-participants mean
1355
+
proportion correct on each of the three quizzes, separately. To compute this
1356
+
interval for each quiz, we repeatedly (1,000 times) subsampled participants
1357
+
(with replacement, from the outer subsample for the current iteration) and
1358
+
computed the mean proportion correct of each of these inner subsamples. We then
1359
+
identified the 2.5\textsuperscript{th} and 97.5\textsuperscript{th} percentiles
1360
+
of the resulting distributions of 1,000 means. These three intervals (one for
1361
+
each quiz) served as our thresholds for confidence that the proportion correct
1362
+
within a given distance from a reference question was reliably different (at
1363
+
the $p < 0.05$ significance level) from the average proportion correct across
1364
+
all questions on the given quiz.
1365
+
1366
+
Next, for each participant in the current subsample, and for each of the three
1367
+
quizzes they completed (separately), we iteratively treated each of the 15
1368
+
questions appearing on the given quiz as the ``reference'' question. We
1369
+
constructed a series of concentric 15-dimensional ``spheres'' centered on the
1370
+
reference question's embedding space coordinate, where each successive sphere's
1371
+
radius increased by 0.01 (correlation distance) between 0 and 2, inclusive
1372
+
(i.e., tiling the range of possible correlation distances with 201 spheres in
1373
+
total). We then computed the proportion of questions enclosed within each
1374
+
sphere that the participant answered correctly, and averaged these per-radius
1375
+
proportion correct scores across reference questions that were answered
1376
+
correctly, and those that were answered incorrectly. This resulted in two
1377
+
number-of-spheres sequences of proportion-correct scores for each subsample
1378
+
participant and quiz: one derived from correctly answered reference questions,
1379
+
and one derived from incorrectly answered reference questions.
1380
+
1381
+
We computed the across-subsample-participants mean proportion correct for each
1382
+
radius value (i.e., sphere) and ``correctness'' of reference question. This
1383
+
yielded two sequences of proportion-correct scores for each quiz, analogous to
1384
+
the blue and red lines displayed in Figure~\ref{fig:smoothness}A, but for the
1385
+
present subsample. For each quiz, we then found the minimum distance from the
1386
+
reference question (i.e., sphere radius) at which each of these two sequences
1387
+
of per-radius proportion correct scores intersected the 95\% confidence
1388
+
interval for the overall proportion correct (i.e., analogous to the black error
1389
+
bands in Fig.~\ref{fig:smoothness}A).
1390
+
1391
+
This resulted in two ``intersection'' distances for each quiz (for correctly
1392
+
answered and incorrectly answered reference questions). Repeating this full
1393
+
process for each of the 10,000 bootstrap iterations output two distributions of
1394
+
intersection distances for each of the three quizzes. The means and 95\%
1395
+
confidence intervals for these distributions are plotted in
1396
+
Figure~\ref{fig:smoothness}B.
1366
1397
1367
1398
\subsubsection*{Creating knowledge and learning map visualizations}\label{subsec:knowledge-maps}
1368
1399
@@ -1378,15 +1409,15 @@ \subsubsection*{Creating knowledge and learning map visualizations}\label{subsec
1378
1409
projection of the text embedding space. Whereas our main analyses used a
1379
1410
15-topic embedding space, we used a 100-topic embedding space for these
1380
1411
visualizations. This change in the number of topics overcame an undesirable
1381
-
behavior in the UMAP embedding procedure, whereby embedding
1382
-
coordinates for the 15-topic model tended to be ``clumped'' into separated
1383
-
clusters, rather than forming a smooth trajectory through the 2D space. When we
1384
-
increased the number of topics to 100, the embedding coordinates in the 2D
1385
-
space formed a smooth trajectory through the space, with substantially less
1386
-
clumping (Fig.~\ref{fig:knowledge-maps}). Creating a ``map'' by
1387
-
sampling this 100-dimensional space at high resolution to obtain an adequate
1388
-
set of topic vectors spanning the embedding space would be computationally
1389
-
intractable. However, sampling a 2D grid is trivial.
1412
+
behavior in the UMAP embedding procedure, whereby embedding coordinates for the
1413
+
15-topic model tended to be ``clumped'' into separated clusters, rather than
1414
+
forming a smooth trajectory through the 2D space. When we increased the number
1415
+
of topics to 100, the embedding coordinates in the 2D space formed a smooth
1416
+
trajectory through the space, with substantially less clumping
1417
+
(Fig.~\ref{fig:knowledge-maps}). Creating a ``map'' by sampling this
1418
+
100-dimensional space at high resolution to obtain an adequate set of topic
1419
+
vectors spanning the embedding space would be computationally intractable.
1420
+
However, sampling a 2D grid is trivial.
1390
1421
1391
1422
At a high level, the UMAP algorithm obtains low-dimensional embeddings by
1392
1423
minimizing the cross-entropy between the pairwise (clustered) distances between
0 commit comments