You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: blog/2025-6-20-ai-review.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -74,7 +74,7 @@ Participant demographics were extracted for each experimental group in the study
74
74
**Preliminary Validation.**
75
75
- We annotated over 220 articles for participant demographics.
76
76
- We observed a high level of accuracy for most fields, notably for participant *count* (\<0.15 Mean Percentage Error).
77
-
- In our annotated sample, we identified 100 individual participant groups with a *diagnosis* labels (e.g. “schizophrenia”). Using BERTScore to quantitatively compare the extracted and annotated diagnoses, the best performing models achieved \>0.8 F1-score, indicating moderate to high accuracy. (higher scores are better).
77
+
- In our annotated sample, we identified 100 individual participant groups with a *diagnosis* labels (e.g. “schizophrenia”). Using BERTScore to quantitatively compare the extracted and annotated diagnoses, the best performing models achieved >0.8 F1-score, indicating moderate to high accuracy. (higher scores are better).
78
78
- Qualitative analysis confirmed that LLMs are increasingly adept at capturing specific diagnostic information (e.g., "Autism Spectrum Disorder", "phobic prone", "eating disorders prone") and associating it correctly with relevant demographic data, even if the specific form differed from the manual annotation.
0 commit comments