A (Mid)journey Through Reality: Assessing Accuracy, Impostor Bias, and Automation Bias in Human Detection of AI-Generated Images
This repository contains the dataset, stimulus images, and Grad-CAM visualizations for the paper "A (Mid)journey Through Reality" (DOI: 10.1155/hbe2/9977058), published in Human Behavior and Emerging Technologies (2025).
This dataset supports the study investigating whether people are able to distinguish real photographs from AI-generated images, and whether they are subject to automation bias when an algorithmic suggestion is provided, or impostor bias (systematic over-skepticism).
Through a mixed-methods study with 746 participants across 5 distinct experimental variants, we collected 11,170 individual judgments. See the paper for a full discussion of the results.
.
├── data/
│ ├── responses_task.csv ← Processed data: one row per participant × image
│ ├── participants_demographics.csv← Demographics: one row per participant
│ ├── stimuli_manifest.csv ← Metadata for every stimulus image
│ ├── statistical_analysis_results_with_effect_sizes.csv
│ ├── resnet/ ← ResNet-50 predictions used in the study
│ └── raw/ ← Fully anonymized raw CSV exports
├── stimuli/ ← The 75 stimulus images (alpha-epsilon)
└── gradcam/ ← Grad-CAM visualizations (human vs model)
One row per (participant × image) observation — 11,170 rows total (746 participants × 15 images each).
| Column | Description |
|---|---|
participant_id |
Anonymous identifier P0001–P0746 |
variant |
Questionnaire variant (alpha–epsilon) |
image_id |
Foreign key to stimuli_manifest.csv |
ground_truth |
real · AI_midjourney · AI_magnific |
initial_response |
First judgment before algorithm: real or AI |
had_doubt |
Self-reported uncertainty: yes / no |
ai_confidence_score |
Simulated algorithm confidence (0–100 %) displayed to the participant |
post_suggestion_action |
confirm (kept initial choice) or change (switched) |
effective_final_response |
Final answer after the suggestion: real or AI |
One row per stimulus (75 total). Every variant contains exactly 5 real, 5 AI_midjourney, and 5 AI_magnific images, fully counterbalanced across positions.
| Column | Description |
|---|---|
image_id |
Unique identifier, format {variant}_{order:02d} (e.g. alpha_01) |
variant |
Questionnaire variant: alpha, beta, gamma, delta, epsilon |
presentation_order |
Position in the questionnaire (1–15) |
filename |
Original filename, zero-padded (e.g. 01_real.jpg) |
ground_truth |
real · AI_midjourney · AI_magnific |
| Variant | Name | N participants |
|---|---|---|
| 1 | alpha | 256 |
| 2 | beta | 150 |
| 3 | gamma | 142 |
| 4 | delta | 106 |
| 5 | epsilon | 92 |
| Total | 746 |
Each subfolder (alpha through epsilon) contains exactly 15 JPEG files named {presentation_order:02d}_{type}.jpg, where {type} encodes the ground truth:
| Suffix | Meaning | Tool / Source |
|---|---|---|
_real |
Authentic photograph | Publicly licensed stock images |
_m |
Synthetic image | Midjourney v6 |
_mm |
Synthetic image (upscaled/enhanced) | Midjourney v6 + Magnific AI |
All images depict human figures (portraits, groups, or scenes with people) so as to make the detection task ecologically plausible.
- No names, email addresses, IP addresses, or precise timestamps are included in ANY dataset file.
- Free-text observation fields have been completely removed from all files.
- Age was collected as a categorical range.
academic_affiliationis a normalised broad label.- Note: the files in
data/raw/contain the original data exports, but have been fully stripped of personal data (timestamps, emails, and observations).
Unless otherwise noted, the stimulus images and CSV data in this repository are licensed under CC BY 4.0. Reuse, redistribution, and modification are permitted, provided that appropriate attribution is given, changes are indicated, and the following paper is cited:
@article{https://doi.org/10.1155/hbe2/9977058,
author = {Casu, Mirko and Guarnera, Luca and Zangara, Ignazio and Caponnetto, Pasquale and Battiato, Sebastiano},
title = {A (Mid)journey Through Reality: Assessing Accuracy, Impostor Bias, and Automation Bias in Human Detection of AI-Generated Images},
journal = {Human Behavior and Emerging Technologies},
volume = {2025},
number = {1},
pages = {9977058},
doi = {https://doi.org/10.1155/hbe2/9977058},
url = {https://onlinelibrary.wiley.com/doi/abs/10.1155/hbe2/9977058},
year = {2025}
}For questions about the dataset, please contact the corresponding authors (contact information available in the manuscript).
