Replication and extension of Turpin et al. (2023) on unfaithfulness in Chain-of-Thought prompting. Contributions by members of the Evals Reading Group.
- Author 1 (Institution)
- Author 2 (Institution)
- Author 3 (Institution)
# Install uv
curl -LsSf https://astral.sh/uv/install.sh | sh
# Clone and install
git clone https://github.com/mjbroerman/cot_unfaithfulness_replication.git
cd cot_unfaithfulness_replication
uv sync
# Configure API keys
cp .env.example .env
# Edit .env with your keys# PR submission (limited, tracked in git)
uv run python run_experiments.py --mode pr
# Full replication
uv run python run_experiments.py --mode full
quarto render@misc{author2025cot,
title={Chain-of-Thought Unfaithfulness: A Replication Study},
author={Author 1 and Author 2 and Author 3},
year={2025},
howpublished={\url{https://github.com/mjbroerman/cot_unfaithfulness_replication}}
}[MIT/Apache-2.0/CC-BY-4.0]
This project was supported by BlueDot Impact and the organizers of the Evals Reading Group.