The FReD Replication Extractor is a tool designed to support the Federal Reserve Database (FReD) in processing and analyzing replication studies in economics. This semi-automated tool helps keep pace with the expanding replication literature by extracting key metadata from published replication studies.
The tool serves two primary objectives:
- Original Study Identification: Identify which original study a replication paper targets
- Replication Outcome Analysis: Determine the reported replication outcome (success/mixed/failure)
- Combines text matching and LLM-based inference techniques
- Stores transparent justifications for conclusions
- Facilitates simple and auditable validation of outputs
The system consists of three main components:
- Text Preprocessor: Handles document parsing and text normalization
- Inference Engine:
- Uses LLM for context understanding
- Applies text matching algorithms
- Generates confidence scores
- Output Generator: Creates structured output with justifications
- Clone the repository:
git clone https://github.com/ebel-frank/fred_repl_extractor.git
cd fred_repl_extractor- Install dependencies:
pip install -r requirements.txt- Basic usage:
python main.pyDOI_r: 10.1177/0956797617734315
Title_r: Does Smile Intensity in Photographs Really Predict Longevity? A Replication and Extension of Abel and Kruger (2010)
Reference_r: Dufner Michael, Brümmer Martin, Chung Joanne M. et al. (2017). Does Smile Intensity in Photographs Really Predict Longevity? A Replication and Extension of Abel and Kruger (2010). Psychological Science, 29(1), 147-153.
DOI_o: 10.1177/0956797610363775
Title_o: Smile Intensity in Photographs Predicts Longevity
Reference_o: Abel Ernest L., Kruger Michael L. (2010). Smile Intensity in Photographs Predicts Longevity. Psychological Science, 21(4), 542-544.
Abstract: <jats:p> Abel and Kruger (2010) found that the smile intensity of professional baseball players who were active in 1952, as coded from photographs, predicted these players’ longevity. In the current investigation, we sought to replicate this result and to extend the initial analyses. We analyzed (a) a sample that was almost identical to the one from Abel and Kruger’s study using the same database and inclusion criteria ( N = 224), (b) a considerably larger nonoverlapping sample consisting of other players from the same cohort ( N = 527), and (c) all players in the database ( N = 13,530 valid cases). Like Abel and Kruger, we relied on categorical smile codings as indicators of positive affectivity, yet we supplemented these codings with subjective ratings of joy intensity and automatic codings of positive affectivity made by computer programs. In both samples and for all three indicators, we found that positive affectivity did not predict mortality once birth year was controlled as a covariate. </jats:p>
Confidence: high
Outcome: failed
Proof: In both samples and for all three indicators, we found that positive affectivity did not predict mortality once birth year was controlled as a covariate.
Similarity: 100%- Create a
.envfile in the project root:
GOOGLE_API_KEY=your_api_key_here- Or set environment variables:
# Windows PowerShell
$env:GOOGLE_API_KEY="your_api_key_here"
# Linux/Mac
export GOOGLE_API_KEY="your_api_key_here"The system employs a multi-factor confidence scoring mechanism:
- Text Matching Score (0-100): Based on abstract and proof similarity
- Context Score (high, medium, low): LLM evaluation of contextual references