A system for processing and analyzing FEMA disaster declaration forms (Form 010-0-13).
This project aims to develop an intelligent assistant to help state and local government officials navigate the complex process of applying for federal assistance after a natural disaster. The system uses LLMs to process disaster declaration forms and extract structured data; next, we'll develop and benchmark the capability to fill out forms automatically.
- Implemented PDF parsing pipeline using DocETL with 97.5% field extraction accuracy
- Created storage system for managing declaration documents and metadata
- Added capabilities to match declarations with FEMA disaster IDs and fetch associated Preliminary Damage Assessment (PDA) reports
- Storage System: UUID-based document management with metadata tracking
- Parsing Pipeline: Extracts structured data from PDFs using LLMs
- Evaluation Pipeline: Verifies extraction accuracy against ground truth (12 reports manually parsed/transcribed)
- FEMA Data Integration: Matches declarations with official FEMA disaster IDs and PDA reports
- Python 3.8+
- DocETL (
pip install docetl) - Required Python packages:
requirements.txt
- Clone this repository
- Install dependencies:
pip install -r requirements.txt - Download the dataset, available here:
metadata.jsonl- Declaration metadatapdfs.zip- PDF documents
# Set up the declaration repository with PDFs and metadata
python scripts/setup_declarations.py --pdf-archive path/to/pdfs.zip --jsonl-file path/to/metadata.jsonl# Parse declarations from storage
python -m fema_agent.parse --storage-dir data/processed/all-declarations --outpath parsed_results.json --model gemini-2.0-flash-lite
# Update storage with parsed results
python -m fema_agent.parse --storage-dir data/processed/all-declarations --outpath parsed_results.json --update-storage# Check parsed results against ground truth
python -m fema_agent.check parsed_results.json --ground-truth data/ground_truth/test_set_truth.jsonsrc/fema_agent/- Core agent codestorage.py- Document storage systemparse.py- Parsing pipelinecheck.py- Evaluation pipelineforms/- Form field definitions
scripts/- Utility scripts for setup and data processing/linkingexperiments/- Evaluation results and experiments