Grammars of Formal Uncertainty

PCFG-based uncertainty quantification for programs. Extracts probabilistic context-free grammars from source code and computes entropy metrics.

Install

pip install -e .

Or with Docker:

docker compose build

Quick Start

Python API

from grammar_uncertainty import GrammarAnalyzer

result = GrammarAnalyzer(grammar="smtlib").analyze("/path/to/programs")
print(result.uncertainty.grammar_entropy, result.uncertainty.perplexity)

CLI

# Setup grammar (downloads ANTLR grammar and generates parser)
grammar-uncertainty setup smtlib

# Analyze a folder of programs
grammar-uncertainty analyze ./programs --grammar smtlib --output results.json

# Start the API server
grammar-uncertainty serve --port 8000

Docker

# Run CLI commands
docker compose run --rm grammar-uncertainty grammar-uncertainty analyze /data/samplesmtfolder --grammar smtlib

# Start API server
docker compose up

Experiments

Run uncertainty metric evaluation on SMT datasets:

# List available datasets
docker compose run --rm grammar-uncertainty python -m experiments list-datasets smtlogs

# Run analysis on a dataset
docker compose run --rm grammar-uncertainty python -m experiments run smtlogs/ProofWriter/03-mini/200-100/o3-mini-200-100

See experiments/README.md for details.

Supported Grammars

Grammar	Extensions	Setup
smtlib	.smt2, .smt	`grammar-uncertainty setup smtlib`
prolog	.pl, .prolog	`grammar-uncertainty setup prolog`

Extensible to any ANTLR v4 grammar. Implement GrammarHandler in src/grammar_uncertainty/grammars/.

Metrics

Grammar Entropy: Shannon entropy of the PCFG
Perplexity: Exponential of cross-entropy
NSUI: Normalized Syntactic Uncertainty Index
KL Divergence: Distance from uniform distribution
Rényi Entropy: Generalized entropy (α-parameterized)

API

curl -X POST localhost:8000/analyze \
  -H "Content-Type: application/json" \
  -d '{"folder_path": "/data/programs", "grammar": "smtlib"}'

Endpoint	Method	Description
/analyze	POST	Analyze programs
/grammars	GET	List grammars
/setup	POST	Setup grammar
/health	GET	Health check

Citation

@inproceedings{
ganguly2025grammars,
title={Grammars of Formal Uncertainty: When to Trust {LLM}s in Automated Reasoning Tasks},
author={Debargha Ganguly and Vikash Singh and Sreehari Sankar and Biyao Zhang and Xuecen Zhang and Srinivasan Iyengar and Xiaotian Han and Amit Sharma and Shivkumar Kalyanaraman and Vipin Chaudhary},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://openreview.net/forum?id=QfKpJ00t2L}
}

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
experiments		experiments
sampleprologfolder		sampleprologfolder
samplesmtfolder		samplesmtfolder
src/grammar_uncertainty		src/grammar_uncertainty
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Grammars of Formal Uncertainty

Install

Quick Start

Python API

CLI

Docker

Experiments

Supported Grammars

Metrics

API

Citation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Grammars of Formal Uncertainty

Install

Quick Start

Python API

CLI

Docker

Experiments

Supported Grammars

Metrics

API

Citation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages