
Add LLM-based tool for auditing Rocq spec–implementation consistency #10

Open
Valentin889 wants to merge 7 commits into main from vs/autoformalization/comment-code-audit

Conversation

@Valentin889
Collaborator


This PR introduces a tool to automatically audit the consistency between specification comments and their Rocq implementations in the mechanization.

The tool extracts Definition and Fixpoint blocks from .v files and sends them to an LLM to check whether the implementation syntactically follows the steps described in the specification comments. The goal is to help detect typos, missing steps, or mismatches introduced during the manual translation of the ECMAScript specification into Rocq.
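The extraction step described above can be sketched as a small Python helper. This is a hypothetical illustration, not the actual code from comment_code_audit.py; the regex-based heuristic (a block starts with `Definition` or `Fixpoint` and ends at the first `.` at end of line) is an assumption about how such blocks could be pulled out of a .v file.

```python
import re

# Heuristic sketch: a Rocq Definition/Fixpoint block starts at the
# keyword and runs to the first period at the end of a line. The real
# comment_code_audit.py may use a different extraction strategy.
BLOCK_RE = re.compile(
    r"^(Definition|Fixpoint)\s+\w+.*?\.\s*$",
    re.MULTILINE | re.DOTALL,
)

def extract_blocks(source: str) -> list[str]:
    """Return every Definition/Fixpoint block found in a .v source string."""
    return [m.group(0) for m in BLOCK_RE.finditer(source)]
```

Each extracted block (together with its specification comment) would then be embedded into an audit prompt and sent to the LLM.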

Added components

  • comment_code_audit.py
    Main script that:

    • Extracts definitions from Rocq files
    • Queries an LLM with configurable prompts
    • Stores results
    • Generates an HTML review report
  • Configuration files

    • config.json – model configuration and generation parameters
    • prompts.json – system prompt and audit prompts
  • Environment setup

    • environment.yml – Conda environment definition
    • requirements.txt – Python dependencies
  • Project utilities

    • .gitignore – ignores .env and generated results/
    • README.md – documentation and usage instructions
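For illustration, a config.json carrying "model configuration and generation parameters" might look like the following. All field names and values here are assumptions for the sketch, not the contents of the actual file:

```json
{
  "model": "MODEL_NAME",
  "temperature": 0.0,
  "max_tokens": 2048,
  "results_dir": "results"
}
```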

Output

Running the script produces:

  • JSON logs containing the full prompts and model responses
  • HTML reports for easier manual inspection

Results are stored under:

results/YYYY-MM-DD/MODEL_NAME/
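The per-run directory layout above can be built with a short helper like this one. It is a sketch of the stated layout (date, then model name), not the script's actual code; the function name is hypothetical.

```python
from datetime import date
from pathlib import Path

def results_dir(model_name: str, base: str = "results") -> Path:
    """Build the output directory results/YYYY-MM-DD/MODEL_NAME/
    described above (hypothetical helper, not from the PR itself)."""
    return Path(base) / date.today().isoformat() / model_name
```

Keying the path on the date and model name keeps audit runs from different days, or against different models, from overwriting each other.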

@Valentin889 Valentin889 self-assigned this Mar 16, 2026
