Team: Chinmay Dalal & Snehashish Reddy Manda
Course: COMP790‑199
Proposal Type: Systems Research Project
SQuire is an exploration into how Large Language Models (LLMs) can help automatically synthesize static analysis checkers — the tools that detect bugs in large codebases like the Linux kernel.
Traditional static analyzers are often hand‑written, expensive to maintain, and limited to predefined bug patterns. Our aim is to see if LLMs can learn bug patterns directly from historical bug‑fix patches, generate targeted static checkers (specifically for the Clang Static Analyzer), and refine them over time.
In short:
Instead of using LLMs to scan code directly, we use them to create the tools that do.
Our idea is inspired by the KNighter (SOSP ’25) paper, which demonstrated an LLM‑driven approach to synthesizing static checkers. While KNighter targeted a broad range of bugs, SQuire focuses on simple, intra‑procedural fixes (e.g., Null Pointer Dereference, Use-Before-Initialization) to maximize precision and reduce hallucination.
We have built an end-to-end pipeline:
- **Patch Mining** (`src/filter_commits.py`) → Gather and curate relevant Linux kernel bug-fix patches.
- **Agentic Pipeline** (`src/agentic_pipeline.py`) → An LLM-driven loop that:
  - Extracts the abstract bug pattern.
  - Synthesizes a detection plan.
  - Generates executable C++ code for a Clang Static Analyzer checker.
- **Validation** → Compile and run the checker against test cases and historical kernel versions.
1. **Prerequisites**: Ensure you are on a Linux distro (Arch/Manjaro recommended for the latest LLVM) and have:
   - Python 3.10+
   - Clang/LLVM 20
   - `git`, `make`, `gcc`
2. **Clone & Submodules**:

   ```bash
   git clone https://github.com/srmanda-cs/SQuire.git
   cd SQuire
   git submodule update --init --recursive
   ```

3. **Python Environment**:

   ```bash
   python -m venv .venv
   source .venv/bin/activate
   pip install -r requirements.txt
   ```

4. **Environment Variables**: Create a `.env` file in the root directory:

   ```
   API_KEY=<your_openai_compatible_api_key>
   BASE_URL=<your_openai_compatible_base_url>
   LLM_MODEL=<your_chosen_model_name>
   ```
To run the full agentic loop (Pattern Extraction → Plan → Code Generation):

```bash
python src/agentic_pipeline.py
```

This will read from `mined_patches_curated/`, interact with the LLM, and output a `GeneratedNPDChecker.cpp` file.
Once a checker has been generated (or using the pre-generated example), you can verify it using our smoke test harness.
Navigate to the test directory:

```bash
cd smoke_test/simple_tool
```

Build the checker:

```bash
make clean
make
```

This compiles the C++ checker into a shared object (`libNPDChecker.so`).
Run the analysis:
```bash
clang -Xclang -load -Xclang ./libNPDChecker.so \
      -Xclang -analyze \
      -Xclang -analyzer-checker=squire.NPDChecker \
      test.c
```

**Expected Output:**
You should see a warning pointing to the specific line in `test.c` where the bug exists:

```
test.c:10:8: warning: Result of a possibly failing allocation or metadata access is used without a preceding NULL check [squire.NPDChecker]
   10 |     *p = 42;
      |     ~~ ^
```
| Member | Responsibilities |
|---|---|
| Chinmay Dalal | Kernel infrastructure, Tooling (LLVM/Clang), Checker Refinement |
| Snehashish Reddy | LLM Pipeline (Prompts, Agentic Loop), Project Vision, Smoke Testing |
- Yang, C., et al. (2025). KNighter: Transforming Static Analysis with LLM‑Synthesized Checkers. SOSP '25.
Apache License 2.0. See LICENSE.