[ZIPT Benchmark] Z3 c3 branch — 2026-03-06 #8881
This discussion was automatically closed because it expired on 2026-03-13T05:39:52.704Z.
QF_S Benchmark Report — Z3 c3 branch
Date: 2026-03-06
Branch: c3
Benchmark set: QF_S (50 randomly selected files from tests/QF_S.tar.zst)
Timeout: 10 seconds per benchmark (-T:10)
Total files in archive: 22,172
Summary
Soundness disagreements (seq says sat, nseq says unsat or vice versa): 0
Notable Issues
✅ No Soundness Disagreements
Both solvers agreed on all sat/unsat verdicts.
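The disagreement check described above can be sketched as follows. This is a minimal illustration, not the workflow's actual code: the function names and the `results` mapping (file name to a pair of seq and nseq verdicts) are hypothetical. A pair counts as a soundness disagreement only when one solver answers sat and the other unsat; an "unknown" or timeout on either side is a completeness gap, not a disagreement.

```python
from typing import Dict, List, Tuple


def is_soundness_disagreement(seq_verdict: str, nseq_verdict: str) -> bool:
    """True only when one verdict is 'sat' and the other is 'unsat'."""
    return {seq_verdict, nseq_verdict} == {"sat", "unsat"}


def find_disagreements(results: Dict[str, Tuple[str, str]]) -> List[str]:
    """Return the names of files on which the two solvers contradict each other.

    `results` maps a benchmark file name to (seq_verdict, nseq_verdict).
    This layout is an assumption made for the example.
    """
    return [
        name
        for name, (seq_v, nseq_v) in sorted(results.items())
        if is_soundness_disagreement(seq_v, nseq_v)
    ]


if __name__ == "__main__":
    # Hypothetical verdicts, not taken from the report.
    sample = {
        "a.smt2": ("sat", "sat"),
        "b.smt2": ("unsat", "unknown"),  # completeness gap only
        "c.smt2": ("sat", "unsat"),      # would be a soundness bug
    }
    print(find_disagreements(sample))
```

Treating "unknown" as neutral keeps the soundness count separate from the completeness figures reported under Key Observations.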
✅ No Crashes
No crashes or errors detected.
🐌 Slow Benchmarks (> 8s)
Key Observations
nseq solver completeness: The nseq solver returned "unknown" for 39 out of 50 benchmarks (78%), while seq returned "unknown" for only 4 (8%). This suggests the nseq solver has significant completeness issues on these benchmarks.
Performance: The seq solver completed the 50-file suite in 18.353s (avg 0.367s per file), while nseq took 272.689s (avg 5.454s per file), roughly 15 times longer. The nseq solver spent substantial time timing out on benchmarks it couldn't solve.
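The percentages and averages quoted above follow directly from the raw counts and totals in this report. A small sketch that reproduces them (the helper names are illustrative and not part of the benchmark workflow):

```python
TOTAL_FILES = 50  # number of benchmarks sampled in this run


def unknown_rate(unknown_count: int, total: int = TOTAL_FILES) -> float:
    """Percentage of benchmarks on which a solver returned 'unknown'."""
    return 100.0 * unknown_count / total


def average_seconds(total_seconds: float, total: int = TOTAL_FILES) -> float:
    """Mean wall-clock time per benchmark."""
    return total_seconds / total


if __name__ == "__main__":
    # Figures taken from the report above.
    print(f"nseq unknown rate: {unknown_rate(39):.0f}%")
    print(f"seq unknown rate:  {unknown_rate(4):.0f}%")
    print(f"seq average:  {average_seconds(18.353):.3f}s")
    print(f"nseq average: {average_seconds(272.689):.3f}s")
```

Note that 39 unknown results at a full 10-second timeout each would take 390s, more than nseq's 272.689s total, so some of the nseq "unknown" answers must have come back before the timeout.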
Generated automatically by the QF_S Benchmark workflow on the c3 branch.