[ZIPT Benchmark] Z3 c3 branch — 2026-03-12 #8964
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by Qf S Benchmark. A newer discussion is available at Discussion #9031. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Date: 2026-03-12
Branch: c3
Z3 version: 4.17.0 (build hash: acbf5c6)
Benchmark set: QF_S (50 randomly selected files from
tests/QF_S.tar.zst, total pool: 22,172 files)Timeout: 10 seconds per benchmark (
-T:10)Summary
seqsolvernseqsolverSoundness disagreements (seq says sat, nseq says unsat or vice versa): 0 ✅
Notable Issues
Soundness Disagreements (Critical)
None detected. ✅
Crashes / Bugs
None detected. ✅
Performance Gap
nseqis significantly less complete thanseqon this benchmark set:seqsolved 42/50 (84%) benchmarks definitively (sat or unsat)nseqsolved only 12/50 (24%) definitivelynseqreturnedunknownon 38 out of 50 benchmarks, including 22 cases where it hit the 10-second timeout butseqsolved the same instance quicklySlow Benchmarks (> 8s, either solver)
Per-File Results
Click to expand all 50 results
Generated automatically by the QF_S Benchmark workflow on the c3 branch.
Beta Was this translation helpful? Give feedback.
All reactions