Conversation

@jerhard (Member) commented Aug 11, 2022

This PR imports benchmarks that were taken from https://gitlab.com/sosy-lab/software/regression-verification-tasks/-/tree/new-svcomp-spec. Further information on these benchmarks can be found here.

The benchmarks were processed with incremental-ldv.py to generate the patch files and to create the benchmark sets in index/sets/ldv/. The file index/sets/ldv/combined.yaml contains all imported benchmarks; the other yaml files in index/sets/ldv/ contain the benchmarks per directory.

Problems

There are some significant problems with this benchmark set. In particular:

  • Most of the benchmarks contain syntax errors that prevent Goblint from parsing them.
  • The benchmarks Goblint does run on have very little live code. I checked ten benchmarks, and the reported amount of live code varies between 20 and 230 lines. @michael-schwarz and I inspected one benchmark manually and confirmed that very little code is indeed live.
  • Because so little code is live, run times on many of the working benchmarks are under ~3 seconds.
  • The patches tend to be large, and they are patches of CIL files. While there are some patches for which Goblint detects only 1-2 changed functions, a considerable number of the patches Goblint can run on result in more than 50 changed functions.

@sim642 sim642 marked this pull request as draft August 11, 2022 11:27
@sim642 sim642 added the new benchmark New benchmark to analyze label Aug 13, 2022
@sim642 (Member) commented Aug 13, 2022

  • Most of the benchmarks contain syntax errors that prevent Goblint from parsing them.

It would be useful to have a CIL issue about this parsing error.

@michael-schwarz (Member) commented
IIRC they were actual errors (e.g., the type of a function's implementation not matching the prototype given in the same file) that GCC also flags. If we want to do something here, I don't think we should patch CIL to accept invalid programs, but rather fix the benchmarks.
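
A minimal sketch of this kind of error (a hypothetical file, not taken from the benchmark set): a definition whose return type conflicts with an earlier prototype in the same translation unit. Both GCC and CIL reject this as invalid C.

```shell
# Hypothetical reproduction of the error class described above:
# a prototype and a definition with conflicting types in one file.
cat > mismatch.c <<'EOF'
int f(int x);          /* prototype: returns int */

double f(int x) {      /* definition: returns double -> conflicting types */
    return x * 2.0;
}
EOF

# GCC refuses to compile this ("error: conflicting types for 'f'").
if gcc -c mismatch.c -o mismatch.o 2> gcc-err.txt; then
    echo "compiled"
else
    echo "rejected"
fi
```

Fixing the benchmarks would mean making the prototype and definition agree, rather than teaching CIL to accept the mismatch.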

