Use KeYParser.g4 for parsing proof scripts #3021

wadoon · 2023-02-05T14:33:37Z

This PR removes the hand-written parser for proof scripts and uses a few rules in KeYParser.g4 instead.

Since the start of proof scripts, the KeYParser has changed and became an ANTLR4 grammar. This allows us to easily write a grammar for our proof scripts. (Or just copy the few rules from Sarah and mine proof script parser). This eliminates the handwritten proof script parser with the following benefits:

Proof Scripts are a first-class citizen in KeY files. You do not need to put your proof script into a string literal:
```
\proofscript {
    andLeft;
    andRight;  
    rule a ==> b;
}
```
Please note an ambiguity in the grammar: rule b ==> c could either be interpreted as a command with one or two arguments:
1. a semi-sequent b ==> and a term c, or
2. a sequent b ==> c.
3. or a term b and a semi-sequent ==> c.
Use quotes or (better) parentheses to clarify these situations.

The grammar should follow the KeY grammar as best as possible. Backwards-compatibility is tried to preserve.
Earlier and better syntax errors, during reading of the KeY file instead of proof script execution.
Better positioning information, as these are generated by using the ParseContexts.
Proper data structure: no triple of strings is pushed through KeY.
You do not need to put your arguments in quotes. Literals, terms and sequents are parsed.

Breaking Change

The following is now prohibited:
```
my-command x;
```
Commands should be valid KeY identifiers, hence they should be free of dashes (aka subtraction operation).
Single quotes are reserved for character literals. Please only use ".

Backward compatibility

Tried to achieve backward-compatibility.

The proof script can still be given as a string: \proofscript "..."; inside a KeY file.
Term arguments can still be given as a string. The conversion mechanism are still in place to convert the data types in many directions (string to int, string to term, but also term to string, etc.)

Note there is a difference between \f(x)`and"f(x)". The first one is parsed directly as a term and can only be passed to term or string arguments (or any other data type for which a converter is registered). The second one "f(x)"` is parsed as a string. By meta-information on commands, the string is lazily converted to a term if necessary. This conversion is delayed until the command execution.

The term f(x) might be parsed early with the input file, but the expression is evaluated (translated from parse tree to Term) on command-execution time, hence, the use of goal-local variables should be possible.

Side changes to Proof Scripts

"Higher-order proof scripts"

A proof command can receive a block of commands as an argument. Consider this:

try { auto; }

where try is a regular proof command. And { auto; } is a block. This constructs replace tryclose; but gives you more power on what will be tried.

This feature arose by the current complicated version of the AllCommand (e.g., onAll hide \f(x)`) executes the sub-command specified by the arguments on all goals). In SaG scripts we used foreach { ... }` to achieve this behavior.

The extension adapts the command syntax to take a list of sub-commands in curly brackets.

proofScriptCommand: 
    name  args  (  '{'   <sub-command>  '}' ';'? | ';' )
    ;

The new syntax for onAll is also onAll { hide \f(x)`; }. Commands receive the code block using the key #blockinside the map as a parse tree, that can be sent back to theScriptEngineusing#execute(state, statements)`. The new syntax allows building new commands like

try {} -- executes  ignoring errors
repeat { } -- executes  as long as there are changes on the sequent.
matching <term> { } -- execute on all goals where <term> is prescense.

Removal of the XML for command documentation.

We used an XML file to store documentation of proof script commands. This file was deleted, and we use text block literals instead.

Remove of `@`-prefix

TODO

mattulbrich · 2023-02-05T15:04:25Z

I agree that the current parser is more a proof-of-concept solution than future-proof.

However: With many ideas around proof scripts, which should possibly discuss how this should go now: The script debugger has its ways of communicating to the server, scripts can be in .key files, they were at a point interactively editable, and the plan is to have them in JML code. Do they all go to the same background linear script language or is the base language itself more than a sequence of commands?

@WolframPfeifer @jwiesler

mattulbrich

First round of review. Some feedback. Could not review the whole thing.

key.core/src/main/java/de/uka/ilkd/key/nparser/KeyAst.java

key.core/src/main/antlr4/KeYParser.g4

key.core/src/main/java/de/uka/ilkd/key/api/ProofApi.java

key.core/src/main/java/de/uka/ilkd/key/control/KeYEnvironment.java

key.core/src/main/java/de/uka/ilkd/key/macros/scripts/AbstractCommand.java

key.core/src/main/java/de/uka/ilkd/key/macros/scripts/AllCommand.java

key.core/src/main/java/de/uka/ilkd/key/macros/scripts/ExprEvaluator.java

key.core/src/main/java/de/uka/ilkd/key/macros/scripts/LetCommand.java

mattulbrich · 2025-01-20T19:33:46Z

Reply to me-from-the-past:

However: With many ideas around proof scripts, which should possibly discuss how this should go now: The script debugger has its ways of communicating to the server, scripts can be in .key files, they were at a point interactively editable, and the plan is to have them in JML code. Do they all go to the same background linear script language or is the base language itself more than a sequence of commands?

After implementing a/the script language for JML: This was compiled down to a linear script. So it seems that a list of individual proof commands looks like the right level of abstraction.

wadoon · 2025-01-29T21:34:32Z

After implementing a/the script language for JML: This was compiled down to a linear script. So it seems that a list of individual proof commands looks like the right level of abstraction.

This PR gives the current state a proper syntax in alignment with the KeY grammar. So the list of proof commands are still preserved. Giving the commands the possibility to receive a block of sub-commands arises from your AllCommand and allows future extensions w/o needing syntax changes.

We need a quoting symbol for terms for the ambiguities, " is used for string values, and ' is also occupied (do we use it for char literals?). So backticks \`` are a possible option, but we can also go for parentheses, e.g., let @A=(f(x))` etc. I do not care.

key.ui/examples/heap/quicksort/sort.key

key.ui/examples/heap/quicksort/split.key

key.core/src/test/resources/de/uka/ilkd/key/macros/scripts/cases.yml

key.core/src/main/resources/de/uka/ilkd/key/proof/rules/wdFormulaRules.key

mattulbrich · 2025-02-22T10:20:26Z

There seem to be accidental artifacts of other PRs in this one.

The old principle was: Every argument is essentially a string. If it is a single identifier the quotes can be omitted.
Why not keep this convention? I'd vote for this. (It's heavily inspired by "isar.")

If dropping this in favour of type safety, then probably this should be handled more strictly in order to not add confusion instead of removing it.

key.core/src/main/antlr4/KeYParser.g4

mattulbrich · 2025-03-07T20:57:13Z

This PR seems to change the infrastructure quite considerably. It touches > 260 files, some of them rather far from the topic at hand.

I do like the idea to reuse the KeY parser instead of the handmade parser.
Still, there is nothing wrong (I think) with the character bases handmade parser, but the ANTLR machine of course allows for more flexibility and cleaner code.

I was not able to understand the higher order feature, but that looked nice indeed.

I'd argue against the (voluntary) strong typing. The design of the language is untyped. Everything is a string. I would like to stick to this on the KeY level. It is a different story on the JML level.

Can we break this down into manageable units (by cherrypicking aspects e.g.)?

The script proof engine can an overhaul if/when the JML scripts are on master-track.

mattulbrich · 2025-03-07T21:38:43Z

Consider force pushing this to weigl/psg4-cleaned where most non-related aspects have been removed by rebasing the branch.

mattulbrich · 2025-03-07T23:50:40Z

Afterwards consider https://github.com/KeYProject/key/tree/psg4-cleaned-continued where a few ideas from other branches regarding the non-JML bit of proof scripts have come together.

mattulbrich · 2025-03-07T23:54:49Z

The ambiguity of the parser still is really a problem. command a = b ==> can be at least three different calls. Should ANTLR not complain? Parser generators usually have guarantees regarding ambiguities. ...

wadoon · 2025-03-12T17:22:39Z

I'll rebase this PR. Splitting into multiple PRs should only bring complexity.

mattulbrich · 2025-03-17T11:59:38Z

Just read the commit message "Introduce ScriptCommandAst to keep the ProofEngine separated from ANT…"

I'd propose to limit this PR to its original purpose and merge it into master.
Going on from that point with ScriptCommandAPI etc is then up to further development in a next PR.
This would also allow to add new feature in other PRs.

key.core/src/main/antlr4/KeYParser.g4

* adds script block commands arguments

commands

* higher-order commands * integrate documentation

more trimming for windows

wadoon self-assigned this Feb 6, 2023

FliegendeWurst added the KeY Parser label Mar 24, 2023

wadoon closed this Oct 13, 2023

wadoon reopened this Jan 20, 2025

wadoon requested a review from mattulbrich January 20, 2025 18:44

wadoon mentioned this pull request Jan 20, 2025

Missing spaces in .key files are ignored #3538

Closed

mattulbrich reviewed Jan 20, 2025

View reviewed changes

wadoon marked this pull request as ready for review January 20, 2025 21:24

wadoon added this to the v2.12.4 milestone Jan 25, 2025

KeYProject deleted a comment from codecov bot Jan 29, 2025

wadoon requested a review from mattulbrich January 30, 2025 01:50

KeYProject deleted a comment from codecov bot Jan 30, 2025

mattulbrich requested changes Feb 22, 2025

View reviewed changes

mattulbrich reviewed Feb 22, 2025

View reviewed changes

key.core/src/main/antlr4/KeYParser.g4 Show resolved Hide resolved

mattulbrich added the Reviewer Feedback Feedback from the review needs to be addressed label Mar 7, 2025

KeYProject deleted a comment from codecov bot Mar 12, 2025

wadoon force-pushed the weigl/psg4 branch 2 times, most recently from 1b8c2cd to 1477ca8 Compare March 15, 2025 02:14

wadoon force-pushed the weigl/psg4 branch 2 times, most recently from 3a33c99 to 1f94eac Compare March 29, 2025 00:18

wadoon force-pushed the weigl/psg4 branch 3 times, most recently from e936226 to 13136b4 Compare April 11, 2025 17:15

wadoon requested a review from mattulbrich April 12, 2025 15:05

mattulbrich reviewed Apr 15, 2025

View reviewed changes

key.core/src/main/antlr4/KeYParser.g4 Outdated Show resolved Hide resolved

wadoon added 11 commits April 16, 2025 23:18

Use KeYParser.g4 for parsing proof scripts

4179908

reactivate this branch

e8746e4

* adds script block commands arguments

save as much as positional information as possible

ebbbb38

remove description in xml files for proof script

c066ce3

commands

revival of this branch:

1efc749

* higher-order commands * integrate documentation

Allow ":" to make named parameters in proof scripts

5194ab4

remove the "@"-prefix from proof scripts

b226364

move the package away from macro

c129f71

strange formatting errors in key file

bd617db

remove of backticks for escaping, use ( ) instead.

dfeae06

more trimming for windows

no term w/o parentheses

835a347

wadoon force-pushed the weigl/psg4 branch from 49afdd0 to 46048cf Compare April 16, 2025 21:19

fix unit-tests

db46d41

wadoon force-pushed the weigl/psg4 branch from 46048cf to db46d41 Compare April 16, 2025 21:21

wadoon enabled auto-merge April 16, 2025 21:24

wadoon requested a review from mattulbrich April 17, 2025 10:47

wadoon added Proof Scripts Breaking Change and removed Reviewer Feedback Feedback from the review needs to be addressed labels Apr 17, 2025

mattulbrich approved these changes May 9, 2025

View reviewed changes

wadoon added this pull request to the merge queue May 9, 2025

Merged via the queue into KeYProject:main with commit 3f4fd23 May 9, 2025
8 of 9 checks passed

wadoon deleted the weigl/psg4 branch May 9, 2025 13:22

wadoon restored the weigl/psg4 branch December 30, 2025 07:05

Use KeYParser.g4 for parsing proof scripts #3021

Use KeYParser.g4 for parsing proof scripts #3021

Uh oh!

Conversation

wadoon commented Feb 5, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Breaking Change

Backward compatibility

Side changes to Proof Scripts

"Higher-order proof scripts"

Removal of the XML for command documentation.

Remove of @-prefix

Uh oh!

mattulbrich commented Feb 5, 2023

Uh oh!

mattulbrich left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattulbrich commented Jan 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wadoon commented Jan 29, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattulbrich commented Feb 22, 2025

Uh oh!

Uh oh!

mattulbrich commented Mar 7, 2025

Uh oh!

mattulbrich commented Mar 7, 2025

Uh oh!

mattulbrich commented Mar 7, 2025

Uh oh!

mattulbrich commented Mar 7, 2025

Uh oh!

wadoon commented Mar 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattulbrich commented Mar 17, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

wadoon commented Feb 5, 2023 •

edited

Loading

Remove of `@`-prefix

mattulbrich commented Jan 20, 2025 •

edited

Loading

wadoon commented Mar 12, 2025 •

edited

Loading