Document current relooper algorithm and do minor cleanup #1397

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

randomPoison wants to merge 15 commits into master from legare/relooper-analysis

randomPoison commented Oct 7, 2025 •

edited

Loading

I've gone through the current relooper algorithm and added documentation and comments to clarify what the algorithm is currently doing. The module docs now have an overview of what the algorithm does, and I've added many verbose comments to help understand what each step of the algorithm is doing and why we do the things we do. I've aired on the side of verbosity for many of these comments, trying to capture all of the information I needed to understand the algorithm.

I've also done a small number of cleanup changes in places where the code was doing something unnecessary or was expressed in a way that I found hard to follow. ~~None of the changes should change behavior, expect for the one place where I added a panic! to surface an error case that was previously being silently ignored.~~

Note that this is not meant to be reviewed one commit at a time. Intermediate commits sometimes have incorrect comments that I fix in later commits.

randomPoison and others added 14 commits

September 30, 2025 12:35


          Notes and cleanup and whatnot

b1581f7


          Document transitive_closure and friends

01146e2


          Moar comments

c072a7e


          More judgemental comments

838d8f2


          More comments and such

222ae41


          Document why we rewrite GoTo to ExitTo in loops

35440c4


          Document the logic for matching a C multiple

e0fa89f


          Document how/why absent entries are handled

4aa926a


          Cleanup some comments

2192be1


          Add more comments about loop stuff

d0e9b7c


          Document more of the subtleties of multiple creation

5d29b50


          Assert on unrecognized loop IDs

fd722cc


          Add overview of relooper

e0e159e


          Fix formatting

070bd76

randomPoison changed the title ~~Document current relooper algorithm do minor cleanup~~ Document current relooper algorithm and do minor cleanup

Author

randomPoison commented Oct 7, 2025

Ah, I see my error case that should never happen has happened. I'll look into that and likely just revert the panic! I added.


          DON'T PANIC!!!

1be52a6

fw-immunant self-requested a review

October 7, 2025 18:43

fw-immunant reviewed

View reviewed changes

Contributor

fw-immunant left a comment

Various comments inline.

c2rust-transpile/src/cfg/relooper.rs

Comment on lines +442 to +448

    
                          // Calculate the transitive closure of the successor map, i.e. the full set of

                          // labels reachable from each block, not including itself. Then flip the edges

                          // to get a map from a label to the set of labels that can reach it.

                          let strict_reachable_from = flip_edges(transitive_closure(&successor_map));

                          // Flip the edges of the successor map to get a map of immediate predecessors

                          // for each block.

Contributor

fw-immunant Oct 7, 2025

These comments don't seem to add information beyond exactly what the code is doing, so I'd remove them in favor of concision.

c2rust-transpile/src/cfg/relooper.rs

    
                      // Try to match an existing branch point (from the initial C). See `MultipleInfo` for more

                      // information on this.

                      // Try to match an existing branch point (from the initial C).

                      //

Contributor

fw-immunant Oct 7, 2025

Remove empty comment line..

c2rust-transpile/src/cfg/relooper.rs

    
                      // Try to match an existing branch point (from the initial C). See `MultipleInfo` for more

                      // information on this.

                      // Try to match an existing branch point (from the initial C).

Contributor

fw-immunant Oct 7, 2025

Leave the reference to MultipleInfo, but fix the existing /// TODO: document me in its documentation.

c2rust-transpile/src/cfg/relooper.rs

    
                      // Try to match an existing branch point (from the initial C).

                      //

                      // We do this before creating a loop to better handle the cases where we have

                      // multiple entries that are part of disjoint loops. The loop analysis that

Contributor

fw-immunant Oct 7, 2025

What do you mean by "disjoint" here? In the comment at the top of multiples.rs, it shows two CFG edges (the not-taken side of an if and the exit of a while loop) that coincide. But these two constructs are nested, not disjoint.

c2rust-transpile/src/cfg/relooper.rs

    
                              recognized_c_multiple = true;

                              for (entry, content) in arms {

                              // Search through the arms of the C multiple to verify that our current CFG matches

Contributor

fw-immunant Oct 7, 2025

Calling this a "C multiple" is confusing. Most of the code just calls this a Multiple, even though this information does come from C.

c2rust-transpile/src/cfg/relooper.rs

Comment on lines +504 to +506

    
                                  // Check that we've only visited the expected nodes. If we've visited any nodes

                                  // not in `content`, then the CFG differs from the C multiple and the search

                                  // fails.

Contributor

fw-immunant Oct 7, 2025

The logic here does seem to do what you describe, not what the comment said before. Should it be doing what it said before (ensuring that all of content is in visited rather than vice-versa), though? Intuitively I thought we wanted to find an exact match rather than a (possibly proper) subset.

The opposite direction, ensuring that nodes we visit are always in content, seems to be covered by the if !content.contains(&lbl) { check a few lines earlier.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet