Skip to content

Commit a8512d2

Browse files
committed
proper termination, take 2
This PR is an alternative to #99. This is built on top of #116. With this PR, the following test cases now pass correctly: ```julia # Final block is not a `return`: Need to use `config::SelectiveEvalRecurse` explicitly ex = quote x = 1 yy = 7 @Label loop x += 1 x < 5 || return yy @goto loop end frame = Frame(ModSelective, ex) src = frame.framecode.src edges = CodeEdges(ModSelective, src) config = SelectiveEvalRecurse() isrequired = lines_required(GlobalRef(ModSelective, :x), src, edges, config) selective_eval_fromstart!(config, frame, isrequired, true) @test ModSelective.x == 5 @test !isdefined(ModSelective, :yy) ``` The basic approach is overloading `JuliaInterpreter.step_expr!` and `LoweredCodeUtils.next_or_nothing!` for the new `SelectiveEvalController` type, as described below, to perform correct selective execution. When `SelectiveEvalController` is passed as the `recurse` argument of `selective_eval!`, the selective execution is adjusted as follows: - **Implicit return**: In Julia's IR representation (`CodeInfo`), the final block does not necessarily return and may `goto` another block. And if the `return` statement is not included in the slice in such cases, it is necessary to terminate `selective_eval!` when execution reaches such implicit return statements. `controller.implicit_returns` records the PCs of such return statements, and `selective_eval!` will return when reaching those statements. This is the core part of the fix for the test cases in #99. - **CFG short-cut**: When the successors of a conditional branch are inactive, and it is safe to move the program counter from the conditional branch to the nearest common post-dominator of those successors, this short-cut is taken. This short-cut is not merely an optimization but is actually essential for the correctness of the selective execution. This is because, in `CodeInfo`, even if we simply fall-through dead blocks (i.e., increment the program counter without executing the statements of those blocks), it does not necessarily lead to the nearest common post-dominator block. And now [`lines_required`](@ref) or [`lines_required!`](@ref) will update the `SelectiveEvalController` passed as their argument to be appropriate for the program slice generated. One thing to note is that currently, the `controller` is not be recursed. That said, in Revise, which is the main consumer of LCU, there is no need for recursive selective execution, and so `selective_eval!` does not provide a system for inter-procedural selective evaluation. Accordingly `SelectiveEvalController` does not recurse too, but this can be left as a future extension.
1 parent 0f44fa8 commit a8512d2

File tree

6 files changed

+155
-21
lines changed

6 files changed

+155
-21
lines changed

docs/src/api.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,7 @@ lines_required
1717
lines_required!
1818
selective_eval!
1919
selective_eval_fromstart!
20+
SelectiveEvalController
2021
```
2122

2223
## Interpreter

src/codeedges.jl

Lines changed: 131 additions & 19 deletions
Original file line numberDiff line numberDiff line change
@@ -182,6 +182,42 @@ function postprint_linelinks(io::IO, idx::Int, src::CodeInfo, cl::CodeLinks, bbc
182182
return nothing
183183
end
184184

185+
struct CFGShortCut
186+
from::Int # pc of GotoIfNot with inactive 𝑰𝑵𝑭𝑳 blocks
187+
to::Int # pc of the entry of the nearest common post-dominator of the GotoIfNot's successors
188+
end
189+
190+
"""
191+
controller::SelectiveEvalController
192+
193+
When this object is passed as the `recurse` argument of `selective_eval!`,
194+
the selective execution is adjusted as follows:
195+
196+
- **Implicit return**: In Julia's IR representation (`CodeInfo`), the final block does not
197+
necessarily return and may `goto` another block. And if the `return` statement is not
198+
included in the slice in such cases, it is necessary to terminate `selective_eval!` when
199+
execution reaches such implicit return statements. `controller.implicit_returns` records
200+
the PCs of such return statements, and `selective_eval!` will return when reaching those statements.
201+
202+
- **CFG short-cut**: When the successors of a conditional branch are inactive, and it is
203+
safe to move the program counter from the conditional branch to the nearest common
204+
post-dominator of those successors, this short-cut is taken.
205+
This short-cut is not merely an optimization but is actually essential for the correctness
206+
of the selective execution. This is because, in `CodeInfo`, even if we simply fall-through
207+
dead blocks (i.e., increment the program counter without executing the statements of those
208+
blocks), it does not necessarily lead to the nearest common post-dominator block.
209+
210+
These adjustments are necessary for performing selective execution correctly.
211+
[`lines_required`](@ref) or [`lines_required!`](@ref) will update the `SelectiveEvalController`
212+
passed as an argument to be appropriate for the program slice generated.
213+
"""
214+
struct SelectiveEvalController{RC}
215+
inner::RC # N.B. this doesn't support recursive selective evaluation
216+
implicit_returns::BitSet # pc where selective execution should terminate even if they're inactive
217+
shortcuts::Vector{CFGShortCut}
218+
SelectiveEvalController(inner::RC=finish_and_return!) where RC = new{RC}(inner, BitSet(), CFGShortCut[])
219+
end
220+
185221
function namedkeys(cl::CodeLinks)
186222
ukeys = Set{GlobalRef}()
187223
for c in (cl.namepreds, cl.namesuccs, cl.nameassigns)
@@ -591,8 +627,8 @@ function terminal_preds!(s, j, edges, covered) # can't be an inner function bec
591627
end
592628

593629
"""
594-
isrequired = lines_required(obj::GlobalRef, src::CodeInfo, edges::CodeEdges)
595-
isrequired = lines_required(idx::Int, src::CodeInfo, edges::CodeEdges)
630+
isrequired = lines_required(obj::GlobalRef, src::CodeInfo, edges::CodeEdges, [controller::SelectiveEvalController])
631+
isrequired = lines_required(idx::Int, src::CodeInfo, edges::CodeEdges, [controller::SelectiveEvalController])
596632
597633
Determine which lines might need to be executed to evaluate `obj` or the statement indexed by `idx`.
598634
If `isrequired[i]` is `false`, the `i`th statement is *not* required.
@@ -601,21 +637,26 @@ will end up skipping a subset of such statements, perhaps while repeating others
601637
602638
See also [`lines_required!`](@ref) and [`selective_eval!`](@ref).
603639
"""
604-
function lines_required(obj::GlobalRef, src::CodeInfo, edges::CodeEdges; kwargs...)
640+
function lines_required(obj::GlobalRef, src::CodeInfo, edges::CodeEdges,
641+
controller::SelectiveEvalController=SelectiveEvalController();
642+
kwargs...)
605643
isrequired = falses(length(edges.preds))
606644
objs = Set{GlobalRef}([obj])
607-
return lines_required!(isrequired, objs, src, edges; kwargs...)
645+
return lines_required!(isrequired, objs, src, edges, controller; kwargs...)
608646
end
609647

610-
function lines_required(idx::Int, src::CodeInfo, edges::CodeEdges; kwargs...)
648+
function lines_required(idx::Int, src::CodeInfo, edges::CodeEdges,
649+
controller::SelectiveEvalController=SelectiveEvalController();
650+
kwargs...)
611651
isrequired = falses(length(edges.preds))
612652
isrequired[idx] = true
613653
objs = Set{GlobalRef}()
614-
return lines_required!(isrequired, objs, src, edges; kwargs...)
654+
return lines_required!(isrequired, objs, src, edges, controller; kwargs...)
615655
end
616656

617657
"""
618-
lines_required!(isrequired::AbstractVector{Bool}, src::CodeInfo, edges::CodeEdges;
658+
lines_required!(isrequired::AbstractVector{Bool}, src::CodeInfo, edges::CodeEdges,
659+
[controller::SelectiveEvalController];
619660
norequire = ())
620661
621662
Like `lines_required`, but where `isrequired[idx]` has already been set to `true` for all statements
@@ -627,9 +668,11 @@ should _not_ be marked as a requirement.
627668
For example, use `norequire = LoweredCodeUtils.exclude_named_typedefs(src, edges)` if you're
628669
extracting method signatures and not evaluating new definitions.
629670
"""
630-
function lines_required!(isrequired::AbstractVector{Bool}, src::CodeInfo, edges::CodeEdges; kwargs...)
671+
function lines_required!(isrequired::AbstractVector{Bool}, src::CodeInfo, edges::CodeEdges,
672+
controller::SelectiveEvalController=SelectiveEvalController();
673+
kwargs...)
631674
objs = Set{GlobalRef}()
632-
return lines_required!(isrequired, objs, src, edges; kwargs...)
675+
return lines_required!(isrequired, objs, src, edges, controller; kwargs...)
633676
end
634677

635678
function exclude_named_typedefs(src::CodeInfo, edges::CodeEdges)
@@ -649,7 +692,9 @@ function exclude_named_typedefs(src::CodeInfo, edges::CodeEdges)
649692
return norequire
650693
end
651694

652-
function lines_required!(isrequired::AbstractVector{Bool}, objs, src::CodeInfo, edges::CodeEdges; norequire = ())
695+
function lines_required!(isrequired::AbstractVector{Bool}, objs, src::CodeInfo, edges::CodeEdges,
696+
controller::SelectiveEvalController=SelectiveEvalController();
697+
norequire = ())
653698
# Mark any requested objects (their lines of assignment)
654699
objs = add_requests!(isrequired, objs, edges, norequire)
655700

@@ -684,7 +729,10 @@ function lines_required!(isrequired::AbstractVector{Bool}, objs, src::CodeInfo,
684729
end
685730

686731
# now mark the active goto nodes
687-
add_active_gotos!(isrequired, src, cfg, postdomtree)
732+
add_active_gotos!(isrequired, src, cfg, postdomtree, controller)
733+
734+
# check if there are any implicit return blocks
735+
record_implcit_return!(controller, isrequired, cfg)
688736

689737
return isrequired
690738
end
@@ -768,13 +816,14 @@ end
768816
# The basic algorithm is based on what was proposed in [^Wei84]. If there is even one active
769817
# block in the blocks reachable from a conditional branch up to its successors' nearest
770818
# common post-dominator (referred to as 𝑰𝑵𝑭𝑳 in the paper), it is necessary to follow
771-
# that conditional branch and execute the code. Otherwise, execution can be short-circuited
819+
# that conditional branch and execute the code. Otherwise, execution can be short-cut
772820
# from the conditional branch to the nearest common post-dominator.
773821
#
774-
# COMBAK: It is important to note that in Julia's intermediate code representation (`CodeInfo`),
775-
# "short-circuiting" a specific code region is not a simple task. Simply ignoring the path
776-
# to the post-dominator does not guarantee fall-through to the post-dominator. Therefore,
777-
# a more careful implementation is required for this aspect.
822+
# It is important to note that in Julia's intermediate code representation (`CodeInfo`),
823+
# "short-cutting" a specific code region is not a simple task. Simply incrementing the
824+
# program counter without executing the statements of 𝑰𝑵𝑭𝑳 blocks does not guarantee that
825+
# the program counter fall-throughs to the post-dominator.
826+
# To handle such cases, `selective_eval!` needs to use `SelectiveEvalController`.
778827
#
779828
# [Wei84]: M. Weiser, "Program Slicing," IEEE Transactions on Software Engineering, 10, pages 352-357, July 1984.
780829
function add_control_flow!(isrequired, src::CodeInfo, cfg::CFG, postdomtree)
@@ -849,8 +898,8 @@ function reachable_blocks(cfg, from_bb::Int, to_bb::Int)
849898
return visited
850899
end
851900

852-
function add_active_gotos!(isrequired, src::CodeInfo, cfg::CFG, postdomtree)
853-
dead_blocks = compute_dead_blocks(isrequired, src, cfg, postdomtree)
901+
function add_active_gotos!(isrequired, src::CodeInfo, cfg::CFG, postdomtree, controller::SelectiveEvalController)
902+
dead_blocks = compute_dead_blocks!(isrequired, src, cfg, postdomtree, controller)
854903
changed = false
855904
for bbidx = 1:length(cfg.blocks)
856905
if bbidx dead_blocks
@@ -868,7 +917,7 @@ function add_active_gotos!(isrequired, src::CodeInfo, cfg::CFG, postdomtree)
868917
end
869918

870919
# find dead blocks using the same approach as `add_control_flow!`, for the converged `isrequired`
871-
function compute_dead_blocks(isrequired, src::CodeInfo, cfg::CFG, postdomtree)
920+
function compute_dead_blocks!(isrequired, src::CodeInfo, cfg::CFG, postdomtree, controller::SelectiveEvalController)
872921
dead_blocks = BitSet()
873922
for bbidx = 1:length(cfg.blocks)
874923
bb = cfg.blocks[bbidx]
@@ -889,13 +938,31 @@ function compute_dead_blocks(isrequired, src::CodeInfo, cfg::CFG, postdomtree)
889938
end
890939
if !is_𝑰𝑵𝑭𝑳_active
891940
union!(dead_blocks, delete!(𝑰𝑵𝑭𝑳, postdominator))
941+
if postdominator 0
942+
postdominator_bb = cfg.blocks[postdominator]
943+
postdominator_entryidx = postdominator_bb.stmts[begin]
944+
push!(controller.shortcuts, CFGShortCut(termidx, postdominator_entryidx))
945+
end
892946
end
893947
end
894948
end
895949
end
896950
return dead_blocks
897951
end
898952

953+
function record_implcit_return!(controller::SelectiveEvalController, isrequired, cfg::CFG)
954+
for bbidx = 1:length(cfg.blocks)
955+
bb = cfg.blocks[bbidx]
956+
if isempty(bb.succs)
957+
i = findfirst(idx::Int->!isrequired[idx], bb.stmts)
958+
if !isnothing(i)
959+
push!(controller.implicit_returns, bb.stmts[i])
960+
end
961+
end
962+
end
963+
nothing
964+
end
965+
899966
# Do a traveral of "numbered" predecessors and find statement ranges and names of type definitions
900967
function find_typedefs(src::CodeInfo)
901968
typedef_blocks, typedef_names = UnitRange{Int}[], Symbol[]
@@ -1018,6 +1085,42 @@ function add_inplace!(isrequired, src, edges, norequire)
10181085
return changed
10191086
end
10201087

1088+
function JuliaInterpreter.step_expr!(controller::SelectiveEvalController, frame::Frame, @nospecialize(node), istoplevel::Bool)
1089+
if frame.pc in controller.implicit_returns
1090+
return nothing
1091+
elseif node isa GotoIfNot
1092+
for shortcut in controller.shortcuts
1093+
if shortcut.from == frame.pc
1094+
return frame.pc = shortcut.to
1095+
end
1096+
end
1097+
end
1098+
# TODO allow recursion: @invoke JuliaInterpreter.step_expr!(controller::Any, frame::Frame, node::Any, istoplevel::Bool)
1099+
JuliaInterpreter.step_expr!(controller.inner, frame, node, istoplevel)
1100+
end
1101+
1102+
next_or_nothing!(frame::Frame) = next_or_nothing!(finish_and_return!, frame)
1103+
function next_or_nothing!(@nospecialize(recurse), frame::Frame)
1104+
pc = frame.pc
1105+
if pc < nstatements(frame.framecode)
1106+
return frame.pc = pc + 1
1107+
end
1108+
return nothing
1109+
end
1110+
function next_or_nothing!(controller::SelectiveEvalController, frame::Frame)
1111+
if frame.pc in controller.implicit_returns
1112+
return nothing
1113+
elseif pc_expr(frame) isa GotoIfNot
1114+
for shortcut in controller.shortcuts
1115+
if shortcut.from == frame.pc
1116+
return frame.pc = shortcut.to
1117+
end
1118+
end
1119+
end
1120+
# TODO allow recursion: @invoke next_or_nothing!(controller::Any, frame::Frame)
1121+
next_or_nothing!(controller.inner, frame)
1122+
end
1123+
10211124
"""
10221125
struct SelectiveInterpreter{S<:Interpreter,T<:AbstractVector{Bool}} <: Interpreter
10231126
inner::S
@@ -1068,6 +1171,15 @@ See [`selective_eval_fromstart!`](@ref) to have that performed automatically.
10681171
10691172
`isrequired` pertains only to `frame` itself, not any of its callees.
10701173
1174+
When `recurse::SelectiveEvalController` is specified, the selective evaluation execution
1175+
becomes fully correct. Conversely, with the default `finish_and_return!`, selective
1176+
evaluation may not be necessarily correct for all possible Julia code (see
1177+
https://github.com/JuliaDebug/LoweredCodeUtils.jl/pull/99 for more details).
1178+
1179+
Ensure that the specified `controller` is properly synchronized with `isrequired`.
1180+
Additionally note that, at present, it is not possible to recurse the `controller`.
1181+
In other words, there is no system in place for interprocedural selective evaluation.
1182+
10711183
This will return either a `BreakpointRef`, the value obtained from the last executed statement
10721184
(if stored to `frame.framedata.ssavlues`), or `nothing`.
10731185
Typically, assignment to a variable binding does not result in an ssa store by JuliaInterpreter.

src/packagedef.jl

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,8 @@ const trackedheads = (:method,) # Revise uses this (for now), don't delete; a
2424
const structdecls = (:_structtype, :_abstracttype, :_primitivetype)
2525

2626
export signature, rename_framemethods!, methoddef!, methoddefs!, bodymethod
27-
export CodeEdges, lines_required, lines_required!, selective_eval!, selective_eval_fromstart!
27+
export CodeEdges, SelectiveEvalController,
28+
lines_required, lines_required!, selective_eval!, selective_eval_fromstart!
2829

2930
include("utils.jl")
3031
include("signatures.jl")

src/signatures.jl

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -334,6 +334,7 @@ function _rename_framemethods!(interp::Interpreter, frame::Frame,
334334
set_to_running_name!(interp, replacements, frame, methodinfos, selfcalls[idx], calledby, callee, caller)
335335
catch err
336336
@warn "skipping callee $callee (called by $caller) due to $err"
337+
# showerror(stderr, err, stacktrace(catch_backtrace()))
337338
end
338339
end
339340
for sc in selfcalls

test/codeedges.jl

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -217,6 +217,24 @@ module ModSelective end
217217
@test ModSelective.k11 == 0
218218
@test 3 <= ModSelective.s11 <= 15
219219

220+
# Final block is not a `return`: Need to use `controller::SelectiveEvalController` explicitly
221+
ex = quote
222+
x = 1
223+
yy = 7
224+
@label loop
225+
x += 1
226+
x < 5 || return yy
227+
@goto loop
228+
end
229+
frame = Frame(ModSelective, ex)
230+
src = frame.framecode.src
231+
edges = CodeEdges(ModSelective, src)
232+
controller = SelectiveEvalController()
233+
isrequired = lines_required(GlobalRef(ModSelective, :x), src, edges, controller)
234+
selective_eval_fromstart!(controller, frame, isrequired, true)
235+
@test ModSelective.x == 5
236+
@test !isdefined(ModSelective, :yy)
237+
220238
# Control-flow in an abstract type definition
221239
ex = :(abstract type StructParent{T, N} <: AbstractArray{T, N} end)
222240
frame = Frame(ModSelective, ex)

test/signatures.jl

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -414,9 +414,10 @@ bodymethtest5(x, y=Dict(1=>2)) = 5
414414
oldenv = Pkg.project().path
415415
try
416416
# we test with the old version of CBinding, let's do it in an isolated environment
417+
# so we don't cause package conflicts with everything else
417418
Pkg.activate(; temp=true, io=devnull)
418419

419-
@info "Adding CBinding to the environment for test purposes"
420+
@info "Adding CBinding v0.9.4 to the environment for test purposes"
420421
Pkg.add(; name="CBinding", version="0.9.4", io=devnull) # `@cstruct` isn't defined for v1.0 and above
421422

422423
m = Module()

0 commit comments

Comments
 (0)