assign `side_effects` and `occurrence` as attributes of `@code` #223

m7pr · 2024-11-04T14:49:34Z

Part of #216

Changes:

moved dependency extraction from get_code_dependency to eval_code
removed extract_code_graph
extended documentation of qenv with 1 new attributes: dependency, occurrence
merged side_effects and occurrence inside eval_code as they were previously joined in extract_code_graph anyway
created tests for qenv() |> eval_code |> get_code_attr("dependency")
changed extract_side_effects and extract_occurrence so they work on an element, and they don't use lapply

R/qenv-eval_code.R

… in eval_code

…qenv with 2 new attributes: side_effects, occurrence

…object instead of a list

m7pr · 2024-11-05T11:55:12Z

Hey @gogonzo - if you have some spare time, this PR is ready for the first review.
It aims for transition of code_graph calculations from get_code_dependency into single calls in eval_code

R/utils-get_code_dependency.R

gogonzo · 2024-11-05T12:17:55Z

R/qenv-eval_code.R

+
+    pd <- utils::getParseData(current_call)
+    pd <- normalize_pd(pd)
+    call_pd <- extract_calls(pd)[[1]]


extract_calls while calls are already split?

extract_occurrence assume that pd has a specific order of rows, that gets changed when you apply extract_calls(pd) on pd.

I can move call_pd <- extract_calls(pd)[[1]] inside extract_occurrence. Or is it ok to keep in eval_code?

Let me ask another question - why to reorder pd?

extract_occurrence assumes object names (SYMBOL) appear after <- (LEFT_ASSIGN/ASSIGN).

extract_calls reorders pd in this way.

code <- "a<-1" parsed_code <- parse(text = code, keep.source = TRUE) pd <- utils::getParseData(parsed_code) > pd line1 col1 line2 col2 id parent token terminal text 7 1 1 1 4 7 0 expr FALSE 1 1 1 1 1 1 3 SYMBOL TRUE a 3 1 1 1 1 3 7 expr FALSE 2 1 2 1 3 2 7 LEFT_ASSIGN TRUE <- 4 1 4 1 4 4 5 NUM_CONST TRUE 1 5 1 4 1 4 5 7 expr FALSE > extract_calls(pd) line1 col1 line2 col2 id parent token terminal text 7 1 1 1 4 7 0 expr FALSE 3 1 1 1 1 3 7 expr FALSE 2 1 2 1 3 2 7 LEFT_ASSIGN TRUE <- 5 1 4 1 4 5 7 expr FALSE 1 1 1 1 1 1 3 SYMBOL TRUE a 4 1 4 1 4 4 5 NUM_CONST TRUE 1

If we don't want to use extract_calls we need to revisit and rewrite extract_occurrence. I am not sure we will invent rules to figure out dependencies for a non-sorted pd.

So right now this is a matter of whether we

use extract_calls on pd and before we apply pd inside extract_occurrence

or whether we put extract_calls inside extract_occurrence

or whether we rewrite extract_occurrence.

extract_occurrence is a pretty big beast : P

Maybe we can simplify extract_calls part so that it only reorders

I just played with what parts of extract_calls are needed so that extract_occurrence works and below are the needed parts:

# reordering basen on parent-children relation parent_ids <- pd[pd$parent == 0 & (pd$token !e= "COMMENT" | grepl("@linksto", pd$text, fixed = TRUE)), "id"] pd_order <- do.call(rbind, lapply(parent_ids, function(parent) rbind(pd[pd$id == parent, ], get_children(pd, parent)))) # filtering or edge cases if (!is.null(pd_order) && !(nrow(pd_order) == 1 && call$token == "';'")) { # fixing assignment arrows pd_order <- fix_arrows(list(pd_order))[[1]] attr(current_code, "dependency") <- c(extract_side_effects(pd_order), extract_occurrence(pd_order)) }

So this is basically all of extract_calls without fix_shifted_comments that is skipped if there is only 1 call.

If extract_calls would be renamed to reorder_and_clean_calls then I suppose usage of this function is totally justified in here. We can always put this part in extract_occurrence but it's gonna be repeated in extract_calls and extract_occurrence

R/utils-get_code_dependency.R

m7pr · 2024-11-06T09:19:54Z

Hey @gogonzo I also needed to change the way we create teal.data::teal_data()
a0d4890b7816b614ba353be8db2175ccfb6c39ed
so that it's @code field corresponds to the changes made in here.

There is a bit code from teal.code for unexported functions. Should we export something in teal.code, so that it could be reused in teal.data?

…ndency attributes

R/qenv-eval_code.R

gogonzo · 2024-11-06T11:44:08Z

R/qenv-eval_code.R

    }

    attr(current_code, "id") <- sample.int(.Machine$integer.max, size = 1)
+    attr(current_code, "dependency") <- extract_dependency(current_call)


This loop looks nice now. extract_dependency is a good move 👍

R/utils-get_code_dependency.R

tests/testthat/test-qenv_eval_code.R

gogonzo

All good here 👍 Please see the comment below

R/utils-get_code_dependency.R

Co-authored-by: Dawid Kałędkowski <[email protected]> Signed-off-by: Marcin <[email protected]>

assign side_effects and occurrence as attributes of code

f86bc13

m7pr added the core label Nov 4, 2024

m7pr mentioned this pull request Nov 4, 2024

211 [.qenv S3 method + replacement of @id, @warnings, and @messages fields #216

Merged

8 tasks

donyunardi assigned gogonzo Nov 4, 2024

gogonzo reviewed Nov 4, 2024

View reviewed changes

R/qenv-eval_code.R Outdated Show resolved Hide resolved

m7pr added 4 commits November 5, 2024 10:48

do not remove curly brakcets as its not needed, and reuse parsed_code…

291cf10

… in eval_code

update documentation of extract_code_graph + extend documentation of …

b04d078

…qenv with 2 new attributes: side_effects, occurrence

extract_side_effects and extract_occurrence can now work on a single …

e49187a

…object instead of a list

add a test for dependency attribute

b65f18d

m7pr requested a review from gogonzo November 5, 2024 11:54

rewrite detect_libraries to work on graph

36444b1

gogonzo reviewed Nov 5, 2024

View reviewed changes

m7pr added 2 commits November 5, 2024 13:43

clean check_names in get_code_dependency

4ea2ae2

bring back extract_call for the assignment check

e4fdcd4

m7pr commented Nov 5, 2024

View reviewed changes

R/utils-get_code_dependency.R Outdated Show resolved Hide resolved

m7pr added 2 commits November 6, 2024 11:43

create a wrapper that changes the code into a list of calls with depe…

76b0dd8

…ndency attributes

move code2list to teal.data

839fca6

gogonzo reviewed Nov 6, 2024

View reviewed changes

m7pr added 3 commits November 6, 2024 13:52

simplify the check_names condition in get_code_dependency

a8016bc

remove trimws

8d3649e

split tests

90fab4f

m7pr requested a review from gogonzo November 6, 2024 12:58

gogonzo approved these changes Nov 6, 2024

View reviewed changes

R/utils-get_code_dependency.R Outdated Show resolved Hide resolved

m7pr and others added 2 commits November 6, 2024 14:05

Update R/utils-get_code_dependency.R

751e6ca

Co-authored-by: Dawid Kałędkowski <[email protected]> Signed-off-by: Marcin <[email protected]>

reorder and put more explanation to usage of extract_calls

bcc9c5f

m7pr merged commit 6f03292 into 211_subset@main Nov 6, 2024
1 check passed

m7pr deleted the 211_subset_simplify@main branch November 6, 2024 13:09

github-actions bot locked and limited conversation to collaborators Nov 6, 2024

Uh oh!

assign side_effects and occurrence as attributes of @code #223

assign side_effects and occurrence as attributes of @code #223

Uh oh!

Conversation

m7pr commented Nov 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

m7pr commented Nov 5, 2024

Uh oh!

Uh oh!

gogonzo Nov 5, 2024

Choose a reason for hiding this comment

Uh oh!

m7pr Nov 5, 2024

Choose a reason for hiding this comment

Uh oh!

gogonzo Nov 5, 2024

Choose a reason for hiding this comment

Uh oh!

m7pr Nov 5, 2024

Choose a reason for hiding this comment

Uh oh!

m7pr Nov 5, 2024

Choose a reason for hiding this comment

Uh oh!

m7pr Nov 5, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

m7pr commented Nov 6, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

gogonzo Nov 6, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

gogonzo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

assign `side_effects` and `occurrence` as attributes of `@code` #223

assign `side_effects` and `occurrence` as attributes of `@code` #223

m7pr commented Nov 4, 2024 •

edited

Loading

m7pr commented Nov 6, 2024 •

edited

Loading