Experimental optimisations, proposed by ChatGPT. #337


Merged: 15 commits merged into main from fandango_experiment on Aug 7, 2025

Conversation

neilwalkinshaw
Contributor

@neilwalkinshaw neilwalkinshaw commented Jul 11, 2025

Prompt: "Can you speed this up?", pasting in CausalDag.enumerate_minimal_adjustment_sets.

Summary of changes:

- moral_graph[t] instead of nx.neighbors(...): accesses the adjacency dict directly, avoiding the overhead of the nx.neighbors call.
- update(...) instead of a union with a comprehension: avoids building intermediate sets.
- Eliminated intermediate list conversions: avoids unnecessary copies.
- Renamed variables (pbd_graph, ancestor_graph, etc.) and stored intermediate results: improves clarity and prevents redundant calls.
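The adjacency-access and update(...) points can be sketched on a plain dict-of-sets graph standing in for the NetworkX moral graph (the real change swaps nx.neighbors(graph, node) for graph[node]; the helper and variable names below are illustrative, not the library's):

```python
# A plain dict-of-sets graph as a stand-in for a NetworkX moral graph.
moral_graph = {
    "t": {"a", "b"},
    "a": {"t", "b"},
    "b": {"t", "a"},
}

def closure_slow(graph, seeds):
    # Before: a comprehension-built union creates a throwaway set per node.
    result = set(seeds)
    for node in seeds:
        result = result.union({n for n in graph[node]})
    return result

def closure_fast(graph, seeds):
    # After: direct adjacency lookup plus in-place update, no intermediates.
    result = set(seeds)
    for node in seeds:
        result.update(graph[node])
    return result

print(closure_fast(moral_graph, {"t"}))  # {'t', 'a', 'b'} in some order
```

Both versions compute the same closure; the second simply never allocates the intermediate sets.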

Same applied to list_all_min_sep

Summary of changes:

- graph[node] instead of nx.neighbors(graph, node): faster direct adjacency access.
- treatment_component = ... followed by break: avoids unnecessary iteration once the component containing the treatment is found.
- sample(sorted(...), 1)[0]: returns a single value rather than a set, avoiding later unpacking.
- Removed repeated set(...) wrapping: reduces garbage-collection pressure and memory allocations.

Same applied to constructive_backdoor_criterion.

- Avoids repeated set(self.nodes) calls: self.nodes is probably already a set-like iterable.
- Combines descendant updates efficiently: reduces the overhead of set.union with unpacking.
- Avoids constructing the logger message unless needed: a significant saving when the logging level is above INFO.
- Clearer and faster condition check with & (set intersection): cleaner than chaining issubset(difference(...)).
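The last two points can be sketched as follows; the set names (proper_causal_path, adjustment_set) are illustrative, not the library's variables:

```python
import logging

logging.basicConfig(level=logging.WARNING)
logger = logging.getLogger("causal_dag_sketch")

proper_causal_path = {"m1", "m2"}
adjustment_set = {"z1", "m1"}

# %-style arguments are only interpolated if INFO is actually enabled,
# so nothing is formatted here while the level is WARNING.
logger.info("Adjustment set %s intersects the proper causal path %s",
            adjustment_set, proper_causal_path)

# A non-empty intersection reads more directly than chaining
# issubset/difference calls to express the same condition.
overlaps = bool(adjustment_set & proper_causal_path)
print(overlaps)  # True: m1 lies on the proper causal path
```

With f-string or %-formatted message strings built eagerly, the formatting cost is paid even when the record is discarded; passing the arguments separately defers it.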


github-actions bot commented Jul 11, 2025

🦙 MegaLinter status: ✅ SUCCESS

Descriptor Linter Files Fixed Errors Elapsed time
✅ PYTHON black 32 0 0.95s
✅ PYTHON pylint 32 0 5.45s

See detailed report in MegaLinter reports



codecov bot commented Jul 11, 2025

Codecov Report

❌ Patch coverage is 98.75000% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 95.67%. Comparing base (ad50cfb) to head (9816b98).
⚠️ Report is 17 commits behind head on main.

Files with missing lines Patch % Lines
causal_testing/specification/causal_dag.py 98.63% 1 Missing ⚠️

@@            Coverage Diff             @@
##             main     #337      +/-   ##
==========================================
- Coverage   95.78%   95.67%   -0.12%     
==========================================
  Files          27       27              
  Lines        1638     1618      -20     
==========================================
- Hits         1569     1548      -21     
- Misses         69       70       +1     
Files with missing lines Coverage Δ
causal_testing/main.py 96.17% <100.00%> (ø)
...sal_testing/surrogate/causal_surrogate_assisted.py 100.00% <100.00%> (ø)
causal_testing/testing/metamorphic_relation.py 100.00% <100.00%> (ø)
causal_testing/specification/causal_dag.py 98.88% <98.63%> (-0.62%) ⬇️

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@jmafoster1 jmafoster1 marked this pull request as ready for review August 1, 2025 15:29
@jmafoster1 jmafoster1 requested a review from f-allian August 1, 2025 15:35
@jmafoster1
Contributor

I've been through this. Most of the optimisations were fine. Some were a little odd. I also took the opportunity to add some extra ones of my own. Also, most notably, I made this class a proper subclass of nx.DiGraph so we don't have to keep accessing .graph all the time. It doesn't make it any faster, but it does make it much cleaner.
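The subclassing change can be sketched with a minimal stand-in base class (the real change subclasses nx.DiGraph; the stub below just keeps the sketch self-contained, and the method names are hypothetical):

```python
class DiGraph:  # minimal stand-in for nx.DiGraph
    def __init__(self):
        self.succ = {}

    def add_edge(self, u, v):
        self.succ.setdefault(u, set()).add(v)
        self.succ.setdefault(v, set())

    def nodes(self):
        return set(self.succ)

# Before: a wrapper, so every graph operation goes through `.graph`.
class WrappedCausalDag:
    def __init__(self):
        self.graph = DiGraph()

    def node_count(self):
        return len(self.graph.nodes())  # extra indirection on every access

# After: a proper subclass; graph methods are available directly.
class CausalDag(DiGraph):
    def node_count(self):
        return len(self.nodes())  # no `.graph` hop

dag = CausalDag()
dag.add_edge("t", "y")
print(dag.node_count())  # → 2
```

As the comment above says, this is a cleanliness win rather than a speed win: the indirection disappears from every call site.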

Contributor

@f-allian f-allian left a comment


The suggestions all look reasonable to me. This is exactly what I was looking into some time ago when I tried to speed up identification for large DAGs (see related issue #259). The use of generators makes sense here, and removing the NetworkX bottlenecks will also likely speed things up. But without a detailed before/after profiling analysis, we won't know exactly how much this has optimised the identification of large DAGs.
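The generator point can be illustrated in miniature: a caller that stops early never pays for the full enumeration (the helper names below are hypothetical, not the library's API):

```python
def candidate_sets_eager(variables):
    # Eager: the whole list is built before the caller sees anything.
    return [frozenset({v}) for v in variables]

def candidate_sets_lazy(variables):
    # Lazy: each candidate is produced only when the caller asks for it.
    for v in variables:
        yield frozenset({v})

gen = candidate_sets_lazy(["z1", "z2", "z3"])
first = next(gen)  # only the first candidate has been computed so far
print(first)  # frozenset({'z1'})
```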

@jmafoster1
Contributor

we won't know exactly how much this will have optimised the identification of large DAGs.

My admittedly limited test runs indicate not much! 🙃 But at least the code is cleaner now.

@jmafoster1 jmafoster1 merged commit 0ad05e2 into main Aug 7, 2025
22 checks passed
@jmafoster1 jmafoster1 deleted the fandango_experiment branch August 7, 2025 07:13
3 participants