Speed up test_weakref_leak and test_step_args by mcwanza · Pull Request #8107 · pymc-devs/pymc

mcwanza · 2026-02-12T01:59:44Z

Addresses #7686 — profiling-backed optimizations targeting tests where iteration reduction yields meaningful savings.

Changes

test_weakref_leak (89s → ~30s estimated on CI)

Object count profiling shows memory state stabilizes at iteration 2:

Iteration | Growing objects | Details
    0     | 10 growing      | function:+67554, tuple:+50400, dict:+30266
    1     |  1 growing      | list:+1
    2     |  0 growing      | STABLE
    3     |  0 growing      | STABLE
   ...    |  0 growing      | STABLE (all remaining iterations)

The original test ran 20 iterations but only checked after iteration 15 — 16 warmup iterations when 3 is sufficient. Reduced to 6 total (3 warmup + 3 check). Each conditional_logp call costs ~4.5s on CI, so removing 14 iterations saves ~63s.

test_step_args (62s → ~25s estimated on CI)

This test verifies target_accept argument plumbing, checking acceptance_rate.mean() ≈ 0.5 with decimal=1 precision. The default pm.sample() uses 1000 draws/tune, but 200 is more than sufficient for this loose tolerance (verified stable over 10 runs with different seeds).

Profiling: first pm.sample() with 1000 draws takes 2.20s locally, with 200 draws takes 0.29s. The test calls pm.sample() 5 times.

Changes NOT made (profiling showed negligible impact)

Test	CI time	Bottleneck	Iteration savings
test_default_value_transform_logprob	35s	Compile: 8.85s, Loop(10): 0.0014s	0.6ms total
TestMatchesScipy::test_interpolated	33s	Compile: 91.8%, Array: 7.9%	~296ms across 56 iters

Compilation dominates these tests — reducing iteration counts has near-zero impact, consistent with @ricardoV94's review feedback.

ricardoV94

Some changes are fine, other sacrifice too much on the tests. Left some commenst

ricardoV94 · 2026-02-12T23:25:28Z

tests/distributions/test_multivariate.py

        "size, shape",
        [
            ((10,), None),
-            (None, (10, 6)),


I don't think we should remove any of the parametrizations in test_multivariate. If these are slow the best thing is to analyze why instead

Reverted — all test_multivariate.py changes have been removed from this PR.

Reworked the entire PR based on this feedback. Profiled all the slow tests to understand where time is actually spent — you were right that compilation dominates.

Removed all the original changes (parametrization reduction, iteration count tweaks on compile-dominated tests). The PR now only contains 2 profiling-backed changes:

test_weakref_leak (20→6 iterations): Object count profiling shows state stabilizes at iteration 2 — the original 16 warmup iterations were overkill. Each conditional_logp call costs ~4.5s on CI, so this saves ~63s.

test_step_args (explicit draws=200, tune=200): This test checks acceptance_rate.mean() ≈ 0.5 with decimal=1 precision — 200 draws is sufficient. The default 1000 was unnecessary for this tolerance level.

ricardoV94 · 2026-02-12T23:27:39Z

tests/progress_bar/test_manager.py

-            "draws": 10,
-            "tune": 10,
+            "draws": 5,
+            "tune": 5,


I would be very surprised if this played a role. The cost will be mostly compile time, not running 5 +- steps. Let me know if I am wrong

Reverted — the progress bar change has been removed from this PR.

Confirmed by profiling. The test_default_value_transform_logprob loop takes 0.0014s total for 10 iterations while compilation takes 8.85s — compile/loop ratio of 6480x. Removed this change from the PR.

Profiling-backed changes targeting the two tests where iteration reduction yields meaningful savings. test_weakref_leak (89s on CI → estimated ~30s): Object count profiling shows memory state stabilizes at iteration 2. The original test used 16 warmup iterations before checking — reduced to 3 warmup + 3 check = 6 total iterations (from 20). Each conditional_logp call costs ~4.5s on CI, so removing 14 iterations saves ~63s. test_step_args (62s on CI → estimated ~25s): This test verifies target_accept argument plumbing, checking acceptance_rate.mean() ≈ 0.5 with decimal=1 precision. The default pm.sample() uses 1000 draws/tune, but 200 is sufficient for this loose tolerance. Verified stable over 10 runs with different seeds. Changes NOT made (profiling showed negligible impact): - test_default_value_transform_logprob range(10)→range(3): compile time is 99.98% of cost, loop saves 0.6ms total - test_interpolated x_points 100k→20k: compilation is 91.8% of cost, array reduction saves ~296ms across 56 iterations Addresses pymc-devs#7686

mcwanza marked this pull request as ready for review February 12, 2026 02:17

mcwanza force-pushed the speedup-slow-tests branch from 5137112 to 6dba2f2 Compare February 12, 2026 20:48

ricardoV94 requested changes Feb 12, 2026

View reviewed changes

mcwanza force-pushed the speedup-slow-tests branch from 6dba2f2 to c023167 Compare February 12, 2026 23:58

mcwanza force-pushed the speedup-slow-tests branch from c023167 to c513193 Compare February 17, 2026 23:39

mcwanza changed the title ~~Reduce unnecessary iteration counts in slow tests~~ Speed up test_weakref_leak and test_step_args Feb 17, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Speed up test_weakref_leak and test_step_args#8107

Speed up test_weakref_leak and test_step_args#8107
mcwanza wants to merge 1 commit intopymc-devs:mainfrom
mcwanza:speedup-slow-tests

mcwanza commented Feb 12, 2026 •

edited

Loading

Uh oh!

ricardoV94 left a comment

Uh oh!

ricardoV94 Feb 12, 2026

Uh oh!

mcwanza Feb 13, 2026

Uh oh!

mcwanza Feb 17, 2026

Uh oh!

ricardoV94 Feb 12, 2026

Uh oh!

mcwanza Feb 13, 2026

Uh oh!

mcwanza Feb 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

mcwanza commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

test_weakref_leak (89s → ~30s estimated on CI)

test_step_args (62s → ~25s estimated on CI)

Changes NOT made (profiling showed negligible impact)

Uh oh!

ricardoV94 left a comment

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

mcwanza Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

mcwanza Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

ricardoV94 Feb 12, 2026

Choose a reason for hiding this comment

Uh oh!

mcwanza Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

mcwanza Feb 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

mcwanza commented Feb 12, 2026 •

edited

Loading