Add `simulator` function to `SBC` by currocam · Pull Request #44 · arviz-devs/simuk

currocam · 2025-08-30T11:04:11Z

Description

This PR adds an optional argument simulator to provide a more flexible user interface by allowing SBC to be used with (1) models with no observed variables and (2) custom simulators that do not match the probabilistic model. We discussed this possibility at #4 . It’s my first time using numpyro and contributing to some PyMC ecosystem library, so I’m happy to learn.

I haven't updated the documentation, but test cases include new functionality, and I've written the following toy example.

Toy example for `simulator`

Let's say we are interested in modeling the annual dispersal of a species across a (1-dimensional) landscape. For example, we might want to estimate the average dispersal (sigma) from a dataset of start and end coordinates.

from arviz_plots import plot_ecdf_pit
import pymc as pm
import numpy as np
import simuk
start_points = np.asarray([0., -5., 10., 15., 30.])
end_points = np.asarray([-10., 15., 20., 25., 20.])
displacement = end_points - start_points
# A very simple model could be
with pm.Model() as model:
    sigma = pm.HalfNormal("sigma", sigma=5)
    y_obs = pm.Normal("y", mu=0, sigma=sigma, observed=displacement)

The custom simulator function is useful when (a) having a model that does not have a built-in simulator (because it was a custom Potential call, for example) or (b) when we want to simulate data from a model that is not a probabilistic model.

Let's say we have a mechanistic model that describes the dispersal process. Starting and ending points were taken with a 1-year interval. Every day, an individual moves a distance drawn from an unknown distribution with finite variance. According to the central limit theorem, the total sum of the displacements (if independent) should follow a normal distribution with mean 0 and standard deviation sqrt(time) * sigma as time increases. We might wonder if 1 year is enough time for this approximation to be accurate, and we can test this using SMC for different dispersal distributions with a custom simulator function.

def simulator_uniform(seed, sigma):
    rng = np.random.default_rng(seed)
    time_in_days, n_obs = 365, 5
    a = -np.sqrt(3 * sigma**2 / time_in_days)
    b =  np.sqrt(3 * sigma**2 / time_in_days)
    steps = rng.uniform(a, b, size=(time_in_days, n_obs))
    return {'y': steps.sum(axis=0)}

sbc = simuk.SBC(model, num_simulations=100, simulator=simulator_uniform)
sbc.run_simulations()
plot_ecdf_pit(sbc.simulations)

aloctavodia

Thanks for this contribution, and sorry for taking so much time to answer. I added a few comments that help simplify the code (assuming they work as intended).

simuk/sbc.py

Co-authored-by: Osvaldo A Martin <aloctavodia@gmail.com>

aloctavodia · 2025-09-03T13:24:03Z

I will check who is using numba. We are using it in a couple of places in arviz, but it should be optional there.

We should have a simple example on how to use a simulator. But that can be done in a separated PR. We could also have examples here https://arviz-devs.github.io/EABM/Chapters/Simulation_based_calibration.html

currocam · 2025-09-03T18:13:05Z

Bambi dependency issue

The test for bambi fails in the current version fails with bambi==0.13.0 (in CI) but passes with 0.15.0 (in my machine).
However, I'm not really sure what's going on under the hood.

Current error at 0.13.0 says:

ValueError: Error generating prior predictive sample with parameters {'x': array(-2.04538266), 'Intercept': array(0.45027041), 'y_sigma': array(1.25691733), 'seed': np.int64(745490419)}: bmb_simulator() missing 2 required positional arguments: 'mu' and 'sigma'.

This happens because with bambi==0.13.0 the PyMC model has variables named as {'x', ‘Intercept', 'y_sigma'} (and not mu and sigma, as the test expects).

With bambi==0.15.0 the PyMC model has variables named as
{'x', ‘Intercept', 'sigma', 'mu', 'x'}

I think I definitely misunderstood the Bambi model when writing the text. I guess x is the coefficient corresponding to the 'x' variable and not mu as I thought initially?

Do you have any input on this? I've never used Bambi (and I see you're a maintainer). Is there a way we can provide a better interface? I can imagine this being very confusing to use (at least, more confusing than the vanilla pymc version).

aloctavodia · 2025-09-04T07:54:41Z

yes, x is the coefficient for the x variable. There has been a change in how auxiliary variables are named. It used to be nameresponse_nameparameter (like "y_sigma"), now it's just "nameparameter", like sigma. I think it's ok to ask for Bambi >= 0.14. Actually there are reasons to set the lower bound at 0.16 once it gets released.

currocam · 2025-09-04T17:26:57Z

CI now fails with

ERROR: Ignored the following versions that require a different python version: 0.14.0 Requires-Python >=3.10,<3.13; 0.15.0 Requires-Python >=3.10,<3.13

Perhaps most pragmatic decision would be to simply skip the test?

aloctavodia · 2025-09-04T18:33:13Z

Sounds good to me. It will work with the upcoming release of bambi.

codecov-commenter · 2025-09-04T18:36:13Z

Welcome to Codecov 🎉

Once you merge this PR into your default branch, you're all set! Codecov will compare coverage reports and display results in all future pull requests.

Thanks for integrating Codecov - We've got you covered ☂️

currocam added 2 commits August 30, 2025 11:31

Minimal function

6477a0d

Add seed parameter

29ff475

aloctavodia requested changes Sep 3, 2025

View reviewed changes

simuk/sbc.py Show resolved Hide resolved

simuk/sbc.py Outdated Show resolved Hide resolved

simuk/sbc.py Outdated Show resolved Hide resolved

currocam and others added 4 commits September 3, 2025 12:47

Apply suggestions from code review

81f994e

Co-authored-by: Osvaldo A Martin <aloctavodia@gmail.com>

Add missing import

3ece99c

Check simulator returns a Mapping + linter

b23422b

Add Numba as dependency + docstring

ab941b4

aloctavodia changed the title ~~[WIP] Add simulator function to SBC~~ Add simulator function to SBC Sep 3, 2025

currocam added 2 commits September 4, 2025 18:52

Bump bambi version

5102f8d

Revert bump version & skip test

f094ea8

aloctavodia merged commit c3c9052 into arviz-devs:main Sep 4, 2025
4 checks passed

currocam mentioned this pull request Sep 29, 2025

Add example custom simulator #47

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add `simulator` function to `SBC`#44

Add `simulator` function to `SBC`#44
aloctavodia merged 8 commits intoarviz-devs:mainfrom
currocam:main

currocam commented Aug 30, 2025

Uh oh!

aloctavodia left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

aloctavodia commented Sep 3, 2025

Uh oh!

currocam commented Sep 3, 2025 •

edited

Loading

Uh oh!

aloctavodia commented Sep 4, 2025

Uh oh!

currocam commented Sep 4, 2025

Uh oh!

aloctavodia commented Sep 4, 2025

Uh oh!

codecov-commenter commented Sep 4, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

currocam commented Aug 30, 2025

Description

Toy example for simulator

Uh oh!

aloctavodia left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

aloctavodia commented Sep 3, 2025

Uh oh!

currocam commented Sep 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bambi dependency issue

Uh oh!

aloctavodia commented Sep 4, 2025

Uh oh!

currocam commented Sep 4, 2025

Uh oh!

aloctavodia commented Sep 4, 2025

Uh oh!

codecov-commenter commented Sep 4, 2025

Welcome to Codecov 🎉

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Toy example for `simulator`

currocam commented Sep 3, 2025 •

edited

Loading