|
| 1 | +# [Advice for performant SDE simulations](@id sde_simulation_performance) |
| 2 | +While there exist relatively straightforward approaches to manage performance for [ODE](@ref ode_simulation_performance) and jump simulations, this is generally not the case for SDE simulations. Below, we briefly describe some options. However, as one starts to investigate these, one quickly reaches what is (or could be) active areas of research. |
| 3 | + |
| 4 | +## [SDE solver selection](@id sde_simulation_performance_solvers) |
| 5 | +We have previously described how [ODE solver selection](@ref ode_simulation_performance_solvers) can impact simulation performance. Again, it can be worthwhile to investigate solver selection's impact on performance for SDE simulations. Throughout this documentation, we generally use the `STrapezoid` solver as the default choice. However, if the `DifferentialEquations` package is loaded |
| 6 | +```julia |
| 7 | +using DifferentialEquations |
| 8 | +``` |
| 9 | +automatic SDE solver selection enabled (just like is the case for ODEs by default). Generally, the automatic SDE solver choice enabled by `DifferentialEquations` is better than just using `STrapezoid`. Next, if performance is critical, it can be worthwhile to check the [list of available SDE solvers](https://docs.sciml.ai/DiffEqDocs/stable/solvers/sde_solve/) to find one with advantageous performance for a given problem. When doing so, it is important to pick a solver compatible with *non-diagonal noise* and with [*Ito problems*](https://en.wikipedia.org/wiki/It%C3%B4_calculus). |
| 10 | + |
| 11 | +## [Options for Jacobian computation](@id sde_simulation_performance_jacobian) |
| 12 | +In the section on ODE simulation performance, we describe various [options for computing the system Jacobian](@ref ode_simulation_performance_jacobian), and how these could be used to improve performance for [implicit solvers](@ref ode_simulation_performance_stiffness). These can be used in tandem with implicit SDE solvers (such as `STrapezoid`). However, due to additional considerations during SDE simulations, it is much less certain whether these will actually have any impact on performance. So while these options might be worth reading about and trialling, there is no guarantee that they will be beneficial. |
| 13 | + |
| 14 | +## [Parallelisation on CPUs and GPUs](@id sde_simulation_performance_parallelisation) |
| 15 | +We have previously described how simulation parallelisation can be used to [improve performance when multiple ODE simulations are carried out](@ref ode_simulation_performance_parallelisation). The same approaches can be used for SDE simulations. Indeed, it is often more relevant for SDEs, as these are often re-simulated using identical simulation conditions (to investigate their typical behaviour across many samples). CPU parallelisation of SDE simulations uses the [same approach as ODEs](@ref ode_simulation_performance_parallelisation_CPU). GPU parallelisation requires some additional considerations, which are described below. |
| 16 | + |
| 17 | +### [GPU parallelisation of SDE simulations](@id sde_simulation_performance_parallelisation_GPU) |
| 18 | +GPU parallelisation of SDE simulations uses a similar approach as that for [ODE simulations](@ref ode_simulation_performance_parallelisation_GPU). The main differences are that SDE parallelisation requires a GPU SDE solver (like `GPUEM`) and fixed time stepping. |
| 19 | + |
| 20 | +We will assume that we are using the CUDA GPU hardware, so we will first load the [CUDA.jl](https://github.com/JuliaGPU/CUDA.jl) backend package, as well as DiffEqGPU: |
| 21 | +```julia |
| 22 | +using CUDA, DiffEqGPU |
| 23 | +``` |
| 24 | +Which backend package you should use depends on your available hardware, with the alternatives being listed [here](https://docs.sciml.ai/DiffEqGPU/stable/manual/backends/). |
| 25 | + |
| 26 | +Next, we create the `SDEProblem` which we wish to simulate. Like for ODEs, we ensure that all vectors are [static vectors](https://github.com/JuliaArrays/StaticArrays.jl) and that all values are `Float32`s. Here we prepare the parallel simulations of a simple [birth-death process](@ref basic_CRN_library_bd). |
| 27 | +```@example sde_simulation_performance_gpu |
| 28 | +using Catalyst, StochasticDiffEq, StaticArrays |
| 29 | +bd_model = @reaction_network begin |
| 30 | + (p,d), 0 <--> X |
| 31 | +end |
| 32 | +@unpack X, p, d = bd_model |
| 33 | +
|
| 34 | +u0 = @SVector [X => 20.0f0] |
| 35 | +tspan = (0.0f0, 10.0f0) |
| 36 | +ps = @SVector [p => 10.0f0, d => 1.0f0] |
| 37 | +sprob = SDEProblem(bd_model, u0, tspan, ps) |
| 38 | +nothing # hide |
| 39 | +``` |
| 40 | +The `SDEProblem` is then used to [create an `EnsembleProblem`](@ref ensemble_simulations_monte_carlo). |
| 41 | +```@example sde_simulation_performance_gpu |
| 42 | +eprob = EnsembleProblem(sprob) |
| 43 | +nothing # hide |
| 44 | +``` |
| 45 | +Finally, we can solve our `EnsembleProblem` while: |
| 46 | +- Using a valid GPU SDE solver (either [`GPUEM`](https://docs.sciml.ai/DiffEqGPU/stable/manual/ensemblegpukernel/#DiffEqGPU.GPUEM) or [`GPUSIEA`](https://docs.sciml.ai/DiffEqGPU/stable/manual/ensemblegpukernel/#DiffEqGPU.GPUSIEA)). |
| 47 | +- Designating the GPU ensemble method, `EnsembleGPUKernel` (with the correct GPU backend as input). |
| 48 | +- Designating the number of trajectories we wish to simulate. |
| 49 | +- Designating a fixed time step size. |
| 50 | + |
| 51 | +```julia |
| 52 | +esol = solve(eprob, GPUEM(), EnsembleGPUKernel(CUDA.CUDABackend()); trajectories = 10000, dt = 0.01) |
| 53 | +``` |
| 54 | + |
| 55 | +Above we parallelise GPU simulations with identical initial conditions and parameter values. However, [varying these](@ref ensemble_simulations_varying_conditions) is also possible. |
| 56 | + |
| 57 | +### [Multilevel Monte Carlo](@id sde_simulation_performance_parallelisation_mlmc) |
| 58 | +An approach for speeding up parallel stochastic simulations is so-called [*multilevel Monte Carlo approaches*](https://en.wikipedia.org/wiki/Multilevel_Monte_Carlo_method) (MLMC). These are used when a stochastic process is simulated repeatedly using identical simulation conditions. Here, instead of performing all simulations using identical [tolerance](@ref ode_simulation_performance_error), the ensemble is simulated using a range of tolerances (primarily lower ones, which yields faster simulations). Currently, [StochasticDiffEq.jl](https://github.com/SciML/StochasticDiffEq.jl) do not have a native implementation for performing MLMC simulations (this will hopefully be added in the future). However, if high performance of parallel SDE simulations is required, these approaches may be worth investigating. |
0 commit comments