Skip to content

Commit 53ddee6

Browse files
committed
gpu tutorial update
1 parent 959588c commit 53ddee6

File tree

1 file changed

+4
-1
lines changed

1 file changed

+4
-1
lines changed

docs/src/catalyst_applications/ode_simulation_performance.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -260,7 +260,7 @@ Generally, we can parallelise `EnsembleProblem`s across several GPUs in a very s
260260
- Depending on which GPU hardware is used, a specific back-end package has to be installed and imported (e.g. CUDA for NVIDIA's GPUs or Metal for Apple's).
261261
- For some cases, we must use a special ODE solver supporting simulations on GPUs.
262262

263-
Furthermore, to receive good performance, we should also make the following adaptations:
263+
Furthermore (while not required) to receive good performance, we should also make the following adaptations:
264264
- By default, Julia's decimal numbers are implemented as `Float64`s, however, using `Float32`s is advantageous on GPUs. Ideally, all initial conditions and parameter values should be specified using these.
265265
- We should designate all our vectors (i.e. initial conditions and parameter values) as [static vectors](https://github.com/JuliaArrays/StaticArrays.jl).
266266

@@ -311,5 +311,8 @@ Note that we have to provide the `CUDA.CUDABackend()` argument to our ensemble a
311311

312312
Just like OrdinaryDiffEq is able to utilise parallel CPU processes to speed up the linear solve part of ODE simulations, GPUs can also be used. More details on this can be found [here](https://docs.sciml.ai/DiffEqGPU/stable/tutorials/within_method_gpu/). This is only recommended when ODEs are very large (at least 1,000 species), and typically not applicable to CRNs.
313313

314+
For more information of differential equation simulations on GPUs in Julia, please read [DiffEqGPU's documentation](https://docs.sciml.ai/DiffEqGPU/stable/). Furthermore, if performance is critical, [this tutorial](https://docs.sciml.ai/DiffEqGPU/stable/tutorials/lower_level_api/) provides information on how to redesign your simulation code to make it more suitable for GPU simulations.
315+
316+
314317
---
315318
## References

0 commit comments

Comments
 (0)