# GPU Support

GPU support is still an experimental feature that is actively being worked on.
As of now, the [`WeaklyCompressibleSPHSystem`](@ref) and the [`BoundarySPHSystem`](@ref)
are supported on GPUs.
We have tested this on GPUs by Nvidia and AMD.

To run a simulation on a GPU, we need to use the [`FullGridCellList`](@ref)
as the cell list for the [`GridNeighborhoodSearch`](@ref).
This cell list requires a bounding box for the domain, unlike the default cell list, which
works with an unbounded domain.
For simulations that are bounded by a closed tank, we can use the boundary of the tank
to obtain the bounding box as follows.
```jldoctest gpu; output=false, setup=:(using TrixiParticles; trixi_include(@__MODULE__, joinpath(examples_dir(), "fluid", "hydrostatic_water_column_2d.jl"), sol=nothing))
search_radius = TrixiParticles.compact_support(smoothing_kernel, smoothing_length)
min_corner = minimum(tank.boundary.coordinates, dims=2) .- search_radius
max_corner = maximum(tank.boundary.coordinates, dims=2) .+ search_radius
cell_list = TrixiParticles.PointNeighbors.FullGridCellList(; min_corner, max_corner)

# output
PointNeighbors.FullGridCellList{PointNeighbors.DynamicVectorOfVectors{Int32, Matrix{Int32}, Vector{Int32}, Base.RefValue{Int32}}, Nothing, SVector{2, Float64}, SVector{2, Float64}}(Vector{Int32}[], nothing, [-0.24500000000000002, -0.24500000000000002], [1.245, 1.245])
```

We then need to pass this cell list to the neighborhood search and the neighborhood search
to the [`Semidiscretization`](@ref).
```jldoctest gpu; output=false
semi = Semidiscretization(fluid_system, boundary_system,
                          neighborhood_search=GridNeighborhoodSearch{2}(; cell_list))

# output
┌──────────────────────────────────────────────────────────────────────────────────────────────────┐
│ Semidiscretization                                                                               │
│ ══════════════════                                                                               │
│ #spatial dimensions: ………………………… 2                                                                │
│ #systems: ……………………………………………………… 2                                                                │
│ neighborhood search: ………………………… GridNeighborhoodSearch                                           │
│ total #particles: ………………………………… 636                                                              │
└──────────────────────────────────────────────────────────────────────────────────────────────────┘
```

At this point, we should run the simulation once and verify that it still works and that
the bounding box is large enough.
For simulations in which particles move outside the initial tank coordinates,
for example when the tank is not closed or when the tank is moving, an appropriate
bounding box has to be specified manually.
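When specifying the bounding box manually, the same constructor can be used with explicit corner coordinates instead of corners derived from the tank boundary. A minimal sketch; the corner values below are placeholders for a hypothetical setup and must be chosen large enough to contain all particles throughout the simulation:
```julia
# Manually chosen bounding box (placeholder values): particles leaving this
# box during the simulation will cause an error, so add a generous margin.
min_corner = (-2.0, -1.0)
max_corner = (4.0, 3.0)
cell_list = TrixiParticles.PointNeighbors.FullGridCellList(; min_corner, max_corner)
```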

Then, we only need to specify the data type that is used for the simulation.
On an Nvidia GPU, we specify:
```julia
using CUDA
ode = semidiscretize(semi, tspan, data_type=CuArray)
```
On an AMD GPU, we use:
```julia
using AMDGPU
ode = semidiscretize(semi, tspan, data_type=ROCArray)
```
Then, we can run the simulation as usual.
All data is transferred to the GPU during initialization and all loops over particles
and their neighbors will be executed on the GPU as kernels generated by KernelAbstractions.jl.
Data is only copied to the CPU for saving VTK files via the [`SolutionSavingCallback`](@ref).
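
Time integration of the GPU-enabled `ode` then looks the same as on the CPU. A sketch following the example files; the solver and tolerances are only one possible choice, and `callbacks` is assumed to be defined as in the included example:
```julia
using OrdinaryDiffEq

# Solve the ODE on the GPU; all particle loops run as GPU kernels.
sol = solve(ode, RDPK3SpFSAL35(),
            abstol=1e-6, reltol=1e-4,
            save_everystep=false, callback=callbacks)
```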