Welcome to the homework directory of Caspar Gutsche for the course "Solving partial differential equations in parallel on GPUs" by Ludovic Räss, Mauro Werder, Samuel Omlin and Ivan Utkin.
Activate the environment using

    (v1.0) pkg> activate .
    (SomeProject) pkg> instantiate
NOTE: When you get an error that says

    Module XY with build ID 12345 is missing from the cache.

it is because every worker tries to precompile the package on its own. Fix it by precompiling beforehand using `] precompile`.
The homework for every week can be found in the corresponding folder.
If you have any questions, write a mail to [email protected].
- You'll find a version of the PorousConvection_3D_xpu.jl code in the solutions folder on Polybox after the exercise deadline, if needed to get you started.
- Create a multi-xPU version of your thermal porous convection 3D xPU code you finalised in Lecture 7.
- Copy your working `PorousConvection_3D_xpu.jl` code developed for the exercises in Lecture 7 and rename it `PorousConvection_3D_multixpu.jl`.
- Add at the beginning of the code: `using ImplicitGlobalGrid` and `import MPI`.
- Also add a global maximum computation using the MPI reduction function: `max_g(A) = (max_l = maximum(A); MPI.Allreduce(max_l, MPI.MAX, MPI.COMM_WORLD))`
- In the `# numerics` section, initialise the global grid right after defining `nx, ny, nz`, and from then on use the global grid sizes `nx_g()`, `ny_g()` and `nz_g()` for defining `maxiter` and `ncheck`, as well as in any other places where needed.
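A minimal sketch of this step, assuming `nz = 63` and the `nx, ny` relation given further below; the `maxiter`/`ncheck` constants are placeholders, keep the values from your Lecture 7 script:

```julia
using ImplicitGlobalGrid
import MPI

nz       = 63
nx, ny   = 2 * (nz + 1) - 1, nz
me, dims = init_global_grid(nx, ny, nz)                  # initialises the global grid (and MPI)
maxiter  = 10 * max(nx_g(), ny_g(), nz_g())              # placeholder constant, keep your Lecture 7 value
ncheck   = ceil(Int, 0.25 * max(nx_g(), ny_g(), nz_g())) # placeholder constant, keep your Lecture 7 value
```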
- Modify the temperature initialisation using ImplicitGlobalGrid's global coordinate helpers (`x_g()`, etc.), including one internal boundary condition update (`update_halo!`):
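A sketch of what this could look like, assuming the Gaussian perturbation and the ±ΔT/2 bottom/top values from the Lecture 7 script (`ΔT`, `lx`, `ly`, `lz`, `dx`, `dy`, `dz` are the names from your own code):

```julia
T = @zeros(nx, ny, nz)
T .= Data.Array([ΔT * exp(-(x_g(ix, dx, T) + dx / 2 - lx / 2)^2 -
                           (y_g(iy, dy, T) + dy / 2 - ly / 2)^2 -
                           (z_g(iz, dz, T) + dz / 2 - lz / 2)^2) for ix = 1:size(T, 1), iy = 1:size(T, 2), iz = 1:size(T, 3)])
T[:, :, 1]   .=  ΔT / 2   # hot bottom boundary
T[:, :, end] .= -ΔT / 2   # cold top boundary
update_halo!(T)           # one internal boundary condition update
T_old = copy(T)
```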
- Prepare for visualisation, making sure that only `me==0` creates the output directory. Also prepare an array `T_inn` for storing the inner points only (no halo), as well as a global array `T_v` to gather the subdomains into.
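A sketch of this preparation, assuming `me` and `dims` were returned by `init_global_grid` as above, and assuming a `do_viz` flag and an output folder called `viz3Dmpi_out` (both names are placeholders, adapt them to your script):

```julia
if do_viz
    ENV["GKSwstype"] = "nul"                                   # headless plotting on the compute nodes
    if (me == 0) && !isdir("viz3Dmpi_out") mkdir("viz3Dmpi_out") end
    nx_v, ny_v, nz_v = (nx - 2) * dims[1], (ny - 2) * dims[2], (nz - 2) * dims[3]
    T_v   = zeros(nx_v, ny_v, nz_v)                            # global array, gathered on me==0
    T_inn = zeros(nx - 2, ny - 2, nz - 2)                      # local inner points, halo excluded
end
```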
- Use the `max_g` function in the timestep `dt` definition (instead of `maximum`), as one now needs to gather the global maximum among all MPI processes.
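Assuming the `dt` formula from the Lecture 7 script, the only change is swapping `maximum` for `max_g` (all physics symbols and constants below come from your own code and are only illustrative):

```julia
dt = if it == 1
    0.1 * min(dx, dy, dz) / (αρg * ΔT * k_ηf)
else
    min(5.0 * min(dx, dy, dz) / (αρg * abs(ΔT) * k_ηf),
        ϕ * min(dx / max_g(abs.(qDx)), dy / max_g(abs.(qDy)), dz / max_g(abs.(qDz))) / 3.1)
end
```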
- Moving to the time loop, add the halo update function `update_halo!` after the kernel that computes the fluid fluxes. You can additionally wrap both in the `@hide_communication` block to enable communication/computation overlap (using `b_width` as defined above).
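A sketch of the pattern; `compute_Dflux!` and its argument list are placeholders standing in for your own flux kernel:

```julia
@hide_communication b_width begin
    @parallel compute_Dflux!(qDx, qDy, qDz, Pf, T, k_ηf, _dx, _dy, _dz, αρg, θ_dτ_D)
    update_halo!(qDx, qDy, qDz)
end
```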
- Apply a similar step to the temperature update, where you can also include the boundary condition computation as shown in the sketch below (⚠️ no other construct is currently allowed).
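A sketch, again with placeholder kernel names (`update_T!`, `bc_x!`, `bc_y!`) and argument lists standing in for your own kernels:

```julia
@hide_communication b_width begin
    @parallel update_T!(T, T_old, dTdt, qTx, qTy, qTz, dt, _dx, _dy, _dz)
    @parallel (1:size(T, 2), 1:size(T, 3)) bc_x!(T)
    @parallel (1:size(T, 1), 1:size(T, 3)) bc_y!(T)
    update_halo!(T)
end
```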
- Now use the `max_g` function instead of `maximum` to collect the global maximum among all local arrays spanning all MPI processes.
- Make sure all printing statements are only executed by `me==0` to avoid every MPI process printing to screen, and use `nx_g()` instead of the local `nx` in the printed statements when reporting iterations per number of grid points.
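A sketch of the guarded print, assuming `it`, `iter`, `err_D` and `err_T` are the names used in your script:

```julia
using Printf

if me == 0
    @printf("it = %d, iter/nx_g() = %.1f, err_D = %1.3e, err_T = %1.3e\n",
            it, iter / nx_g(), err_D, err_T)
end
```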
- Update the visualisation and output saving part accordingly.
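A sketch of how the gathering could look, using ImplicitGlobalGrid's `gather!`; the `do_viz` flag, the `nvis` check, the folder name and the `save_array` helper are assumptions based on the earlier steps:

```julia
if do_viz && (it % nvis == 0)
    T_inn .= Array(T)[2:end-1, 2:end-1, 2:end-1]   # local inner points, halo dropped
    gather!(T_inn, T_v)                            # assemble the global field on me==0
    if me == 0
        save_array("viz3Dmpi_out/out_T_" * lpad(it, 4, '0'), convert.(Float32, T_v))
        # plot e.g. the 2D slice T_v[:, ceil(Int, ny_g() / 2), :] here
    end
end
```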
- Finalise the global grid (`finalize_global_grid()`) before returning from the main function.
- Make sure to have set the following parameters: `Ra = 1000`, `nz = 63`, `nx, ny = 2*(nz+1)-1, nz`, `b_width = (8, 8, 4)` (for comm/comp overlap), `nt = 500`, `nvis = 50`.
- Then, launch the script on Piz Daint on 8 GPU nodes upon adapting the `runme_mpi_daint.sh` script or submitting the `sbatch_mpi_daint.sh` script with `sbatch` (see here), using CUDA-aware MPI 🚀
- The final 2D slice (at `ny_g()/2`) produced should look similar to the figure depicted in Lecture 9.
- Now that you made sure the code runs as expected, launch `PorousConvection_3D_multixpu.jl` for 2000 steps on 8 GPUs at higher resolution (global grid of 508x252x252), setting: `nz = 127`, `nx, ny = 2*(nz+1)-1, nz`, `nt = 2000`, `nvis = 100`.
- [x] Use the `sbatch` command to launch a non-interactive job, which may take about 5h30-6h to execute.
- Produce a figure or animation showing the final stage of temperature distribution in 3D and add it to a new section titled ## Porous convection 3D MPI in the PorousConvection project subfolder's README.md. You can use the Makie visualisation helper script from Lecture 7 for this purpose (making sure to adapt the resolution and other input params if needed).
- Keep it xPU compatible using ParallelStencil.jl
- Deploy it on multiple xPUs using ImplicitGlobalGrid.jl
- Write some documentation using doc-strings. Add doc-strings to the functions of the following scripts (a minimal example follows after this list):
  - `PorousConvection_3D_xpu.jl`
  - `PorousConvection_3D_multixpu.jl`
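As a minimal doc-string example, here is how the `max_g` helper from above could be documented (the wording of the description is of course up to you):

```julia
"""
    max_g(A)

Return the global maximum of the array `A` across all MPI processes,
by reducing the local maxima with `MPI.Allreduce`.
"""
max_g(A) = (max_l = maximum(A); MPI.Allreduce(max_l, MPI.MAX, MPI.COMM_WORLD))
```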
- Write some documentation using Literate.jl:
  - Add to the PorousConvection folder a Literate.jl script called `bin_io_script.jl` that contains and documents the following `save_array` and `load_array` functions you may have used in your 3D script (a sketch follows after this list)
  - Add a `main()` function to `bin_io_script.jl`
  - Make the Literate-based workflow build automatically on GitHub using GitHub Actions: for this, add the `Literate.yml` script from the lecture to the `.github/workflows` folder
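A sketch of what `bin_io_script.jl` could look like, assuming the usual `save_array`/`load_array` helpers from the lectures (in Literate.jl, the `#` comments turn into text in the rendered page):

```julia
# # Binary array I/O
# This script documents the `save_array` and `load_array` helpers used by the 3D scripts.

"""
    save_array(Aname, A)

Write the array `A` to the binary file `Aname.bin`.
"""
function save_array(Aname, A)
    fname = string(Aname, ".bin")
    out = open(fname, "w"); write(out, A); close(out)
end

"""
    load_array(Aname, A)

Read the binary file `Aname.bin` into the preallocated array `A`.
"""
function load_array(Aname, A)
    fname = string(Aname, ".bin")
    fid = open(fname, "r"); read!(fid, A); close(fid)
end

"""
    main()

Save a small random array, load it back and check that the round trip succeeded.
"""
function main()
    A = rand(Float32, 4, 4, 4)
    save_array("test_array", A)
    B = zeros(Float32, size(A))
    load_array("test_array", B)
    println("round trip successful: ", A == B)
end

main()
```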
Make sure to have the following items in your private GitHub repository:
- a PorousConvection folder containing the structure proposed in Lecture 7
- the 2D and 3D scripts from Lecture 7
- the CI set-up to test the 2D and 3D porous convection scripts
- a lecture_8 folder (different from the PorousConvection folder) containing the codes, README.md and material listed in Exercises - Lecture 8
- Why does weak scaling not work?
  - Maybe the simulation time is too short
- Strong scaling from `lecture8/src/ex2/t4/l8_diffusion_2D_pref_multixpu._SC.jl`:
  - log-scale strong scaling
  - ideal problem size for strong scaling
- the 3D multi-xPU thermal porous convection script and output as per directions from Exercises - Lecture 9.
In addition, enhance the README.md within the PorousConvection folder to include:
- a short motivation/introduction
- concise information about the equations you are solving
- concise information about the numerical method and implementation
- the results, incl. figures with labels, captions, etc...
- a short discussion/conclusion section about the performed work, results, and outlook