LAMMPS Documentation #96
Conversation
preview available: https://docs.tds.cscs.ch/96
Co-authored-by: Rocco Meli <[email protected]>
msimberg left a comment:
Minor formatting issue, which I think is not intentional?
The other question about launcher scripts can be discussed after merging or even offline.
docs/software/sciapps/lammps.md
Outdated
    export MPICH_GPU_SUPPORT_ENABLED=1
    ...
    numactl --cpunodebind=$NUMA_NODE --membind=$NUMA_NODE "$@"
Just out of curiosity: have you found using numactl necessary/useful to actually improve performance or is it there "just to be sure"? Not asking for a change, just asking to understand if we need to align other launcher scripts.
This launcher script is basically the same as the "single rank per gpu" case in the slurm docs, except for the memory binding. The CPU binding is already set by slurm, so setting it again with numactl should be redundant. The visible devices should be equivalent to what we set with --gpus-per-task=1 (assuming we stick with four tasks per node). If the additional --membind seems useful, we might want to recommend it in the slurm docs; conversely, if it doesn't seem to help, simply referring to the slurm docs would be simpler. First touch mostly takes care of the memory binding, except that --membind obviously has the added benefit of disallowing overallocation on a NUMA node, which may also be a good default to recommend for other applications.
What do you think?
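For reference, the wrapper under discussion can be sketched roughly as follows. This is a hypothetical reconstruction, not the exact script from the PR: in particular, deriving the NUMA node index from SLURM_LOCALID is an assumption (it holds only if tasks map 1:1 onto NUMA domains, e.g. four tasks on a four-GPU, four-domain node), and the diff does not show how NUMA_NODE is actually set.

```shell
#!/bin/bash
# Hypothetical launcher wrapper sketch (not the exact script from the PR).
# Assumption: one task per GPU and a 1:1 task-to-NUMA-domain mapping, so the
# SLURM local task ID can double as the NUMA node index.

# Enable GPU-aware MPI in Cray MPICH.
export MPICH_GPU_SUPPORT_ENABLED=1

# Assumed derivation; the real script may compute this differently.
NUMA_NODE=${SLURM_LOCALID:-0}

# Bind both the CPU and the memory allocations of the wrapped command to one
# NUMA node. --membind additionally forbids overallocating on another domain.
exec numactl --cpunodebind="$NUMA_NODE" --membind="$NUMA_NODE" "$@"
```

The script would be used as `srun ./wrapper.sh lmp -in input.lammps` (command names here are placeholders); as discussed above, the CPU binding may be redundant with what slurm already sets.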
yeah I think it's probably possible to avoid using this wrapper with sbatch/srun commands. I'll have a look into this today or tomorrow depending on availability!
Co-authored-by: Mikael Simberg <[email protected]>
Please wait on a merge - I want to first test to see if numactl can be removed.
I converted it to a draft, so that it can't accidentally be merged.
Ping @nickjbrowning? Reminder that if you need more time to check the
@msimberg I've just checked the numactl stuff and we can remove it. I recently had a ticket where something related came up, and the latest commit here reflects the current best practice w.r.t. Kokkos + GPUs.
    #SBATCH --gpus-per-node=4
    #SBATCH --gpus-per-task=1
    #SBATCH --gpu-bind=per_task:1
I think the --gpus-per-task=1 alone covers this. I've never seen the --gpus-per-node=4 + --gpu-bind=per_task:1 form before so can't say for sure if that does something different, but if the goal is to have four ranks and one GPU per task, then in my experience just having --gpus-per-task=1 is sufficient.
Suggested change: replace

    #SBATCH --gpus-per-node=4
    #SBATCH --gpus-per-task=1
    #SBATCH --gpu-bind=per_task:1

with

    #SBATCH --gpus-per-task=1
If you're unsure and would like to leave the other options there for now that's also ok by me. @RMeli?
I've also never seen this combination. If they are equivalent, I'd go for --gpus-per-task=1 for consistency in our documentation. If it does something different, maybe it is worth commenting/adding a note explaining the difference?
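If the options are indeed equivalent, the simplified batch header suggested above would look roughly like this. This is a sketch under stated assumptions: the node/task counts and the LAMMPS invocation (`lmp -in input.lammps`) are illustrative placeholders, not taken from the PR; only the `--gpus-per-task=1` line reflects the suggestion itself.

```shell
#!/bin/bash
#SBATCH --nodes=1             # assumption: single-node example
#SBATCH --ntasks-per-node=4   # assumption: one rank per GPU on a 4-GPU node
#SBATCH --gpus-per-task=1     # each task gets its own GPU; per the suggestion,
                              # this alone replaces --gpus-per-node=4 and
                              # --gpu-bind=per_task:1

# Placeholder command; the real input file and binary path will differ.
srun lmp -in input.lammps
```

With `--gpus-per-task=1`, slurm restricts each task's visible devices to its assigned GPU, which is what the extra binding flags were presumably meant to achieve.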
msimberg left a comment:
Thank you @nickjbrowning for pushing this through. I added a couple of minor comments, but not blocking in my opinion.
Co-authored-by: Mikael Simberg <[email protected]>
Hey guys, can we finally merge this?
@sekelle, @nickjbrowning was on holidays last week. I was waiting for the last conversation to be resolved before merging.
Since the documentation is now live, I'll merge this and then we can go back to refining the last few details.
Re-opening this PR. Will address the previous comments in future commits.