Skip to content

Commit 73c455e

Browse files
committed
requested changes
1 parent 9dba3bf commit 73c455e

File tree

5 files changed

+11
-10
lines changed

5 files changed

+11
-10
lines changed

docs/documentation/expectedPerformance.md

Lines changed: 8 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -11,9 +11,10 @@ The following table outlines expected performance in terms of number of time-ste
1111
| Hardware | # Ranks | 1M Cells | 4M Cells | 8M Cells | Compiler | Computer |
1212
| ---: | :----: | :----: | :---: | :---: | :----: | :--- |
1313
| Nvidia V100 | 1 | 88.5k | 18.7k | N/A | NVHPC 22.11 | PACE Phoenix |
14+
| Nvidia V100 | 1 | 78.8k | 18.8k | N/A | NVHPC 22.11 | OLCF Summit |
1415
| Nvidia A100 | 1 | 114.4k | 34.6k | 16.5k | NVHPC 23.5 | Wingtip |
1516
| AMD MI250x | 1 | 77.5k | 22.3k | 11.2k | CCE 16.0.1 | OLCF Frontier |
16-
| Intel Xeon Gold 6226 | 12 | 2.5k | 0.7k | 0.4k | GNU 10.3.0 | Pace Phoenix |
17+
| Intel Xeon Gold 6226 | 12 | 2.5k | 0.7k | 0.4k | GNU 10.3.0 | PACE Phoenix |
1718
| Apple Silicon M2 | 6 | 2.8k | 0.6k | 0.2k | GNU 13.2.0 | N/A |
1819

1920
If `'model_eqns' : 3` is replaced by `'model_eqns' : 2`, an inviscid 5-equation model is used.
@@ -23,10 +24,11 @@ The following table outlines expected performance in terms of number of time-ste
2324

2425
| Hardware | # Ranks | 1M Cells | 4M Cells | 8M Cells | Compiler | Computer |
2526
| ---: | :----: | :----: | :---: | :---: | :----: | :--- |
26-
| Nvidia V100 | 1 | 113.4k | 26.2k | N/A | NVHPC 22.11 | PACE Phoenix |
27+
| Nvidia V100 | 1 | 113.4k | 26.2k | 13.0k | NVHPC 22.11 | PACE Phoenix |
28+
| Nvidia V100 | 1 | 107.7k | 26.3k | 13.1k | NVHPC 22.11 | OLCF Summit |
2729
| Nvidia A100 | 1 | 153.5k | 48.0k | 22.5k | NVHPC 23.5 | Wingtip |
2830
| AMD MI250x | 1 | 104.2k | 31.0k | 14.8k | CCE 16.0.1 | OLCF Frontier |
29-
| Intel Xeon Gold 6226 | 12 | 5.4k | 1.6k | 0.8k | GNU 10.3.0 | Pace Phoenix |
31+
| Intel Xeon Gold 6226 | 12 | 5.4k | 1.6k | 0.8k | GNU 10.3.0 | PACE Phoenix |
3032
| Apple Silicon M2 | 6 | 3.7k | 11.0k | 0.3k | GNU 13.2.0 | N/A |
3133

3234
## Weak scaling
@@ -35,17 +37,17 @@ Strong scaling results are obtained by increasing the problem size with the numb
3537
so that work per process remains constant.
3638

3739
### AMD MI250X GPU
38-
MFC weask scales to 65,536 AMD MI250X GPUs on OLCF Frontier with 96% efficiency.
40+
MFC weask scales to 65,536 AMD MI250X GPUs on OLCF Frontier with 96% efficiency. This corresponds to 87% of the entire machine.
3941

4042
<img src="../res/weakScaling/frontier.svg" style="height: 50%; width:50%; border-radius: 10pt"/>
4143

4244
### Nvidia V100 GPU
43-
MFC weak scales to 13,824 V100 Nvidia V100 GPUs on OLCF Summit with 97% efficiency.
45+
MFC weak scales to 13,824 V100 Nvidia V100 GPUs on OLCF Summit with 97% efficiency. This corresponds to 50% of the entire machine.
4446

4547
<img src="../res/weakScaling/summit.svg" style="height: 50%; width:50%; border-radius: 10pt"/>
4648

4749
### IMB Power9 CPU
48-
MFC Weak scales to 13,824 Power9 CPU cores on OLCF Summit with 1% of ideal scaling.
50+
MFC Weak scales to 13,824 Power9 CPU cores on OLCF Summit to within 1% of ideal scaling.
4951

5052
<img src="../res/weakScaling/cpuScaling.svg" style="height: 50%; width:50%; border-radius: 10pt"/>
5153

docs/res/strongScaling/S01.95

Lines changed: 0 additions & 1 deletion
This file was deleted.

src/post_process/m_global_parameters.fpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -295,7 +295,7 @@ contains
295295
flux_lim = dflt_int
296296
flux_wrt = .false.
297297
parallel_io = .false.
298-
file_per_process = .true.
298+
file_per_process = .false.
299299
E_wrt = .false.
300300
pres_wrt = .false.
301301
alpha_wrt = .false.

src/pre_process/m_global_parameters.fpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -251,7 +251,7 @@ contains
251251
bc_z%beg = dflt_int; bc_z%end = dflt_int
252252

253253
parallel_io = .false.
254-
file_per_process = .true.
254+
file_per_process = .false.
255255
precision = 2
256256
vel_profile = .false.
257257
instability_wave = .false.

src/simulation/m_global_parameters.fpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -374,7 +374,7 @@ contains
374374
null_weights = .false.
375375
mixture_err = .false.
376376
parallel_io = .false.
377-
file_per_process = .true.
377+
file_per_process = .false.
378378
precision = 2
379379
hypoelasticity = .false.
380380
weno_flat = .true.

0 commit comments

Comments
 (0)