@@ -11,9 +11,10 @@ The following table outlines expected performance in terms of number of time-ste
1111| Hardware | # Ranks | 1M Cells | 4M Cells | 8M Cells | Compiler | Computer |
1212| ---: | :----: | :----: | :---: | :---: | :----: | :--- |
1313| Nvidia V100 | 1 | 88.5k | 18.7k | N/A | NVHPC 22.11 | PACE Phoenix |
14+ | Nvidia V100 | 1 | 78.8k | 18.8k | N/A | NVHPC 22.11 | OLCF Summit |
1415| Nvidia A100 | 1 | 114.4k | 34.6k | 16.5k | NVHPC 23.5 | Wingtip |
1516| AMD MI250x | 1 | 77.5k | 22.3k | 11.2k | CCE 16.0.1 | OLCF Frontier |
16- | Intel Xeon Gold 6226 | 12 | 2.5k | 0.7k | 0.4k | GNU 10.3.0 | Pace Phoenix |
17+ | Intel Xeon Gold 6226 | 12 | 2.5k | 0.7k | 0.4k | GNU 10.3.0 | PACE Phoenix |
1718| Apple Silicon M2 | 6 | 2.8k | 0.6k | 0.2k | GNU 13.2.0 | N/A |
1819
1920If ` 'model_eqns' : 3 ` is replaced by ` 'model_eqns' : 2 ` , an inviscid 5-equation model is used.
@@ -23,10 +24,11 @@ The following table outlines expected performance in terms of number of time-ste
2324
2425| Hardware | # Ranks | 1M Cells | 4M Cells | 8M Cells | Compiler | Computer |
2526| ---: | :----: | :----: | :---: | :---: | :----: | :--- |
26- | Nvidia V100 | 1 | 113.4k | 26.2k | N/A | NVHPC 22.11 | PACE Phoenix |
27+ | Nvidia V100 | 1 | 113.4k | 26.2k | 13.0k | NVHPC 22.11 | PACE Phoenix |
28+ | Nvidia V100 | 1 | 107.7k | 26.3k | 13.1k | NVHPC 22.11 | OLCF Summit |
2729| Nvidia A100 | 1 | 153.5k | 48.0k | 22.5k | NVHPC 23.5 | Wingtip |
2830| AMD MI250x | 1 | 104.2k | 31.0k | 14.8k | CCE 16.0.1 | OLCF Frontier |
29- | Intel Xeon Gold 6226 | 12 | 5.4k | 1.6k | 0.8k | GNU 10.3.0 | Pace Phoenix |
31+ | Intel Xeon Gold 6226 | 12 | 5.4k | 1.6k | 0.8k | GNU 10.3.0 | PACE Phoenix |
3032| Apple Silicon M2 | 6 | 3.7k | 11.0k | 0.3k | GNU 13.2.0 | N/A |
3133
3234## Weak scaling
@@ -35,17 +37,17 @@ Strong scaling results are obtained by increasing the problem size with the numb
3537so that work per process remains constant.
3638
3739### AMD MI250X GPU
38- MFC weask scales to 65,536 AMD MI250X GPUs on OLCF Frontier with 96% efficiency.
40+ MFC weask scales to 65,536 AMD MI250X GPUs on OLCF Frontier with 96% efficiency. This corresponds to 87% of the entire machine.
3941
4042<img src =" ../res/weakScaling/frontier.svg " style =" height : 50% ; width :50% ; border-radius : 10pt " />
4143
4244### Nvidia V100 GPU
43- MFC weak scales to 13,824 V100 Nvidia V100 GPUs on OLCF Summit with 97% efficiency.
45+ MFC weak scales to 13,824 V100 Nvidia V100 GPUs on OLCF Summit with 97% efficiency. This corresponds to 50% of the entire machine.
4446
4547<img src =" ../res/weakScaling/summit.svg " style =" height : 50% ; width :50% ; border-radius : 10pt " />
4648
4749### IMB Power9 CPU
48- MFC Weak scales to 13,824 Power9 CPU cores on OLCF Summit with 1% of ideal scaling.
50+ MFC Weak scales to 13,824 Power9 CPU cores on OLCF Summit to within 1% of ideal scaling.
4951
5052<img src =" ../res/weakScaling/cpuScaling.svg " style =" height : 50% ; width :50% ; border-radius : 10pt " />
5153
0 commit comments