Skip to content

Commit 3e5f307

Browse files
authored
add broadwell and p100 to performance (#552)
1 parent 32bf736 commit 3e5f307

File tree

1 file changed

+3
-0
lines changed

1 file changed

+3
-0
lines changed

docs/documentation/expectedPerformance.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -27,6 +27,7 @@ AMD MI250X GPUs have two graphics compute dies (GCDs) per MI250X device; we repo
2727
| NVIDIA A30 | 1 GPU | 1.06 | NVHPC 24.1 | GT Rogues Gallery |
2828
| AMD MI250X | 1 __GCD__ | 1.09 | CCE 16.0.1 | OLCF Frontier |
2929
| AMD MI100 | 1 GPU | 1.38 | CCE 16.0.1 | Cray internal system |
30+
| NVIDIA P100 | 1 GPU | 2.35 | NVHPC 23.5 | GT CSE Internal |
3031
| NVIDIA A40 (SP GPU) | 1 GPU | 3.3 | NVHPC 22.11 | NCSA Delta |
3132
| NVIDIA RTX6000 (SP GPU) | 1 GPU | 3.9 | NVHPC 22.11 | GT Phoenix |
3233
| Apple M1 Max | 8/10 cores | 72 | GNU 14.1.0 | N/A |
@@ -39,6 +40,8 @@ AMD MI250X GPUs have two graphics compute dies (GCDs) per MI250X device; we repo
3940
| Intel Xeon Platinum 8352Y (Ice Lake) | 12/32 cores | 128 | NVHPC 24.5 | GT Rogues Gallery |
4041
| AMD EPYC 7713 (Milan) | 32/64 cores | 137 | GNU 12.1.0 | GT Phoenix |
4142
| Intel Xeon Gold 6226 (Cascade Lake) | 12/12 cores | 152 | Intel oneAPI 2022.1 | GT Phoenix |
43+
| Intel Xeon E5-2650V4 (Broadwell) | 8/12 cores | 230 | NVHPC 23.5 | GT CSE Internal |
44+
4245

4346
__All grind times are in nanoseconds (ns) per grid point (gp) per equation (eq) per right-hand side (rhs) evaluation, so X ns/gp/eq/rhs. Lower is better.__
4447

0 commit comments

Comments
 (0)