Skip to content

Commit ca10839

Browse files
committed
Update perf number
1 parent 776b733 commit ca10839

File tree

1 file changed

+3
-4
lines changed

1 file changed

+3
-4
lines changed

mixtral-moe/README.md

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -12,11 +12,10 @@ python scripts/convert_hf_checkpoint.py --checkpoint_dir checkpoints/$MODEL_REPO
1212
## Benchmarks
1313
Benchmarks run on an 8xA100-80GB, power limited to 330W with a hybrid cube mesh topology. Note that all benchmarks are run at *batch size=1*, making the reported tokens/s numbers equivalent to "tokens/s/user". In addition, they are run with a very small prompt length (just 5 tokens).
1414

15-
| | 1 GPU | 2 GPU | 4 GPU | 8 GPU |
15+
| | 1 GPU | 2 GPU | 4 GPU | 8 GPU |
1616
|------------------|---------|-----------|--------|------------|
17-
|baseline(bfloat16)| OOM | 78.75 | 118.23 | 203.69 |
18-
| int8 | 56.04 | 99.91 | 149.53 | 218.48 |
19-
17+
|baseline(bfloat16)| OOM | 96.67 | 125.70 | 227.82 |
18+
| int8 | 97.92 | 155.03 | 162.58 | 279.35 |
2019

2120

2221
## Generate Text

0 commit comments

Comments
 (0)