Commit 96341ac

Revert "Revert "[sharktank] Perplexity refactor + add features (nod-ai#1172)""
This reverts commit 3d8105d.
1 parent 3d8105d commit 96341ac

14 files changed: +1255 -1383 lines changed

.github/workflows/ci-sharktank-nightly.yml

Lines changed: 1 addition & 1 deletion
@@ -136,7 +136,7 @@ jobs:
  -s \
  sharktank/tests/evaluate/perplexity_iree_test.py \
  --run-nightly-llama-tests \
- --bs=100 \
+ --bs=128 \
  --iree-device=hip://0 \
  --iree-hip-target=gfx942 \
  --iree-hal-target-device=hip \

sharktank/sharktank/evaluate/README.md

Lines changed: 4 additions & 3 deletions
@@ -39,7 +39,8 @@ pytest -n 8 -v -s sharktank/tests/evaluate/perplexity_iree_test.py -k test_llama
  --bs=4 \
  --iree-device=hip://0 \
  --iree-hip-target=gfx942 \
- --iree-hal-target-device=hip
+ --iree-hal-target-device=hip \
+ --run-nightly-llama-tests
  ```

  For a new model:
@@ -83,10 +84,10 @@ python -m sharktank.evaluate.perplexity_iree -h

  | CPU | GPU | Num of prompts |
  |:-------------: |:----------:|:----------------:|
- | AMD EPYC 9554 | MI300X | 100 |
+ | AMD EPYC 9554 | MI300X | 128 |

  #### LLaMA 3.1

  |Models |Torch score |IREE score | Model size (GB) |
  |:-------------------------------|:-------------|:-------------|:----------------|
- |8B FP16 Instruct TP1 |20.223236 |19.786807 |16.07 |
+ |8B FP16 Instruct TP1 |17.555018 |14.6294 |16.07 |
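
For readers unfamiliar with the Torch and IREE scores in the table, the sketch below shows the standard textbook definition of perplexity: the exponential of the mean negative log-likelihood per token. It is only an illustration and is not taken from sharktank. The function name `perplexity` and its list-of-log-probabilities input are assumptions made for this example, and the actual evaluation code in this commit may batch, mask, or aggregate differently.

```python
import math


def perplexity(token_log_probs):
    """Return exp(-mean(log p)) for a sequence of per-token natural-log probabilities.

    Hypothetical helper for illustration only; not sharktank's implementation.
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    # Mean negative log-likelihood over all scored tokens.
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


if __name__ == "__main__":
    # A model that assigns probability 0.25 to every token is as uncertain
    # as a uniform choice among 4 tokens, so its perplexity is about 4.
    uniform = [math.log(0.25)] * 8
    print(perplexity(uniform))
```

Lower values mean the model assigns higher probability to the reference tokens, which is why the scores are compared between the Torch and IREE runs on the same prompts.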
