Commit 3d8105d

Revert "[sharktank] Perplexity refactor + add features (nod-ai#1172)"
This reverts commit e5e85f5.
1 parent 79e5bd9 commit 3d8105d

14 files changed: +1383 -1255 lines changed
.github/workflows/ci-sharktank-nightly.yml

Lines changed: 1 addition & 1 deletion
@@ -136,7 +136,7 @@ jobs:
  -s \
  sharktank/tests/evaluate/perplexity_iree_test.py \
  --run-nightly-llama-tests \
- --bs=128 \
+ --bs=100 \
  --iree-device=hip://0 \
  --iree-hip-target=gfx942 \
  --iree-hal-target-device=hip \

sharktank/sharktank/evaluate/README.md

Lines changed: 3 additions & 4 deletions
@@ -39,8 +39,7 @@ pytest -n 8 -v -s sharktank/tests/evaluate/perplexity_iree_test.py -k test_llama
  --bs=4 \
  --iree-device=hip://0 \
  --iree-hip-target=gfx942 \
- --iree-hal-target-device=hip \
- --run-nightly-llama-tests
+ --iree-hal-target-device=hip
  ```

  For a new model:
@@ -84,10 +83,10 @@ python -m sharktank.evaluate.perplexity_iree -h

  | CPU | GPU | Num of prompts |
  |:-------------: |:----------:|:----------------:|
- | AMD EPYC 9554 | MI300X | 128 |
+ | AMD EPYC 9554 | MI300X | 100 |

  #### LLaMA 3.1

  |Models |Torch score |IREE score | Model size (GB) |
  |:-------------------------------|:-------------|:-------------|:----------------|
- |8B FP16 Instruct TP1 |17.555018 |14.6294 |16.07 |
+ |8B FP16 Instruct TP1 |20.223236 |19.786807 |16.07 |
