Skip to content

Commit fa02ded

Browse files
WuhanMonkeysubramen
authored andcommitted
Create README.md
Add benchmarks top-level README
1 parent 4901841 commit fa02ded

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

tools/benchmarks/README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,4 @@
1+
# Benchmarks
2+
3+
* inference - a folder contains benchmark scripts that apply a throughput analysis for Llama models inference on various backends including on-prem, cloud and on-device.
4+
* llm_eval_harness - a folder contains a tool to evaluate fine-tuned Llama models including quantized models focusing on quality.

0 commit comments

Comments
 (0)