Skip to content

Commit 712b641

Browse files
authored
Update 07_evaluating_the_quantized_models.md
1 parent afdb537 commit 712b641

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

content/learning-paths/servers-and-cloud-computing/arcee-foundation-model-on-gcp/07_evaluating_the_quantized_models.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ layout: learningpathall
1010

1111
Use the [`llama-bench`](https://github.com/ggml-org/llama.cpp/tree/master/tools/llama-bench) tool to measure model performance on Google Cloud Axion Arm64, including inference speed and memory usage.
1212

13-
## Benchmark full, 8-bit, and 4-bit models
13+
## Benchmark half-precision floating point, integer 8-bit, and integer 4-bit models
1414

1515
Run benchmarks on multiple versions of AFM-4.5B:
1616

0 commit comments

Comments
 (0)