We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 5ddf998 commit 3b6e008Copy full SHA for 3b6e008
README.md
@@ -35,9 +35,9 @@ ggml_cuda_init: found 1 CUDA devices:
35
| qwen3moe 30B.A3B Q4_0 | 16.18 GiB | 30.53 B | CUDA | 10 | 32 | 512 | 1 | 1 | tg128 | 55.55 ± 0.26 |
36
| qwen3moe 30B.A3B Q4_0 | 16.18 GiB | 30.53 B | CUDA | 10 | 32 | 512 | 1 | 1 | pp512+tg512 | 77.62 ± 0.26 |
37
38
-##PP512 + 69.62 t/s (+32.47%)
39
-##TG128 + 9.88 t/s (+21.63%)
40
-##PP512+TG512 + 12.35 t/s (+18.92%)
+## PP512 + 69.62 t/s (+32.47%)
+## TG128 + 9.88 t/s (+21.63%)
+## PP512+TG512 + 12.35 t/s (+18.92%)
41
42
## CLI performance:
43
0 commit comments