scripts/tool_bench.py
6 additions & 6 deletions
@@ -1,20 +1,20 @@
 #!/usr/bin/env uv run
 '''
 Simplistic tool call benchmarks for llama-server and ollama.
-
+
 Essentially runs the tests at server/examples/server/tests/unit/test_tool_call.py N times, at different temperatures and on different backends (current llama-server, baseline llama-server and ollama),
 and plots the results of multiple runs (from same .jsonl file or multiple ones) as a success rate heatmap.
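
The docstring above describes aggregating repeated runs from a .jsonl file into success rates. As a rough illustration of that aggregation step only, here is a minimal sketch. The record fields `model`, `temp`, and `success` are assumptions made for this example and may not match the schema that scripts/tool_bench.py actually writes; the function name `success_rates` is likewise hypothetical.

```python
import json
from collections import defaultdict


def success_rates(jsonl_lines):
    """Return {(model, temp): fraction of successful runs} from JSONL records.

    Assumes each record has `model`, `temp`, and `success` fields
    (hypothetical names, not taken from tool_bench.py).
    """
    totals = defaultdict(int)
    passes = defaultdict(int)
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        record = json.loads(line)
        key = (record["model"], record["temp"])
        totals[key] += 1
        if record["success"]:
            passes[key] += 1
    return {key: passes[key] / totals[key] for key in totals}


# Made-up sample records, for illustration only.
sample = [
    '{"model": "model-a", "temp": 0.0, "success": true}',
    '{"model": "model-a", "temp": 0.0, "success": false}',
    '{"model": "model-a", "temp": 1.0, "success": true}',
]
rates = success_rates(sample)
```

Each key of the resulting dictionary would correspond to one cell of a heatmap, with the value giving the cell's success rate. Records from several files can be combined by chaining their lines before calling the function.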