docs: introduce gpustack and gguf-parser (ggml-org#8873)
* readme: introduce gpustack
GPUStack is an open-source GPU cluster manager for running large
language models, which uses llama.cpp as the backend.
Signed-off-by: thxCode <[email protected]>
* readme: introduce gguf-parser
GGUF Parser is a tool to review/check the GGUF file and estimate the
memory usage without downloading the whole model.
Signed-off-by: thxCode <[email protected]>
---------
Signed-off-by: thxCode <[email protected]>
README.md: 2 additions & 0 deletions
@@ -220,10 +220,12 @@ Unless otherwise noted these projects are open-source with permissive licensing:
 
 - [akx/ggify](https://github.com/akx/ggify) – download PyTorch models from HuggingFace Hub and convert them to GGML
 - [crashr/gppm](https://github.com/crashr/gppm) – launch llama.cpp instances utilizing NVIDIA Tesla P40 or P100 GPUs with reduced idle power consumption
+- [gpustack/gguf-parser](https://github.com/gpustack/gguf-parser-go/tree/main/cmd/gguf-parser) - review/check the GGUF file and estimate the memory usage
 
 **Infrastructure:**
 
 - [Paddler](https://github.com/distantmagic/paddler) - Stateful load balancer custom-tailored for llama.cpp
+- [GPUStack](https://github.com/gpustack/gpustack) - Manage GPU clusters for running LLMs
 
 **Games:**
 - [Lucy's Labyrinth](https://github.com/MorganRO8/Lucys_Labyrinth) - A simple maze game where agents controlled by an AI model will try to trick you.