From: Frank Mai <redacted>
Date: Mon, 12 Aug 2024 12:45:50 +0000 (+0800)
Subject: docs: introduce gpustack and gguf-parser (#8873)
X-Git-Tag: upstream/0.0.4488~911
X-Git-Url: https://git.djapps.eu/?a=commitdiff_plain;h=84eb2f4fad28ceadd415a4e775320c983f4d9a7d;p=pkg%2Fggml%2Fsources%2Fllama.cpp

docs: introduce gpustack and gguf-parser (#8873)

* readme: introduce gpustack

GPUStack is an open-source GPU cluster manager for running large
language models, which uses llama.cpp as the backend.

Signed-off-by: thxCode <redacted>

* readme: introduce gguf-parser

GGUF Parser is a tool to review/check the GGUF file and estimate the
memory usage without downloading the whole model.

Signed-off-by: thxCode <redacted>

---------

Signed-off-by: thxCode <redacted>
---

diff --git a/README.md b/README.md
index 1283f6805..7f48fde6e 100644
--- a/README.md
+++ b/README.md
@@ -186,10 +186,12 @@ Unless otherwise noted these projects are open-source with permissive licensing:
 
 - [akx/ggify](https://github.com/akx/ggify) – download PyTorch models from HuggingFace Hub and convert them to GGML
 - [crashr/gppm](https://github.com/crashr/gppm) – launch llama.cpp instances utilizing NVIDIA Tesla P40 or P100 GPUs with reduced idle power consumption
+- [gpustack/gguf-parser](https://github.com/gpustack/gguf-parser-go/tree/main/cmd/gguf-parser) - review/check the GGUF file and estimate the memory usage
 
 **Infrastructure:**
 
 - [Paddler](https://github.com/distantmagic/paddler) - Stateful load balancer custom-tailored for llama.cpp
+- [GPUStack](https://github.com/gpustack/gpustack) - Manage GPU clusters for running LLMs
 
 **Games:**
 - [Lucy's Labyrinth](https://github.com/MorganRO8/Lucys_Labyrinth) - A simple maze game where agents controlled by an AI model will try to trick you.