]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commitdiff
readme : add llama-swap to infrastructure section (#11032)
authorBenson Wong <redacted>
Thu, 2 Jan 2025 07:14:54 +0000 (23:14 -0800)
committerGitHub <redacted>
Thu, 2 Jan 2025 07:14:54 +0000 (09:14 +0200)
* list llama-swap under tools in README

* readme: add llama-swap to Infrastructure

README.md

index d6d1958c8fc033aea326e1172295681cd392cb6a..0126da89c9d6fe37987dd3fc99f21c8d61b4c8c2 100644 (file)
--- a/README.md
+++ b/README.md
@@ -201,6 +201,7 @@ Instructions for adding support for new models: [HOWTO-add-model.md](docs/develo
 - [Paddler](https://github.com/distantmagic/paddler) - Stateful load balancer custom-tailored for llama.cpp
 - [GPUStack](https://github.com/gpustack/gpustack) - Manage GPU clusters for running LLMs
 - [llama_cpp_canister](https://github.com/onicai/llama_cpp_canister) - llama.cpp as a smart contract on the Internet Computer, using WebAssembly
+- [llama-swap](https://github.com/mostlygeek/llama-swap) - transparent proxy that adds automatic model switching with llama-server
 
 </details>