* list llama-swap under tools in README
* readme: add llama-swap to Infrastructure
- [Paddler](https://github.com/distantmagic/paddler) - Stateful load balancer custom-tailored for llama.cpp
- [GPUStack](https://github.com/gpustack/gpustack) - Manage GPU clusters for running LLMs
- [llama_cpp_canister](https://github.com/onicai/llama_cpp_canister) - llama.cpp as a smart contract on the Internet Computer, using WebAssembly
- [llama-swap](https://github.com/mostlygeek/llama-swap) - Transparent proxy that adds automatic model switching with llama-server
</details>