From: Shelby Jenkins
Date: Tue, 4 Feb 2025 11:20:55 +0000 (-0600)
Subject: readme : add llm_client Rust crate to readme bindings (#11628)
X-Git-Tag: upstream/0.0.4719~84
X-Git-Url: https://git.djapps.eu/?a=commitdiff_plain;h=106045e7bb8db481bb2ebbc60e3b53cb27ada117;p=pkg%2Fggml%2Fsources%2Fllama.cpp

readme : add llm_client Rust crate to readme bindings (#11628)

[This crate](https://github.com/ShelbyJenkins/llm_client) has been in a usable state for quite a while, so I figured now is fair to add it.

It installs from crates.io, and automatically downloads the llama.cpp repo and builds it for the target platform - with the goal being the easiest user experience possible.

It also integrates model presets and chooses the largest quant that fits the target's available VRAM. So a user just has to specify one of the presets (I manually add the most popular models), and it will download the model from Hugging Face.

So, it's like a Rust Ollama, but it's not really for chatting. It makes heavy use of llama.cpp's grammar system to do structured output for decision making and control flow tasks.
---

diff --git a/README.md b/README.md
index 7f306d19..d68330d2 100644
--- a/README.md
+++ b/README.md
@@ -136,6 +136,7 @@ Instructions for adding support for new models: [HOWTO-add-model.md](docs/develo
 - Rust (more features): [edgenai/llama_cpp-rs](https://github.com/edgenai/llama_cpp-rs)
 - Rust (nicer API): [mdrokz/rust-llama.cpp](https://github.com/mdrokz/rust-llama.cpp)
 - Rust (more direct bindings): [utilityai/llama-cpp-rs](https://github.com/utilityai/llama-cpp-rs)
+- Rust (automated build from crates.io): [ShelbyJenkins/llm_client](https://github.com/ShelbyJenkins/llm_client)
 - C#/.NET: [SciSharp/LLamaSharp](https://github.com/SciSharp/LLamaSharp)
 - C#/VB.NET (more features - community license): [LM-Kit.NET](https://docs.lm-kit.com/lm-kit-net/index.html)
 - Scala 3: [donderom/llm4s](https://github.com/donderom/llm4s)