git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
llama : accept a list of devices to use to offload a model (#10497)
author Diego Devesa <redacted>
Mon, 25 Nov 2024 18:30:06 +0000 (19:30 +0100)
committer GitHub <redacted>
Mon, 25 Nov 2024 18:30:06 +0000 (19:30 +0100)
commit 10bce0450f0c4d80087e06312b9dbbab3e87f16b
tree 7eb8de91d8000b6c3f8ff9a9ce3d6600d8382914
parent 1f922254f0c984a8fb9fbaa0c390d7ffae49aedb

* llama : accept a list of devices to use to offload a model

* accept `--dev none` to completely disable offloading

* fix dev list with dl backends

* rename env parameter to LLAMA_ARG_DEVICE for consistency
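
The behavior described in the notes above can be illustrated with a minimal sketch. It is not the code from this commit. It assumes the device list arrives as a comma-separated string (for example from `--dev` or `LLAMA_ARG_DEVICE`) and that the single value `none` disables offloading. `DeviceSelection`, `split_commas`, `parse_device_selection`, and device names such as `CUDA0` are hypothetical and used only for illustration.

```cpp
#include <algorithm>
#include <sstream>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical result type: which devices the model should be offloaded to.
struct DeviceSelection {
    bool offload_disabled = false;     // true when the user passed "none"
    std::vector<std::string> devices;  // selected device names, in the order given
};

// Split a comma-separated string such as "CUDA0,CUDA1" into its parts.
static std::vector<std::string> split_commas(const std::string & value) {
    std::vector<std::string> parts;
    std::stringstream ss(value);
    std::string item;
    while (std::getline(ss, item, ',')) {
        parts.push_back(item);
    }
    return parts;
}

// Hypothetical helper: check each requested name against the devices that exist.
// A lone "none" turns offloading off; an unknown name is reported as an error.
DeviceSelection parse_device_selection(const std::string & value,
                                       const std::vector<std::string> & available) {
    DeviceSelection sel;
    const std::vector<std::string> names = split_commas(value);
    if (names.size() == 1 && names[0] == "none") {
        sel.offload_disabled = true;
        return sel;
    }
    if (names.empty()) {
        throw std::invalid_argument("empty device list");
    }
    for (const auto & name : names) {
        if (std::find(available.begin(), available.end(), name) == available.end()) {
            throw std::invalid_argument("invalid device: " + name);
        }
        sel.devices.push_back(name);
    }
    return sel;
}
```

With an assumed set of available devices `{"CUDA0", "CUDA1"}`, calling `parse_device_selection("CUDA1,CUDA0", available)` keeps both names in the order given, and `parse_device_selection("none", available)` returns an empty list with `offload_disabled` set. Rejecting unknown names early is a design choice of this sketch, so a typo does not silently fall back to a different device.
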
common/arg.cpp
common/common.cpp
common/common.h
examples/server/server.cpp
examples/speculative-simple/speculative-simple.cpp
examples/speculative/speculative.cpp
ggml/src/ggml-backend-reg.cpp
include/llama.h
src/llama.cpp