]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
Multi GPU support, CUDA refactor, CUDA scratch buffer (#1703)
authorJohannes Gäßler <redacted>
Tue, 6 Jun 2023 19:33:23 +0000 (21:33 +0200)
committerGitHub <redacted>
Tue, 6 Jun 2023 19:33:23 +0000 (21:33 +0200)
commit17366df842e358768c0df7024484fffecfc7865b
treef042c8142311d45f8712db10debf89111b2c7e57
parent44f906e8537fcec965e312d621c80556d6aa9bec
Multi GPU support, CUDA refactor, CUDA scratch buffer (#1703)

* CUDA multi GPU + scratch

ggml_cuda_compute_forward

Tensor parallelism

ggml_cuda_add

ggml_cuda_rms_norm

ggml_cuda_silu

CUDA scratch buffer

--main-gpu CLI option
12 files changed:
examples/common.cpp
examples/common.h
examples/main/README.md
examples/server/README.md
examples/server/server.cpp
ggml-cuda.cu
ggml-cuda.h
ggml-opencl.cpp
ggml.c
ggml.h
llama.cpp
llama.h