git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: fix bad asserts for partial offload (llama/13337)
author Johannes Gäßler <redacted>
Tue, 6 May 2025 11:58:51 +0000 (13:58 +0200)
committer Georgi Gerganov <redacted>
Wed, 7 May 2025 14:44:35 +0000 (17:44 +0300)
commit 1985c3f680b6f32da15934251206ccf2c345b715
tree 6d71d59e46a35b644192f3bc79a4c05874a160a6
parent 312bc0eaeed7d98777fb7de29b2b237e5a725fc6
include/ggml.h
src/ggml-cuda/fattn-common.cuh
src/ggml-cuda/ggml-cuda.cu
src/ggml-cuda/mmq.cu
src/ggml-cuda/mmvq.cu
src/ggml.c