]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: fix crash with partial offloading of MoE (llama/13439)
authorJohannes Gäßler <redacted>
Sun, 11 May 2025 14:09:33 +0000 (16:09 +0200)
committerGeorgi Gerganov <redacted>
Tue, 13 May 2025 10:02:19 +0000 (13:02 +0300)
commitf278116a357c5a5fc9df264cb31325602b6ed18c
tree4b1dda13342433d9bc9c6c495c91b46b3659cbca
parent6c46cbe30ef5fd644771570293d8190cb32b7348
CUDA: fix crash with partial offloading of MoE (llama/13439)
src/ggml-cuda/ggml-cuda.cu
src/ggml-cuda/mmq.cu
src/ggml-cuda/mmvq.cu