]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CUDA: fix crash on large batch size for quant. MoE (llama/13537)
authorJohannes Gäßler <redacted>
Wed, 14 May 2025 14:41:02 +0000 (16:41 +0200)
committerGeorgi Gerganov <redacted>
Mon, 19 May 2025 10:37:56 +0000 (13:37 +0300)
commit9f87acbcffb22c26fe9359d40afb07fc0eb10901
treee567bb9551aff919fa440c4c209ea360f08e633e
parentc91952e75b208479622a395456307dc40cd7c827
CUDA: fix crash on large batch size for quant. MoE (llama/13537)
src/ggml-cuda/mmq.cu
src/ggml-cuda/quantize.cu