git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: fix MMQ nwarps for AMD with warp_size==32 (#15014)
author Johannes Gäßler <redacted>
Fri, 1 Aug 2025 18:47:32 +0000 (20:47 +0200)
committer GitHub <redacted>
Fri, 1 Aug 2025 18:47:32 +0000 (20:47 +0200)
commit 9c35706b98ea271858acef4194f526a71b24cdc9
tree 5985558fffc2d3cbf78127e1f201228c3ca43dfe
parent c76b420e4ce06f7b7cdfbb0b85d02c90e5cc5a3a
ggml/src/ggml-cuda/mmq.cuh