]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
mmq.cu: tune mmq/rocblas switching for RDNA (llama/18537)
authorBeinsezii <redacted>
Tue, 6 Jan 2026 15:26:07 +0000 (07:26 -0800)
committerGeorgi Gerganov <redacted>
Wed, 14 Jan 2026 07:11:59 +0000 (09:11 +0200)
commited674cfc1080474998c57c89c328e83b543e6341
tree2f65c658ae6947159ff96ddf9d87589f866d56a0
parent5520f273634e3bdd5758b44940d14ebd4d313144
mmq.cu: tune mmq/rocblas switching for RDNA (llama/18537)

* Patch perf regression for mmq kernels in ROCm

recover performance regression for https://github.com/ggml-org/llama.cpp/issues/17917

* add n_experts branch like the cdna path

* mmq.cu: tune mmq/wmma switching for RDNA

* mmq.cu: move amd wmma mmq/wmma switching behind IS_RDNA3

* Update ggml/src/ggml-cuda/mmq.cu

Co-authored-by: Johannes Gäßler <redacted>
---------

Co-authored-by: Jiacheng (Jason) Chen <redacted>
Co-authored-by: jiachengjason <redacted>
Co-authored-by: Johannes Gäßler <redacted>
ggml/src/ggml-cuda/mmq.cu