git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
author shaofeiqi <redacted>
Sun, 16 Nov 2025 01:33:10 +0000 (17:33 -0800)
committer Georgi Gerganov <redacted>
Mon, 17 Nov 2025 19:05:46 +0000 (21:05 +0200)
commit c78845bfa902f8d813c3bf7e8d6f1edf8c8c2341
tree eb44abb03edfc5a07ed659f8357180d8acda8875
parent 1fd63da9f24e951146971df2bc8f0f011c42b21e
opencl: add kernel to handle mat mul in attention to improve encoding speed (llama/17181)

* Add mul_mm_f16_f32_kq_kqv kernel

* Add ggml_cl_mul_mat_kq_kqv_adreno func

* fix whitespace

* remove unused variable

* remove redundant

* refactor and clean up

* remove trailing whitespace
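
For background, the "kq" and "kqv" in the kernel name presumably refer to the two matrix multiplications in attention: the K·Q score product and the product of the softmaxed scores with V. The commit message does not describe the kernel's internals, so the following is only a plain-Python reference sketch of those two products, not the OpenCL implementation; the function names (`matmul`, `softmax_rows`, `attention`) and the row-major list-of-lists layout are assumptions made for illustration.

```python
import math

def matmul(a, b):
    """Plain matrix product of two row-major lists of lists."""
    inner = len(b)
    cols = len(b[0])
    return [[sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
            for row in a]

def softmax_rows(m):
    """Numerically stable softmax applied to each row."""
    out = []
    for row in m:
        mx = max(row)
        exps = [math.exp(x - mx) for x in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

def attention(q, k, v):
    """Return (kq, kqv) for one attention head.

    q: n_q x d, k: n_kv x d, v: n_kv x d_v (hypothetical layout).
    """
    d = len(q[0])
    k_t = [list(col) for col in zip(*k)]      # transpose K to d x n_kv
    kq = matmul(q, k_t)                       # first matmul: raw scores
    scaled = [[x / math.sqrt(d) for x in row] for row in kq]
    probs = softmax_rows(scaled)
    kqv = matmul(probs, v)                    # second matmul: weighted sum of V rows
    return kq, kqv
```

Both products run for every head and layer, which is consistent with the commit's stated goal of speeding up encoding with a dedicated kernel for them.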
ggml/src/ggml-opencl/CMakeLists.txt
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/mul_mm_f16_f32_kq_kqv.cl [new file with mode: 0644]