]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
opencl: add optimized q4_1 mm kernel for adreno (llama/19840)
authorshaofeiqi <redacted>
Tue, 3 Mar 2026 03:49:41 +0000 (19:49 -0800)
committerGeorgi Gerganov <redacted>
Mon, 16 Mar 2026 11:10:15 +0000 (13:10 +0200)
commit3a96680718399a9b61d0ad5c41438ca2096893a9
treea2e12ee13d03672a80e0413fd61aafc837ad04d1
parent3145384715c1af0a4895fcac5a77d75b6f02028c
opencl: add optimized q4_1 mm kernel for adreno (llama/19840)

* Add Q4_1 OpenCL Kernels

* opencl: refactor transpose

* opencl: format

* opencl: refactor q4_1 unpack

* opencl: move `ggml_cl_mul_mat_q4_1_f32_adreno`

* opencl: refactor `ggml_cl_mul_mat_q4_1_f32_adreno` and kernels

* opencl: rename kernel files and kernes

* opencl: fix build for non adreno

* opencl: move code around and format

---------

Co-authored-by: Li He <redacted>
ggml/src/ggml-opencl/CMakeLists.txt
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/cvt.cl
ggml/src/ggml-opencl/kernels/gemm_noshuffle_q4_1_f32.cl [new file with mode: 0644]
ggml/src/ggml-opencl/kernels/gemv_noshuffle_general_q8_0_f32.cl
ggml/src/ggml-opencl/kernels/gemv_noshuffle_q4_1_f32.cl [new file with mode: 0644]
ggml/src/ggml-opencl/kernels/transpose.cl