]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
opencl: add q4_K gemm and gemv kernels for Adreno (#20919)
authorshaofeiqi <redacted>
Mon, 30 Mar 2026 19:19:16 +0000 (12:19 -0700)
committerGitHub <redacted>
Mon, 30 Mar 2026 19:19:16 +0000 (12:19 -0700)
commit08f21453aec846867b39878500d725a05bd32683
tree04c3bf5520f57819404adcb230060a6181e4dfcc
parent84ae8434d0d00b52a8bab7f584b2963e129ccbfc
opencl: add q4_K gemm and gemv kernels for Adreno (#20919)

* opencl: add q4_K gemm and gemv kernels for Adreno

* opencl: fix whitespace

* opencl: add workarounds for compiler bugs on older devices

* opencl: handle fp16 denorm on X Elite

* opencl: fix kernel build error

* opencl: fix whitespace

* opencl: make q4_K cvt kernels signature consistent

---------

Co-authored-by: Li He <redacted>
ggml/src/ggml-opencl/CMakeLists.txt
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/cvt.cl
ggml/src/ggml-opencl/kernels/gemm_noshuffle_q4_k_f32.cl [new file with mode: 0644]
ggml/src/ggml-opencl/kernels/gemv_noshuffle_q4_k_f32.cl [new file with mode: 0644]