]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
opencl: fix for small models (llama/11950)
authorlhez <redacted>
Mon, 24 Feb 2025 21:47:07 +0000 (13:47 -0800)
committerGeorgi Gerganov <redacted>
Thu, 27 Feb 2025 06:55:36 +0000 (08:55 +0200)
commitb43b9d928c0164d094cc5db82b5f7e2f95af51da
tree0a27a09991048ab4e1bf1905b67de12fb9743796
parente3cb412a59fcc33d5543d66fc2146b5c1e126cb9
opencl: fix for small models (llama/11950)

* opencl: fix small shape gemv, remove unused extensions

* opencl: fix `transpose_16`, `dump_tensor`, enforce subgroup size

* opencl: fix for token length < 4

* opencl: use wave size of 64 for all Adreno GPUs

---------

Co-authored-by: Shawn Gu <redacted>
Co-authored-by: Skyler Szot <redacted>
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/ggml-opencl.cl
ggml/src/ggml-opencl/kernels/ggml-opencl_gemv_noshuffle.cl
ggml/src/ggml-opencl/kernels/ggml-opencl_gemv_noshuffle_general.cl
ggml/src/ggml-opencl/kernels/ggml-opencl_mul_mat_Ab_Bi_8x4.cl
ggml/src/ggml-opencl/kernels/ggml-opencl_transpose_16.cl