]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
opencl: fix for small models (#11950)
authorlhez <redacted>
Mon, 24 Feb 2025 21:47:07 +0000 (13:47 -0800)
committerGitHub <redacted>
Mon, 24 Feb 2025 21:47:07 +0000 (14:47 -0700)
commit34a846b5847a18d133b360074f1fb485b2632b2d
treed62b702618bd054b3f77fe1f6f8632079f40d979
parent7a2c913e66353362d7f28d612fd3c9d51a831eda
opencl: fix for small models (#11950)

* opencl: fix small shape gemv, remove unused extensions

* opencl: fix `transpose_16`, `dump_tensor`, enforce subgroup size

* opencl: fix for token length < 4

* opencl: use wave size of 64 for all Adreno GPUs

---------

Co-authored-by: Shawn Gu <redacted>
Co-authored-by: Skyler Szot <redacted>
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/ggml-opencl.cl
ggml/src/ggml-opencl/kernels/ggml-opencl_gemv_noshuffle.cl
ggml/src/ggml-opencl/kernels/ggml-opencl_gemv_noshuffle_general.cl
ggml/src/ggml-opencl/kernels/ggml-opencl_mul_mat_Ab_Bi_8x4.cl
ggml/src/ggml-opencl/kernels/ggml-opencl_transpose_16.cl