]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
opencl: add q8_0 mm support (llama/16469)
authorlhez <redacted>
Wed, 15 Oct 2025 17:51:04 +0000 (10:51 -0700)
committerGeorgi Gerganov <redacted>
Wed, 22 Oct 2025 09:58:11 +0000 (12:58 +0300)
commitbef9f74553e4dde2e1ac19f116b41a11bc4ce283
treec05037956ecfaa65597b322a813955614e532bef
parent16dab3d122232fc09d2c05a9ed7732f429164c6a
opencl: add q8_0 mm support (llama/16469)

* opencl: add mm_q8_0_f32

* opencl: fix data loading for incomplete tile

* opencl: use q8_0 mm for larger matrix

* opencl: add some tests to cover the path
ggml/src/ggml-opencl/CMakeLists.txt
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/mul_mm_f16_f32_l4_lm.cl
ggml/src/ggml-opencl/kernels/mul_mm_f32_f32_l4_lm.cl
ggml/src/ggml-opencl/kernels/mul_mm_q8_0_f32_l4_lm.cl [new file with mode: 0644]