]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
opencl: optimize mxfp4 kernels (llama/16037)
authorShawn Gu <redacted>
Thu, 18 Sep 2025 19:03:34 +0000 (12:03 -0700)
committerGeorgi Gerganov <redacted>
Sat, 20 Sep 2025 10:46:39 +0000 (13:46 +0300)
commitf4a225cea61ac214adc24b90145042d5b4bca933
tree5a041c781a6af3eeba8c5a319b117ac44b1cda9c
parent7fcb7e83eca1cff7380c332500412722f410502f
opencl: optimize mxfp4 kernels (llama/16037)

- flatten mxfp4 and packed fp4->fp16 bit-wise convert function (replace lut)
- MoE kernel optimizations

---------

Co-authored-by: Li He <redacted>
ggml/src/ggml-opencl/CMakeLists.txt
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/cvt.cl
ggml/src/ggml-opencl/kernels/mul_mv_id_mxfp4_f32_flat.cl [new file with mode: 0644]
ggml/src/ggml-opencl/kernels/mul_mv_mxfp4_f32_flat.cl [new file with mode: 0644]