git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit


author	shaofeiqi <redacted>
	Fri, 30 Jan 2026 18:19:27 +0000 (10:19 -0800)
committer	Georgi Gerganov <redacted>
	Sun, 8 Feb 2026 07:29:10 +0000 (09:29 +0200)
commit	db9c88744de9e7fa775284929fa03f32e6e813ec
tree	1cc18f84a50639c76522915dea389f957f74af01
parent	efd6344939a03c0e5fd41220856055040d0712fd

opencl: add optimized q8_0 mm kernel for adreno (llama/18871)

* Add Q8_0 OpenCL kernel

Co-authored-by: yunjie <redacted>

* opencl: fix build for non-adreno

* opencl: refactor q8_0

* opencl: enforce subgroup size of 64 for adreno for q8_0

* For A750 and older generations, subgroup size can be 64 or 128.
This kernel assumes subgroup size 64.

* opencl: suppress warning when adreno kernels are disabled

---------

Co-authored-by: yunjie <redacted>
Co-authored-by: Li He <redacted>

ggml/src/ggml-opencl/CMakeLists.txt
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/cvt.cl
ggml/src/ggml-opencl/kernels/gemv_noshuffle_general_q8_0_f32.cl	[new file with mode: 0644]
ggml/src/ggml-opencl/kernels/mul_mm_q8_0_f32_8x4.cl	[new file with mode: 0644]
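
For readers unfamiliar with the format named in the commit subject, the following is a minimal CPU reference sketch of what a Q8_0 x f32 matrix multiplication computes. It is not the Adreno kernel added by this commit and says nothing about its tiling or subgroup usage. It assumes the usual ggml Q8_0 layout of 32 signed 8-bit values per block sharing one scale. The names `BlockQ8_0`, `quantize_q8_0`, `dot_q8_0_f32` and `mul_mat_q8_0_f32`, as well as the matrix layout, are hypothetical and chosen for illustration; the scale is kept as a `float` here for simplicity, whereas ggml stores it as fp16.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <vector>

// Block size of the Q8_0 format: 32 quantized values share one scale.
constexpr int QK8_0 = 32;

// Illustrative block type (assumption: float scale instead of fp16).
struct BlockQ8_0 {
    float       d;          // per-block scale
    std::int8_t qs[QK8_0];  // quantized values in [-127, 127]
};

// Quantize a float vector whose length is a multiple of QK8_0.
std::vector<BlockQ8_0> quantize_q8_0(const std::vector<float>& x) {
    assert(x.size() % QK8_0 == 0);
    std::vector<BlockQ8_0> out(x.size() / QK8_0);
    for (std::size_t b = 0; b < out.size(); ++b) {
        const float* src = x.data() + b * QK8_0;
        float amax = 0.0f;  // largest magnitude in this block
        for (int i = 0; i < QK8_0; ++i) {
            amax = std::fmax(amax, std::fabs(src[i]));
        }
        const float d     = amax / 127.0f;
        const float inv_d = (d != 0.0f) ? 1.0f / d : 0.0f;
        out[b].d = d;
        for (int i = 0; i < QK8_0; ++i) {
            out[b].qs[i] = static_cast<std::int8_t>(std::lround(src[i] * inv_d));
        }
    }
    return out;
}

// Dot product of one quantized row (k / QK8_0 blocks) with k floats.
float dot_q8_0_f32(const BlockQ8_0* row, const float* y, int k) {
    float sum = 0.0f;
    for (int b = 0; b < k / QK8_0; ++b) {
        float partial = 0.0f;
        for (int i = 0; i < QK8_0; ++i) {
            partial += static_cast<float>(row[b].qs[i]) * y[b * QK8_0 + i];
        }
        sum += row[b].d * partial;  // apply the block scale once per block
    }
    return sum;
}

// c[col * m + row] = dot(row `row` of a, column vector `col` of b).
// a: m rows of k / QK8_0 blocks; b: n vectors of k floats (assumed layout).
void mul_mat_q8_0_f32(const std::vector<BlockQ8_0>& a, int m,
                      const std::vector<float>& b, int n, int k,
                      std::vector<float>& c) {
    const int blocks_per_row = k / QK8_0;
    c.assign(static_cast<std::size_t>(m) * n, 0.0f);
    for (int col = 0; col < n; ++col) {
        for (int row = 0; row < m; ++row) {
            c[static_cast<std::size_t>(col) * m + row] = dot_q8_0_f32(
                a.data() + static_cast<std::size_t>(row) * blocks_per_row,
                b.data() + static_cast<std::size_t>(col) * k, k);
        }
    }
}
```

A reference of this kind is mainly useful for checking a GPU kernel's output within a tolerance; any optimized kernel would be expected to differ in memory layout and accumulation order.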

Packaging of ggerganov/whisper.cpp
