]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
hexagon: further optimization and tuning of matmul and dot kernels (llama/19407)
authorMax Krasnyansky <redacted>
Thu, 12 Feb 2026 07:04:27 +0000 (23:04 -0800)
committerGeorgi Gerganov <redacted>
Sat, 14 Feb 2026 22:20:18 +0000 (00:20 +0200)
commitab9a5484bf52fc8648501354120dc3bd34e7587c
tree736cec7cc9052897da68eb40c198edb37de242b4
parent2eb9686f401782c710fc361f44f145a6324bf93d
hexagon: further optimization and tuning of matmul and dot kernels (llama/19407)

* ggml-hexagon: implement 2x2 matmul kernel

* hexmm: implement vec_dot_rx2x2 for Q8_0 and MXFP4

* hexagon: fix editor config failures

* hexagon: refactor matmul ops to use context struct and remove wrappers

Also implement vec_dot_f16 2x2

* hexagon: refactor dyn quantizers to use mmctx

* hexagon: remove mm fastdiv from op_ctx

* hexagon: refactor matmul entry point to reduce code duplication

---------

Co-authored-by: Trivikram Reddy <redacted>
src/ggml-hexagon/htp/htp-ops.h
src/ggml-hexagon/htp/matmul-ops.c