]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
hexagon: further optimization and tuning of matmul and dot kernels (#19407)
authorMax Krasnyansky <redacted>
Thu, 12 Feb 2026 07:04:27 +0000 (23:04 -0800)
committerGitHub <redacted>
Thu, 12 Feb 2026 07:04:27 +0000 (23:04 -0800)
commitb1ff83bbb08bd946f79b89bc7adbd7f08ae61303
tree3197d30c41bbd1d449a2ba100acb89ee784e9fdd
parent4ae1b7517a787f5b37776b0598ed2b69e6caf5bd
hexagon: further optimization and tuning of matmul and dot kernels (#19407)

* ggml-hexagon: implement 2x2 matmul kernel

* hexmm: implement vec_dot_rx2x2 for Q8_0 and MXFP4

* hexagon: fix editor config failures

* hexagon: refactor matmul ops to use context struct and remove wrappers

Also implement vec_dot_f16 2x2

* hexagon: refactor dyn quantizers to use mmctx

* hexagon: remove mm fastdiv from op_ctx

* hexagon: refactor matmul entry point to reduce code duplication

---------

Co-authored-by: Trivikram Reddy <redacted>
ggml/src/ggml-hexagon/htp/htp-ops.h
ggml/src/ggml-hexagon/htp/matmul-ops.c