git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Max Krasnyansky <redacted>
	Thu, 12 Feb 2026 07:04:27 +0000 (23:04 -0800)
committer	GitHub <redacted>
	Thu, 12 Feb 2026 07:04:27 +0000 (23:04 -0800)
commit	b1ff83bbb08bd946f79b89bc7adbd7f08ae61303
tree	3197d30c41bbd1d449a2ba100acb89ee784e9fdd	tree
parent	4ae1b7517a787f5b37776b0598ed2b69e6caf5bd	commit \| diff

hexagon: further optimization and tuning of matmul and dot kernels (#19407)

* ggml-hexagon: implement 2x2 matmul kernel

* hexmm: implement vec_dot_rx2x2 for Q8_0 and MXFP4

* hexagon: fix editor config failures

* hexagon: refactor matmul ops to use context struct and remove wrappers

Also implement vec_dot_f16 2x2

* hexagon: refactor dyn quantizers to use mmctx

* hexagon: remove mm fastdiv from op_ctx

* hexagon: refactor matmul entry point to reduce code duplication

---------

Co-authored-by: Trivikram Reddy <redacted>

ggml/src/ggml-hexagon/htp/htp-ops.h		diff \| blob \| history
ggml/src/ggml-hexagon/htp/matmul-ops.c		diff \| blob \| history