git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit


author	Vishal Singh <redacted>
	Fri, 3 Apr 2026 09:19:08 +0000 (14:49 +0530)
committer	GitHub <redacted>
	Fri, 3 Apr 2026 09:19:08 +0000 (12:19 +0300)
commit	f1ac84119ccc8e72dafd9e9f8fc3b9399917ce11
tree	5cfcaeeafb4eb288f76050ce96caa3a0e1f47457
parent	b069b10ab48f25ba119e59d0b8bf35d4f06e093f

ggml-zendnn : add MUL_MAT_ID op support for MoE models (#21315)

* ggml-zendnn : add MUL_MAT_ID op support for MoE models
- Add MUL_MAT_ID op acceleration for Mixture-of-Experts (MoE) models
- Fall back to the CPU backend for MUL_MAT_ID when the total number of experts exceeds 32
- Point the ZenDNN library to the latest bits, ZenDNN-2026-WW13

* ggml-zendnn : add braces to sgemm failure condition for consistency

Co-authored-by: Aaron Teo <redacted>
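
For readers unfamiliar with the op, the following is a minimal reference sketch of what MUL_MAT_ID computes for a single token, together with the expert-count check described in the commit message. It is not the ZenDNN backend code: `kMaxExperts`, `use_accelerated_mul_mat_id`, and `mul_mat_id` are hypothetical names chosen for illustration, and the data layout (one row-major matrix per expert) is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical constant mirroring the commit message: with more experts
// than this, the op is left to the CPU backend.
constexpr int kMaxExperts = 32;

// Hypothetical helper: true if the accelerated path should take the op.
bool use_accelerated_mul_mat_id(int n_experts) {
    return n_experts <= kMaxExperts;
}

// Reference MUL_MAT_ID for one token (assumed layout, illustration only):
//   experts : one rows x cols weight matrix per expert, row-major
//   x       : input activation vector of length cols
//   ids     : indices of the experts routed to this token
// Returns one output vector of length rows per selected expert.
std::vector<std::vector<float>> mul_mat_id(
        const std::vector<std::vector<float>> & experts,
        std::size_t rows, std::size_t cols,
        const std::vector<float> & x,
        const std::vector<int> & ids) {
    assert(x.size() == cols);
    std::vector<std::vector<float>> out;
    out.reserve(ids.size());
    for (int id : ids) {
        assert(id >= 0 && static_cast<std::size_t>(id) < experts.size());
        const std::vector<float> & w = experts[id];
        assert(w.size() == rows * cols);
        std::vector<float> y(rows, 0.0f);
        // Plain matrix-vector product with the selected expert's weights.
        for (std::size_t r = 0; r < rows; ++r) {
            for (std::size_t c = 0; c < cols; ++c) {
                y[r] += w[r * cols + c] * x[c];
            }
        }
        out.push_back(y);
    }
    return out;
}
```

The point of the sketch is that only the experts listed in `ids` are multiplied, which is what distinguishes MUL_MAT_ID from a plain MUL_MAT. How the actual backend batches tokens per expert is not shown here.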

Packaging of ggml-org/llama.cpp


docs/backend/ZenDNN.md
docs/ops.md
docs/ops/ZenDNN.csv
ggml/src/ggml-zendnn/CMakeLists.txt
ggml/src/ggml-zendnn/ggml-zendnn.cpp