]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CANN: support gated linear attn (#18653)
authorhipudding <redacted>
Fri, 16 Jan 2026 08:18:49 +0000 (16:18 +0800)
committerGitHub <redacted>
Fri, 16 Jan 2026 08:18:49 +0000 (16:18 +0800)
commitbaa4ba0aecde5bce2e801b6bd7ecf020219bf2b7
treeba6b0190a236bc29d0f31dad213bfcdce9969de7
parent785a71008573e2d84728fb0ba9e851d72d3f8fab
CANN: support gated linear attn (#18653)

* CANN: support gated linear attn

This change adds support for the GGML_OP_GATED_LINEAR_ATTN operator.
The feature was implemented by YushengZhao. Because the previous
submission was based on an outdated codebase, this PR was rebased to
merge.

Co-authored-by: YushengZhao <redacted>
Co-authored-by: hipudding <redacted>
* CANN: optimize OP gla

Optimize gla for high preformance

* Remove unused comments

---------

Co-authored-by: 赵禹昇 <redacted>
Co-authored-by: YushengZhao <redacted>
ggml/src/ggml-cann/aclnn_ops.cpp
ggml/src/ggml-cann/aclnn_ops.h
ggml/src/ggml-cann/ggml-cann.cpp