]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CANN: support gated linear attn (llama/18653)
authorhipudding <redacted>
Fri, 16 Jan 2026 08:18:49 +0000 (16:18 +0800)
committerGeorgi Gerganov <redacted>
Fri, 30 Jan 2026 13:56:40 +0000 (15:56 +0200)
commit854274a297335c8e91df00cc14e3deb802b73367
tree25db8e2ce7d7e2b676de3234fa4269601dd88975
parented6004d051b9b914d4bb94fe9595cb0f6df93aa5
CANN: support gated linear attn (llama/18653)

* CANN: support gated linear attn

This change adds support for the GGML_OP_GATED_LINEAR_ATTN operator.
The feature was implemented by YushengZhao. Because the previous
submission was based on an outdated codebase, this PR was rebased to
merge.

Co-authored-by: YushengZhao <redacted>
Co-authored-by: hipudding <redacted>
* CANN: optimize OP gla

Optimize gla for high preformance

* Remove unused comments

---------

Co-authored-by: 赵禹昇 <redacted>
Co-authored-by: YushengZhao <redacted>
ggml/src/ggml-cann/aclnn_ops.cpp
ggml/src/ggml-cann/aclnn_ops.h
ggml/src/ggml-cann/ggml-cann.cpp