git.djapps.eu Git - pkg/ggml/sources/ggml/commit
CANN: Add the basic supports of Flash Attention kernel (llama/13627)
author Bizhao Shi <redacted>
Mon, 26 May 2025 02:20:18 +0000 (10:20 +0800)
committer Georgi Gerganov <redacted>
Tue, 27 May 2025 13:58:49 +0000 (16:58 +0300)
commit 676fa0d7cb453cb6c54b34624d1ca096aab4dac1
tree 336c5bfaf6e1b250f0cd964c3ee9b98f3a4c98f3
parent 6757e3217897ee033a0a3178203b5c4b0254d29e

* cann: add the basic FA support

* cann: update the readme

* cann: update the FlashAttention with PSEShift

* cann: update the input parameters in FA

* cann: update the alibi with max_bias

* cann: add the constraints of softcap

* cann: update the docs CANN.md

* cann: update the docs CANN.md

* cann: fix typo of CANN.md

* cann: add some comments and update the CANN.md

* cann: update the CANN.md

* cann: update the inner precise for fusedInferAttention

* cann: update the constraints of flash_attn_ext on ggml-cann.cpp

* cann: clean the whitespace

* cann: clean the whitespace

* cann: add a new endline
src/ggml-cann/CMakeLists.txt [changed mode: 0644->0755]
src/ggml-cann/Doxyfile [changed mode: 0644->0755]
src/ggml-cann/acl_tensor.cpp [changed mode: 0644->0755]
src/ggml-cann/acl_tensor.h [changed mode: 0644->0755]
src/ggml-cann/aclnn_ops.cpp [changed mode: 0644->0755]
src/ggml-cann/aclnn_ops.h [changed mode: 0644->0755]
src/ggml-cann/common.h [changed mode: 0644->0755]
src/ggml-cann/ggml-cann.cpp [changed mode: 0644->0755]