]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CANN: Add the basic supports of Flash Attention kernel (llama/13627)
authorBizhao Shi <redacted>
Mon, 26 May 2025 02:20:18 +0000 (10:20 +0800)
committerGeorgi Gerganov <redacted>
Tue, 27 May 2025 15:03:00 +0000 (18:03 +0300)
commite35fecc2a1da4f39c7f215502253ddf70d8e615f
tree24bc60307b970361b92169fafd10e312a1ac64c6
parent1cd702842889833b554e1496c9175fdc744c5601
CANN: Add the basic supports of Flash Attention kernel (llama/13627)

* cann: add the basic FA support

* cann: update the readme

* cann: update the FlashAttention with PSEShift

* cann: update the input parameters in FA

* cann: update the alibi with max_bias

* cann: add the constrints of softcap

* cann: update the docs CANN.md

* cann: update the docs CANN.md

* cann: fix typo of CANN.md

* cann: add some comments and update the CANN.md

* cann: update the CANN.md

* cann: update the inner precise for fusedInferAttention

* cann: update the constraints of flash_attn_ext on ggml-cann.cpp

* cann: clean the whitespace

* cann: clean the whitespace

* cann: add a new endline
ggml/src/ggml-cann/CMakeLists.txt [changed mode: 0644->0755]
ggml/src/ggml-cann/Doxyfile [changed mode: 0644->0755]
ggml/src/ggml-cann/acl_tensor.cpp [changed mode: 0644->0755]
ggml/src/ggml-cann/acl_tensor.h [changed mode: 0644->0755]
ggml/src/ggml-cann/aclnn_ops.cpp [changed mode: 0644->0755]
ggml/src/ggml-cann/aclnn_ops.h [changed mode: 0644->0755]
ggml/src/ggml-cann/common.h [changed mode: 0644->0755]
ggml/src/ggml-cann/ggml-cann.cpp [changed mode: 0644->0755]