]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CANN: Improve the Inferencing Performance for Ascend NPU Device (llama/10454)
authorShanshan Shen <redacted>
Tue, 26 Nov 2024 10:08:37 +0000 (18:08 +0800)
committerGeorgi Gerganov <redacted>
Sun, 8 Dec 2024 18:14:35 +0000 (20:14 +0200)
commit9a5ef7b1693a3081a3a50c467abf1c70d6625b18
tree608e8009ea915ee1db7b4842e3aee4d8a698eeaf
parent453cc0fcf192e638feaa3661031f4d1589b99c26
CANN: Improve the Inferencing Performance for Ascend NPU Device (llama/10454)

* improve inferencing performance for ascend npu.

Co-authored-by: Frank Mai <redacted>
* some modification after review

* some modifications after review

* restore some modifications

* restore some modifications

---------

Co-authored-by: shanshan shen <redacted>
Co-authored-by: Frank Mai <redacted>
ggml/src/ggml-cann/aclnn_ops.cpp
ggml/src/ggml-cann/common.h
ggml/src/ggml-cann/ggml-cann.cpp