]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CANN: Improve the Inferencing Performance for Ascend NPU Device (#10454)
authorShanshan Shen <redacted>
Tue, 26 Nov 2024 10:08:37 +0000 (18:08 +0800)
committerGitHub <redacted>
Tue, 26 Nov 2024 10:08:37 +0000 (18:08 +0800)
commit9a4b79bcfa4338b922fa8cf903bd5ac058aaf46f
tree35787a6ef86c0a4f69d74eb8f81081c6ebb7ede4
parent7066b4cce2898993e943ad6af5d8f1de5840c8e9
CANN: Improve the Inferencing Performance for Ascend NPU Device (#10454)

* improve inferencing performance for ascend npu.

Co-authored-by: Frank Mai <redacted>
* some modification after review

* some modifications after review

* restore some modifications

* restore some modifications

---------

Co-authored-by: shanshan shen <redacted>
Co-authored-by: Frank Mai <redacted>
ggml/src/ggml-cann/aclnn_ops.cpp
ggml/src/ggml-cann/common.h
ggml/src/ggml-cann/ggml-cann.cpp