git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	hipudding <redacted>
	Tue, 2 Sep 2025 09:12:37 +0000 (17:12 +0800)
committer	Georgi Gerganov <redacted>
	Fri, 5 Sep 2025 09:54:11 +0000 (12:54 +0300)
commit	0db3a9b20a8d6c945fcf96f1ea87e8107ce910a7
tree	b22845f70170b36b445bee5513365fbbe4456a68	tree
parent	3f53ddab61ff8cedca002b65bc6f340903872050	commit \| diff

CANN: Resolve soft_max precision issue (llama/15730)

Previously, the slope tensor was set to fp16 to improve efficiency.
While this worked correctly in FA, it caused precision issues in soft_max.
This change applies different data types for different operators
to balance both accuracy and performance.

src/ggml-cann/aclnn_ops.cpp

diff | blob | history

Packaging of ggml-org/ggml

RSS Atom