git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit


author	Reese Levine <redacted>	Thu, 2 Oct 2025 18:00:31 +0000 (11:00 -0700)
committer	GitHub <redacted>	Thu, 2 Oct 2025 18:00:31 +0000 (11:00 -0700)
commit	ef07a4090672a3438d7f64f197795d7dc1c18957
tree	3ebd6f92316f83e507e7823f4e0635c075c76175
parent	34fcc5a4ace8c69476ef2ea3857f39a60334acc4

ggml webgpu: add support for soft_max, optimize rms_norm (#16357)

* Add inplace softmax

* Move rms_norm to split row approach

* Update debug for supports_op

* clean up debug statements

* Update tests/test-backend-ops.cpp

Co-authored-by: Georgi Gerganov <redacted>

ggml/include/ggml.h
ggml/src/ggml-webgpu/ggml-webgpu.cpp
ggml/src/ggml-webgpu/wgsl-shaders/rms_norm.wgsl
ggml/src/ggml-webgpu/wgsl-shaders/soft_max.tmpl.wgsl	[new file with mode: 0644]
ggml/src/ggml.c
tests/test-backend-ops.cpp

Packaging of ggml-org/llama.cpp
