]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
ggml webgpu: add support for soft_max, optimize rms_norm (#16357)
authorReese Levine <redacted>
Thu, 2 Oct 2025 18:00:31 +0000 (11:00 -0700)
committerGitHub <redacted>
Thu, 2 Oct 2025 18:00:31 +0000 (11:00 -0700)
commitef07a4090672a3438d7f64f197795d7dc1c18957
tree3ebd6f92316f83e507e7823f4e0635c075c76175
parent34fcc5a4ace8c69476ef2ea3857f39a60334acc4
ggml webgpu: add support for soft_max, optimize rms_norm (#16357)

* Add inplace softmax

* Move rms_norm to split row approach

* Update debug for supports_op

* clean up debug statements

* Update tests/test-backend-ops.cpp

Co-authored-by: Georgi Gerganov <redacted>
---------

Co-authored-by: Georgi Gerganov <redacted>
ggml/include/ggml.h
ggml/src/ggml-webgpu/ggml-webgpu.cpp
ggml/src/ggml-webgpu/wgsl-shaders/rms_norm.wgsl
ggml/src/ggml-webgpu/wgsl-shaders/soft_max.tmpl.wgsl [new file with mode: 0644]
ggml/src/ggml.c
tests/test-backend-ops.cpp