]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
ggml webgpu: faster matrix multiplication/matrix-vector multiplication (llama/17031)
authorReese Levine <redacted>
Sat, 8 Nov 2025 03:27:20 +0000 (19:27 -0800)
committerGeorgi Gerganov <redacted>
Sun, 9 Nov 2025 21:38:03 +0000 (23:38 +0200)
commit78ea6c5b67d39faf2f6fc8c005170090394ea8ea
treef920492a487d33b8e2f3da3b2e2d9de6259ffc10
parent547724b0a5d43540255647788b59ca2e78bba27d
ggml webgpu: faster matrix multiplication/matrix-vector multiplication (llama/17031)

* Faster tensors (llama/8)

Add fast matrix and matrix/vector multiplication.

* Use map for shader replacements instead of pair of strings
ggml/src/ggml-webgpu/ggml-webgpu.cpp
ggml/src/ggml-webgpu/wgsl-shaders/embed_wgsl.py
ggml/src/ggml-webgpu/wgsl-shaders/mul_mat.tmpl.wgsl
ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_decls.tmpl [new file with mode: 0644]
ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_reg_tile.tmpl.wgsl [new file with mode: 0644]
ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_subgroup_matrix.tmpl.wgsl [new file with mode: 0644]
ggml/src/ggml-webgpu/wgsl-shaders/mul_mat_vec.tmpl.wgsl [new file with mode: 0644]