]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
metal: optimise `GGML_OP_SUM` (llama/16559)
authorSam/Samuel <redacted>
Wed, 15 Oct 2025 14:05:56 +0000 (23:05 +0900)
committerGeorgi Gerganov <redacted>
Wed, 22 Oct 2025 09:58:11 +0000 (12:58 +0300)
commitd8a146b0f9a1af396e1812e3fc6859483752dab1
tree977d8e9bdc3d9ac79e151b89e55f926ad8290aa8
parent0c9d49927c3e90949e4c9db5f44583ad4ba7660a
metal: optimise `GGML_OP_SUM` (llama/16559)

* optimise GGML_OP_SUM

* add non-contiguous tests by permuting the input

* change tests to require full contiguity of OP_SUM

* cuda : add check GGML_OP_SUM

---------

Co-authored-by: Georgi Gerganov <redacted>
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-metal/ggml-metal-device.m
ggml/src/ggml-metal/ggml-metal-ops.cpp
ggml/src/ggml-metal/ggml-metal.metal