]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
metal : optimize multi-sequence FA vec kernel (#13493)
authorGeorgi Gerganov <redacted>
Tue, 13 May 2025 15:04:00 +0000 (18:04 +0300)
committerGitHub <redacted>
Tue, 13 May 2025 15:04:00 +0000 (18:04 +0300)
commitc252e0c4097b34666e5a81db9d0450d71fa3098f
tree685a93398bc7134c51bd0cde9756dfdf5d357da0
parent4f711afed5e7ef4304b567c8888ee1aa60e868eb
metal : optimize multi-sequence FA vec kernel (#13493)

* batched-bench : fix pp batch contents

* metal : optimize multi-sequence FA vec kernel

ggml-ci
ggml/src/ggml-metal/ggml-metal.metal