]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
metal : use FA-vec kernel up to batch size 20 (llama/13496)
authorGeorgi Gerganov <redacted>
Tue, 13 May 2025 15:04:39 +0000 (18:04 +0300)
committerGeorgi Gerganov <redacted>
Mon, 19 May 2025 11:58:39 +0000 (14:58 +0300)
commit08436716aed001d4b3004a8cf676101374ae66eb
tree7db679626c368908aad96d0323d07a35bd80ce77
parente11fc21e6cb8ff4a38cffa534be85bf867f1a232
metal : use FA-vec kernel up to batch size 20 (llama/13496)

* batched-bench : fix pp batch contents

* metal : optimize multi-sequence FA vec kernel

ggml-ci

* metal : use FA-vec kernel up to batch size 20

ggml-ci
ggml/src/ggml-metal/ggml-metal.m