]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: faster large batch FA without tensor cores (llama/7314)
authorJohannes Gäßler <redacted>
Fri, 17 May 2024 16:54:52 +0000 (18:54 +0200)
committerGeorgi Gerganov <redacted>
Sun, 16 Jun 2024 15:19:48 +0000 (18:19 +0300)
commitec52f900e433ccbb3db1828015d1dee213c725dd
tree7ea30e19c0022bac268c531f44913fe0cc08e2e1
parent77d708fabbd4bf1c4569bc4067dd785db3a18617
CUDA: faster large batch FA without tensor cores (llama/7314)
ggml-cuda/fattn-tile-f16.cu [new file with mode: 0644]
ggml-cuda/fattn-tile-f16.cuh [new file with mode: 0644]
ggml-cuda/fattn-tile-f32.cu [new file with mode: 0644]
ggml-cuda/fattn-tile-f32.cuh [new file with mode: 0644]
ggml-cuda/fattn.cu