]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: faster tile FA (Pascal/AMD), headsize 256 (llama/15769)
authorJohannes Gäßler <redacted>
Sat, 6 Sep 2025 22:26:28 +0000 (00:26 +0200)
committerGeorgi Gerganov <redacted>
Sat, 20 Sep 2025 10:42:50 +0000 (13:42 +0300)
commitcd70d896285be4653bf0bf474625158c6439bae7
tree653ca654048873eb9c30cecde4cc14d563aa4265
parentbe2676bb1c33904ae106771bbbe8381a6ff73b90
CUDA: faster tile FA (Pascal/AMD), headsize 256 (llama/15769)
ggml/src/ggml-cuda/fattn-tile-f16.cu [deleted file]
ggml/src/ggml-cuda/fattn-tile-f16.cuh [deleted file]
ggml/src/ggml-cuda/fattn-tile-f32.cu [deleted file]
ggml/src/ggml-cuda/fattn-tile-f32.cuh [deleted file]
ggml/src/ggml-cuda/fattn-tile.cu [new file with mode: 0644]
ggml/src/ggml-cuda/fattn-tile.cuh [new file with mode: 0644]
ggml/src/ggml-cuda/fattn.cu