git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: app option to compile without FlashAttention (llama/12025)
author Johannes Gäßler <redacted>
Sat, 22 Feb 2025 19:44:34 +0000 (20:44 +0100)
committer Georgi Gerganov <redacted>
Thu, 27 Feb 2025 06:55:36 +0000 (08:55 +0200)
commit 38ac47cd4d8876e28f509254f9c70e896c89466c
tree 4b96f692450d3bca8763833b723a40a7b444093d
parent 2d70cd36d7228fdfd911b247bd315b0aa2937ca9
12 files changed:
ggml/CMakeLists.txt
ggml/src/ggml-cuda/CMakeLists.txt
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn-tile-f16.cu
ggml/src/ggml-cuda/fattn-tile-f32.cu
ggml/src/ggml-cuda/fattn-vec-f16.cuh
ggml/src/ggml-cuda/fattn-vec-f32.cuh
ggml/src/ggml-cuda/fattn-wmma-f16.cu
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-hip/CMakeLists.txt
ggml/src/ggml-musa/CMakeLists.txt