git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
HIP: implement FlashAttention via rocWMMA for CDNA and RDNA3+ (llama/12032)
author David Huang <redacted>
Mon, 3 Mar 2025 21:10:54 +0000 (05:10 +0800)
committer Georgi Gerganov <redacted>
Sat, 8 Mar 2025 13:13:01 +0000 (15:13 +0200)
commit edd1d8686a34eb30c3be340a0167d3b928dde60d
tree 92e322fa93fce7ef4f8ec3d7938a37c2a69ecd3c
parent dc6f4e7c05fcc12cf6cf99aaf2e8fb6e2a641c1c

Adds GGML_HIP_ROCWMMA_FATTN and rocwmma header check
Adds rocWMMA support to fattn-wmma-f16
ggml/CMakeLists.txt
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-wmma-f16.cu
ggml/src/ggml-cuda/fattn.cu
ggml/src/ggml-hip/CMakeLists.txt