]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
OpenCL: add initial FA support (#14987)
authorrmatif <redacted>
Sat, 16 Aug 2025 08:05:55 +0000 (10:05 +0200)
committerGitHub <redacted>
Sat, 16 Aug 2025 08:05:55 +0000 (01:05 -0700)
commit912ff8c119f01ae029543c7fdf7a84f91a0437a3
tree56561ca043a5134f5256d429bc3ce5a3ec6c377e
parent5e6229a8409ac786e62cb133d09f1679a9aec13e
OpenCL: add initial FA support (#14987)

* add F16/F16 fa support

* fix kernel init

* use mad instead of fma

* use inline function

* mark FA with sinks as unsupported for now

* add pragma unroll to loops
ggml/src/ggml-opencl/CMakeLists.txt
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/flash_attn_f16.cl [new file with mode: 0644]
ggml/src/ggml-opencl/kernels/flash_attn_f32.cl [new file with mode: 0644]
ggml/src/ggml-opencl/kernels/flash_attn_f32_f16.cl [new file with mode: 0644]