git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit


author	rmatif <redacted>
	Sat, 16 Aug 2025 08:05:55 +0000 (10:05 +0200)
committer	GitHub <redacted>
	Sat, 16 Aug 2025 08:05:55 +0000 (01:05 -0700)
commit	912ff8c119f01ae029543c7fdf7a84f91a0437a3
tree	56561ca043a5134f5256d429bc3ce5a3ec6c377e
parent	5e6229a8409ac786e62cb133d09f1679a9aec13e

OpenCL: add initial FA support (#14987)

* add F16/F16 fa support

* fix kernel init

* use mad instead of fma

* use inline function

* mark FA with sinks as unsupported for now

* add pragma unroll to loops

ggml/src/ggml-opencl/CMakeLists.txt
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-opencl/kernels/flash_attn_f16.cl	[new file with mode: 0644]
ggml/src/ggml-opencl/kernels/flash_attn_f32.cl	[new file with mode: 0644]
ggml/src/ggml-opencl/kernels/flash_attn_f32_f16.cl	[new file with mode: 0644]

Packaging of ggml-org/llama.cpp
