git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: attention sinks for mma FlashAttention (#15157)
author    Johannes Gäßler <redacted>
          Fri, 8 Aug 2025 06:19:58 +0000 (08:19 +0200)
committer GitHub <redacted>
          Fri, 8 Aug 2025 06:19:58 +0000 (08:19 +0200)
commit 1425f587a82bc303469b5c32759a2746ba4e1e20
tree   e9dc1fc1b17f1748fac23ed90108dcfff22bbb23
parent aaa3d07ae749b781d6135eaff23c7fa8a4ab404a
Files changed:
ggml/src/ggml-cuda/fattn-mma-f16.cuh
ggml/src/ggml-cuda/fattn.cu
ggml/src/ggml-cuda/ggml-cuda.cu
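
For context, an attention sink is an extra per-head logit that participates in the softmax denominator but contributes no output weight, so it absorbs part of the probability mass; this commit wires that into the mma-based FlashAttention CUDA kernel. Below is a minimal Python sketch of just the numerics (not the CUDA kernel); the function name `attn_weights_with_sink` is hypothetical, and the running-max subtraction mirrors the numerically stable softmax used in FlashAttention-style kernels:

```python
import math

def attn_weights_with_sink(scores, sink):
    """Softmax over KV scores with an extra per-head 'sink' logit.

    The sink enters the (numerically stable) denominator but yields
    no output weight, so the returned weights sum to less than 1 and
    the sink absorbs the remaining probability mass.
    """
    m = max(sink, max(scores))            # stable max, sink included
    exps = [math.exp(s - m) for s in scores]
    denom = math.exp(sink - m) + sum(exps)
    return [e / denom for e in exps]
```

With `sink = -inf` the extra denominator term vanishes and this reduces to a plain softmax, which is one easy sanity check for a kernel implementing the same math.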