git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	Johannes Gäßler <redacted>
	Wed, 23 Jul 2025 10:35:53 +0000 (12:35 +0200)
committer	Georgi Gerganov <redacted>
	Thu, 24 Jul 2025 17:57:40 +0000 (20:57 +0300)
commit	bf82b8786519124d4d370a9176b82c14b58e22b1
tree	0ac4ed5f9a12bcdb1f0fefe11178f34e3aefbe29	tree
parent	56c9cd2bab7f3c0befee70ad48672d0003fa6e91	commit \| diff

CUDA: fix quantized KV cache + multiple sequences (llama/14822)

* CUDA: fix quantized KV cache + multiple sequences

* Update src/ggml-cuda/fattn-common.cuh

Co-authored-by: Georgi Gerganov <redacted>
---------

Co-authored-by: Georgi Gerganov <redacted>

src/ggml-cuda/convert.cu		diff \| blob \| history
src/ggml-cuda/fattn-common.cuh		diff \| blob \| history

Packaging of ggml-org/ggml

RSS Atom