git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

summary | shortlog | log | commit | commitdiff | tree
(parent: b7d2672)

author	Johannes Gäßler <redacted>
	Wed, 14 May 2025 14:41:02 +0000 (16:41 +0200)
committer	GitHub <redacted>
	Wed, 14 May 2025 14:41:02 +0000 (16:41 +0200)
commit	4696d5674999dc10a7fb8c27b33406a929f7463a
tree	780299c8c82dd9eef9b26269c5f45c688787542a	tree
parent	b7d26720821823e23e2273a99e38398d511242e9	commit \| diff

CUDA: fix crash on large batch size for quant. MoE (#13537)

ggml/src/ggml-cuda/mmq.cu		diff \| blob \| history
ggml/src/ggml-cuda/quantize.cu		diff \| blob \| history

Packaging of ggml-org/llama.cpp