git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	leejet <redacted>
	Sun, 26 Oct 2025 18:13:31 +0000 (02:13 +0800)
committer	Georgi Gerganov <redacted>
	Sat, 1 Nov 2025 07:41:35 +0000 (09:41 +0200)
commit	ebc499315cfac3b56ccebe3fdd9f006989815d28
tree	c8342dcb75149b624059e7beaedcdb30485f9849	tree
parent	d8ba8a34a68048f5b667017e776d168f54daee26	commit \| diff

ggml: fix cuda kernel launch configuration for k_compute_batched_ptrs to support large batch (llama/16744)

* fix k_compute_batched_ptrs

* add backend ops test

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Johannes Gäßler <redacted>
* reduce the batch size

---------

Co-authored-by: Johannes Gäßler <redacted>

src/ggml-cuda/ggml-cuda.cu		diff \| blob \| history
tests/test-backend-ops.cpp		diff \| blob \| history

Packaging of ggml-org/ggml

RSS Atom