git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	Ruben Ortlam <redacted>
	Sun, 1 Mar 2026 16:32:14 +0000 (17:32 +0100)
committer	Georgi Gerganov <redacted>
	Sun, 15 Mar 2026 19:50:13 +0000 (21:50 +0200)
commit	162fd7c136861a1ce1d97177b031526a6428edc4
tree	5a13d292dff34d87813730bae85e93539cb30d64	tree
parent	f4eb7267dad77f0b38ea107d734cb20102b37656	commit \| diff

vulkan: improve partial offloading performance on AMD (llama/19976)

* vulkan: fix and enable cpy_tensor_async function

* use transfer_queue for async transfers on AMD, synchronize with timeline semaphore

* update offload_op logic

* fix missing transfer submission

* disable async transfer queue on AMD GCN

* revert op batch size change

* fix cpy_tensor_async checks

src/ggml-vulkan/ggml-vulkan.cpp

diff | blob | history

Packaging of ggml-org/ggml

RSS Atom