git.djapps.eu Git - pkg/ggml/sources/ggml/commit

author	slaren <redacted>
	Tue, 26 Dec 2023 20:23:59 +0000 (21:23 +0100)
committer	Georgi Gerganov <redacted>
	Wed, 27 Dec 2023 09:40:50 +0000 (11:40 +0200)
commit	49e47713deb6a9d1b6290da804a5391c85b79b8a
tree	a349b28350d48032447708f0ca3eb446f06f5523
parent	175434bd592409678f20cdf8c6e8b2131c5827d5

cuda : fix vmm pool with multi GPU (llama/4620)

* cuda : fix vmm pool with multi GPU

* hip

* use recommended granularity instead of minimum

* better error checking

* fix mixtral

* use cudaMemcpy3DPeerAsync

* use cuda_pool_alloc in ggml_cuda_op_mul_mat

* consolidate error checking in ggml_cuda_set_device

* remove unnecessary inlines

ggml-ci

* style fixes

* only use vmm for the main device

* fix scratch buffer size, re-enable vmm pool for all devices

* remove unnecessary check id != g_main_device

src/ggml-cuda.cu
src/ggml.c