git.djapps.eu Git - pkg/ggml/sources/ggml/commit


author	slaren <redacted>
	Mon, 18 Mar 2024 10:03:04 +0000 (11:03 +0100)
committer	Georgi Gerganov <redacted>
	Wed, 27 Mar 2024 11:20:00 +0000 (13:20 +0200)
commit	952fb4cc11830060625f7dc23e3026030bc42f1b
tree	aa95bebb8f3893393b70d583accaf4e9ba73c90b
parent	e1998f7365a1e7588b3c1ed93c9ce9d991f370b8

backend : offload large batches to GPU (llama/6083)

* backend : offload large batches to GPU

* fix hip

* code cleanup

* fix CUDA split buffers

* Update ggml-backend-impl.h

Co-authored-by: Johannes Gäßler <redacted>
* cuda : fix memset without set_device

* imatrix : remove sched affix from weight names

* sched : add a new split if the current one has too many inputs
reduce max inputs per split
more cleanup

* update backends

ggml-ci

---------

Co-authored-by: Johannes Gäßler <redacted>

include/ggml/ggml-backend.h
src/ggml-alloc.c
src/ggml-backend-impl.h
src/ggml-backend.c
src/ggml-cuda.cu
src/ggml-cuda.h
src/ggml-kompute.cpp
src/ggml-metal.m
src/ggml-sycl.cpp
src/ggml-vulkan.cpp
src/ggml.c

Packaging of ggml-org/ggml
