ggml-cuda : use graph allocator (#2684)
author slaren <redacted>
Tue, 22 Aug 2023 13:25:19 +0000 (15:25 +0200)
committer GitHub <redacted>
Tue, 22 Aug 2023 13:25:19 +0000 (15:25 +0200)
commit 1123f7fbdfb8012e46f05e903e6f675922916378
tree 27f3700a672e8f0d09d86797ce1c199ff72a4d51
parent ef3f333d3775600d1646a9fa249aca532d15fb89

use a different function for no_alloc to avoid breaking backwards compatibility; fixes LoRA

remove the 512 n_batch limit

fix 2048 batch size

cleanup

Co-authored-by: Johannes Gäßler <redacted>
common/common.cpp
ggml-cuda.cu
ggml-cuda.h
llama.cpp