git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Kerfuffle <redacted>
	Mon, 13 Nov 2023 08:58:15 +0000 (01:58 -0700)
committer	GitHub <redacted>
	Mon, 13 Nov 2023 08:58:15 +0000 (01:58 -0700)
commit	bb50a792ec2a49944470c82694fa364345e95170
tree	1ad53a7f00d4cc76a91943a51729806db16988db	tree
parent	21fd874c8d2a14dea2d56724e4357c0824aee6a8	commit \| diff

Add ReLU and SQR CUDA ops to (partially) fix Persimmon offloading (#4041)

* Add ReLU and SQR CUDA ops to fix Persimmon offloading

* Persimmon loader: More helpful error on CUDA/ROCM when offloading too many layers

ggml-cuda.cu		diff \| blob \| history
llama.cpp		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom