git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Georgi Gerganov <redacted>
	Sun, 27 Aug 2023 13:40:48 +0000 (16:40 +0300)
committer	GitHub <redacted>
	Sun, 27 Aug 2023 13:40:48 +0000 (16:40 +0300)
commit	eaa13a48ff4136f01c1cdb79cacd61b67ec53095
tree	1e22d465164eb73b72dd6dab345987ea5691e6f2	tree
parent	da7455d0467b5f5cc2e45d0dcffaf098df13db63	commit \| diff

falcon : fix CUDA inference by making K and Q contiguous (#2830)

* falcon : fix CUDA inference by making K and Q contiguous

ggml-ci

* cuda : add assert to guard from non-cont ropes

ggml-cuda.cu		diff \| blob \| history
llama.cpp		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom