git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Aman Gupta <redacted>
	Sat, 24 Jan 2026 06:25:20 +0000 (14:25 +0800)
committer	GitHub <redacted>
	Sat, 24 Jan 2026 06:25:20 +0000 (14:25 +0800)
commit	81ab64f3c858c0db8c7c3a6bccd4cbbe624f52a3
tree	c29786be85199c33de26e0e7a20c42757ffcb78c	tree
parent	8af1f5f430baaab1719db8f0a259bcc2a1cfdaa0	commit \| diff

ggml-cuda: enable cuda-graphs for `n-cpu-moe` (#18934)

* ggml-cuda: add split-wise cuda graph

* add n-cpu-moe compare_llama_bench.py

* fix hip/musa builds

ggml/src/ggml-cuda/common.cuh		diff \| blob \| history
ggml/src/ggml-cuda/ggml-cuda.cu		diff \| blob \| history
ggml/src/ggml-cuda/mean.cu		diff \| blob \| history
scripts/compare-llama-bench.py		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom