git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	Diego Devesa <redacted>
	Sun, 31 Aug 2025 13:49:03 +0000 (06:49 -0700)
committer	Georgi Gerganov <redacted>
	Fri, 5 Sep 2025 09:54:09 +0000 (12:54 +0300)
commit	319bf932319ab96617ffa7ea93c5be2574edbdb0
tree	9191ae0682c6cd482fee2a572422a334af6ce8b0	tree
parent	cd2cdfdad83421a9744e5518619b2c8b9bcd68d0	commit \| diff

llama : separate compute buffer reserve from fattn check (llama/15696)

Exposes ggml_backend_sched_split_graph() to allow splitting the graph without allocating compute buffers and uses it to split the graph for the automatic Flash Attention check.

include/ggml-backend.h		diff \| blob \| history
src/ggml-backend.cpp		diff \| blob \| history

Packaging of ggml-org/ggml

RSS Atom