git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	Georgi Gerganov <redacted>
	Sat, 14 Feb 2026 10:57:36 +0000 (12:57 +0200)
committer	Georgi Gerganov <redacted>
	Sat, 14 Feb 2026 22:20:18 +0000 (00:20 +0200)
commit	f61050d0c0771749179486f1672d4b0b43f97637
tree	0552524a8b2abd47de17422e4ac23b44970973c7	tree
parent	d07b0e5a9575a6faff2054eec7595c2f7645b34c	commit \| diff

models : optimize qwen3next graph (llama/19375)

* models : optimizing qwen3next graph

* cont

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* wip

* cont : remove redundant q, g chunking

* minor

* minor

* avoid passing masks around

* avoid concats during chunking

* naming + shapes

* update names and use prefix to disable CUDA graphs

src/ggml-cuda/ggml-cuda.cu		diff \| blob \| history
src/ggml-metal/ggml-metal-common.cpp		diff \| blob \| history

Packaging of ggml-org/ggml

RSS Atom