git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Max Krasnyansky <redacted>
	Thu, 30 Oct 2025 16:06:13 +0000 (09:06 -0700)
committer	GitHub <redacted>
	Thu, 30 Oct 2025 16:06:13 +0000 (09:06 -0700)
commit	517b7170e1a4d733583c4b07c5b7a49acc05911c
tree	6d8de747282fd662e964777e27b50ce6cce93e8d	tree
parent	835e918d8428f5119927d7150bf5a26176dedda0	commit \| diff

cpu: introduce chunking for repack matmuls and enable matmul-id chunking on ARM64 (#16833)

Very similar implementation to the flash-attention chunking, with similar benefits.

ggml/src/ggml-cpu/ggml-cpu.c		diff \| blob \| history
ggml/src/ggml-cpu/repack.cpp		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom