git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Georgi Gerganov <redacted>
	Tue, 26 Aug 2025 11:22:14 +0000 (14:22 +0300)
committer	GitHub <redacted>
	Tue, 26 Aug 2025 11:22:14 +0000 (14:22 +0300)
commit	b3964c1e890ef8c947afb36a5124ce6fcb2136d4
tree	ba7664d4ae07bda38f443673d34876d6400da612
parent	79a546220c719e6a70627b243a478ab8d84dc9e1

metal : optimize FA vec for large sequences and BS <= 8 (#15566)

* metal : optimize FA vec for large heads and sequences

* metal : adjust small-batch mul mv kernels

ggml-ci

* batched-bench : fix total speed computation

ggml-ci

* cont : add comments

ggml-ci

ggml/src/ggml-metal/ggml-metal-impl.h
ggml/src/ggml-metal/ggml-metal.m
ggml/src/ggml-metal/ggml-metal.metal
tools/batched-bench/batched-bench.cpp

Packaging of ggml-org/llama.cpp
