git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Georgi Gerganov <redacted>
	Fri, 13 Sep 2024 06:53:38 +0000 (09:53 +0300)
committer	GitHub <redacted>
	Fri, 13 Sep 2024 06:53:38 +0000 (09:53 +0300)
commit	0abc6a2c25272d5cf01384dda8ee8bfec4ba8745
tree	ca075a9182e60fab558d7e5ca0d6dc0609426db0
parent	bd35cb0ae357185c173345f10dc89a4ff925fc25

llama : llama_perf + option to disable timings during decode (#9355)

* llama : llama_perf + option to disable timings during decode

ggml-ci

* common : add llama_arg

* Update src/llama.cpp

Co-authored-by: Xuan Son Nguyen <redacted>

* perf : separate functions in the API

ggml-ci

* perf : safer pointer handling + naming update

ggml-ci

* minor : better local var name

* perf : abort on invalid sampler pointer

ggml-ci

---------

Co-authored-by: Xuan Son Nguyen <redacted>
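
The commit title describes performance counters together with an option to disable timings during decode. The C++ sketch below illustrates only that general idea and is not the code from this commit: every name in it (`toy_context`, `no_perf`, `toy_decode`, `toy_perf_reset`, `toy_perf_print`) is hypothetical, and the "decode" step is a placeholder loop.

```cpp
#include <cassert>
#include <chrono>
#include <cstdint>
#include <cstdio>

// NOTE: all names here are hypothetical; this is not the llama.cpp API.
struct perf_counters {
    double  t_eval_ms = 0.0; // accumulated wall time spent in decode
    int32_t n_eval    = 0;   // number of tokens counted by decode
};

struct toy_context {
    bool          no_perf = false; // when true, decode never reads the clock
    perf_counters perf;
};

// Monotonic clock reading in milliseconds.
static double now_ms() {
    using namespace std::chrono;
    return duration<double, std::milli>(steady_clock::now().time_since_epoch()).count();
}

// Placeholder decode step: does some busy work and, unless timings are
// disabled, adds the elapsed time and token count to the counters.
static int toy_decode(toy_context & ctx, int n_tokens) {
    const double t_start = ctx.no_perf ? 0.0 : now_ms();

    volatile int acc = 0;
    for (int i = 0; i < n_tokens * 1000; ++i) {
        acc = acc + 1; // stand-in for real work
    }

    if (!ctx.no_perf) {
        ctx.perf.t_eval_ms += now_ms() - t_start;
        ctx.perf.n_eval    += n_tokens;
    }
    return 0;
}

// Clear the accumulated counters.
static void toy_perf_reset(toy_context & ctx) {
    ctx.perf = perf_counters{};
}

// Print the accumulated counters to stderr.
static void toy_perf_print(const toy_context & ctx) {
    std::fprintf(stderr, "eval time = %.3f ms / %d tokens\n",
                 ctx.perf.t_eval_ms, (int) ctx.perf.n_eval);
}
```

In this sketch, a caller that wants no timing overhead sets `no_perf = true` before calling `toy_decode`; the counters then stay at zero and `toy_perf_print` reports nothing useful. Keeping print and reset as separate functions mirrors the "separate functions in the API" item in the message above only loosely.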

23 files changed:

common/arg.cpp
common/common.cpp
common/common.h
common/sampling.cpp
examples/batched-bench/batched-bench.cpp
examples/batched.swift/Sources/main.swift
examples/batched/batched.cpp
examples/embedding/embedding.cpp
examples/eval-callback/eval-callback.cpp
examples/imatrix/imatrix.cpp
examples/llama-bench/llama-bench.cpp
examples/llava/llava-cli.cpp
examples/llava/minicpmv-cli.cpp
examples/lookup/lookup.cpp
examples/parallel/parallel.cpp
examples/passkey/passkey.cpp
examples/perplexity/perplexity.cpp
examples/retrieval/retrieval.cpp
examples/simple/simple.cpp
examples/speculative/speculative.cpp
include/llama.h
src/llama-sampling.cpp
src/llama.cpp

Packaging of ggml-org/llama.cpp