ggml : more performance with llamafile tinyblas on x86_64 (#10714)
author    Djip007 <redacted>
          Tue, 24 Dec 2024 17:54:49 +0000 (18:54 +0100)
committer GitHub <redacted>
          Tue, 24 Dec 2024 17:54:49 +0000 (18:54 +0100)
commit    2cd43f4900ba0e34124fdcbf02a7f9df25a10a3d
tree      68007c7db007f7b21c3b68b59c96f31d6bf6a7c6
parent    09fe2e76137dde850b13313f720e7ffa17efdefa
ggml : more performance with llamafile tinyblas on x86_64 (#10714)

* more performance with llamafile tinyblas on x86_64.

- add bf16 support
- change dispatch strategy (thanks:
https://github.com/ikawrakow/ik_llama.cpp/pull/71 )
- reduce memory bandwidth

simpler tinyblas dispatch and more cache friendly

* tinyblas dynamic dispatching

* sgemm: add M blocks.

* - git 2.47 uses short ids of length 9.
- show-progress is not part of GNU Wget2

* remove unstable test
examples/server/tests/unit/test_completion.py
ggml/src/ggml-cpu/ggml-cpu.c
ggml/src/ggml-cpu/llamafile/sgemm.cpp
ggml/src/ggml-cpu/llamafile/sgemm.h
scripts/compare-llama-bench.py
scripts/hf.sh