]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
ggml : more perfo with llamafile tinyblas on x86_64 (llama/10714)
authorDjip007 <redacted>
Tue, 24 Dec 2024 17:54:49 +0000 (18:54 +0100)
committerGeorgi Gerganov <redacted>
Fri, 3 Jan 2025 12:00:38 +0000 (14:00 +0200)
commit0aadb9fce17b79b191202982287396378f110317
tree9a75b98558dd125764eb76c0c848d96603a1ec3b
parent2a3325f937146b383b79058a2f183ff57232afd4
ggml : more perfo with llamafile tinyblas on x86_64 (llama/10714)

* more perfo with llamafile tinyblas on x86_64.

- add bf16 suport
- change dispache strategie (thanks:
https://github.com/ikawrakow/ik_llama.cpp/pull/71 )
- reduce memory bandwidth

simple tinyblas dispache and more cache freindly

* tinyblas dynamic dispaching

* sgemm: add M blocs.

* - git 2.47 use short id of len 9.
- show-progress is not part of GNU Wget2

* remove not stable test
src/ggml-cpu/ggml-cpu.c
src/ggml-cpu/llamafile/sgemm.cpp
src/ggml-cpu/llamafile/sgemm.h