]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
llama : add gguf_remove_key + remove split meta during quantize (#6591)
authorjiez <redacted>
Fri, 12 Apr 2024 10:45:06 +0000 (18:45 +0800)
committerGitHub <redacted>
Fri, 12 Apr 2024 10:45:06 +0000 (13:45 +0300)
commit91c736015b66ba1d0b82cbae6313b6d5eaa61b68
tree098b60b95e78a1062daf0fe2b362de506eb23df7
parent5c4d767ac028c0f9c31cba3fceaf765c6097abfc
llama : add gguf_remove_key + remove split meta during quantize (#6591)

* Remove split metadata when quantize model shards

* Find metadata key by enum

* Correct loop range for gguf_remove_key and code format

* Free kv memory

---------

Co-authored-by: z5269887 <redacted>
ggml.c
ggml.h
llama.cpp