git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
llama : add gguf_remove_key + remove split meta during quantize (llama/6591)
author jiez <redacted>
Fri, 12 Apr 2024 10:45:06 +0000 (18:45 +0800)
committer Georgi Gerganov <redacted>
Mon, 13 May 2024 08:02:26 +0000 (11:02 +0300)
commit 60f3713026a76ea6e196bb187df9dcdfb63fc94e
tree ad0d694e05b4603a6bd06f939a297b31594ed686
parent 37e6757453d4157bf0588e1f65e31931d3849628

* Remove split metadata when quantizing model shards

* Find metadata key by enum

* Correct loop range for gguf_remove_key and code format

* Free kv memory
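
The diff itself is not shown on this page, so the following is only a minimal sketch of the technique the bullets describe: look up a key, free the memory the entry owns, shift the remaining entries down with a loop that stops one short of the end, then shrink the array. The `toy_kv` and `toy_ctx` types and every `toy_*` function are hypothetical stand-ins, not the ggml structures or the actual `gguf_remove_key` code, and the key names in the usage note are illustrative.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-ins for illustration only; these are NOT the ggml types. */
struct toy_kv {
    char * key;
    char * value;
};

struct toy_ctx {
    struct toy_kv * kv;   /* heap array of n_kv entries */
    int             n_kv;
};

/* Portable string duplicate (strdup is POSIX, not ISO C). */
static char * toy_strdup(const char * s) {
    size_t n = strlen(s) + 1;
    char * p = malloc(n);
    if (p) memcpy(p, s, n);
    return p;
}

/* Return the index of key, or -1 if it is absent. */
static int toy_find_key(const struct toy_ctx * ctx, const char * key) {
    for (int i = 0; i < ctx->n_kv; ++i) {
        if (strcmp(ctx->kv[i].key, key) == 0) return i;
    }
    return -1;
}

/* Append a key/value pair; returns 0 on success, -1 on allocation failure. */
static int toy_set(struct toy_ctx * ctx, const char * key, const char * value) {
    char * k = toy_strdup(key);
    char * v = toy_strdup(value);
    struct toy_kv * grown =
        realloc(ctx->kv, (size_t)(ctx->n_kv + 1) * sizeof(*grown));
    if (grown) ctx->kv = grown;   /* keep the array valid even on partial failure */
    if (!k || !v || !grown) {
        free(k);
        free(v);
        return -1;
    }
    ctx->kv[ctx->n_kv].key   = k;
    ctx->kv[ctx->n_kv].value = v;
    ctx->n_kv++;
    return 0;
}

/* Remove key if present; returns 1 if an entry was removed, 0 otherwise. */
static int toy_remove_key(struct toy_ctx * ctx, const char * key) {
    const int idx = toy_find_key(ctx, key);
    if (idx < 0) return 0;

    /* Free the memory owned by the entry before overwriting its slot. */
    free(ctx->kv[idx].key);
    free(ctx->kv[idx].value);

    /* Stop at n_kv - 1: the body reads kv[i + 1], so i must not reach the last slot. */
    for (int i = idx; i < ctx->n_kv - 1; ++i) {
        ctx->kv[i] = ctx->kv[i + 1];
    }
    ctx->n_kv--;

    if (ctx->n_kv == 0) {
        free(ctx->kv);
        ctx->kv = NULL;
    } else {
        /* Shrinking is best effort; on failure the old block is still valid. */
        struct toy_kv * shrunk =
            realloc(ctx->kv, (size_t)ctx->n_kv * sizeof(*shrunk));
        if (shrunk) ctx->kv = shrunk;
    }
    return 1;
}

/* Release every entry and the array itself. */
static void toy_free(struct toy_ctx * ctx) {
    for (int i = 0; i < ctx->n_kv; ++i) {
        free(ctx->kv[i].key);
        free(ctx->kv[i].value);
    }
    free(ctx->kv);
    ctx->kv   = NULL;
    ctx->n_kv = 0;
}
```

With a context holding `general.name`, `split.no` and `split.count`, calling `toy_remove_key(&ctx, "split.no")` leaves two entries and moves `split.count` down to index 1. Removing a key that is not present is a no-op that returns 0.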

---------

Co-authored-by: z5269887 <redacted>
ggml.c
ggml.h