]>
git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
model : support Step3.5-Flash (#19283)
* Support Step3.5-Flash
* fix: norm.weight + 1 (HF zero_centered=true)
* step35: simplify GGUF conversion + drop redundant rope KVs
* Address review feedback
* rename limits -> clamp
* Apply suggestions from code review
Co-authored-by: Sigbjørn Skjæret <redacted>
* Apply suggestion from @CISC
Co-authored-by: Sigbjørn Skjæret <redacted>
* rename swiglu limits -> swiglu clamp in LLM_KV
* avoid CI fail
* Apply suggestions from code review
* Apply suggestions from code review
* disabled KV shifting for LLM_ARCH_STEP35
* Apply suggestions from code review
* mistakenly removed cmath
* add model size && apply missed suggestion
* assert partial_rotary_factors
* fix CI errors:
* load freq_base_swa
---------
Co-authored-by: lvyichen <redacted>
Co-authored-by: Sigbjørn Skjæret <redacted>
15 files changed: