git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Tarek Dakhran <redacted>
	Thu, 19 Feb 2026 08:54:48 +0000 (09:54 +0100)
committer	GitHub <redacted>
	Thu, 19 Feb 2026 08:54:48 +0000 (09:54 +0100)
commit	8004f3a8d1f9e66ed534bc46f98c3a16c27f8c4a
tree	c9fb8c9d319cb8a3e2c86e8db889bd88d0f8d7d4
parent	eacb4b67a2905d1f53e5f55033f81c4824b49e64

model : add tokenizer from LFM2.5-Audio-1.5B (#19687)

* model : Add tokenizer from LFM2.5-Audio-1.5B

[LFM2.5-Audio-1.5B](https://huggingface.co/LiquidAI/LFM2.5-Audio-1.5B) introduced a lightweight audio tokenizer.

The tokenizer is based on the LFM2 architecture and acts as an "embedding" model whose
input width (`n_embd`) differs from its output width (`n_embd_out`).
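
As a rough illustration of what differing input and output widths mean, the sketch below maps per-token vectors of width `n_embd` to vectors of width `n_embd_out`. It is a toy under stated assumptions, not the code from this commit: the helper names `make_projection` and `embed`, the random weights, the sizes, and the choice of a single linear projection are all hypothetical.

```python
import random

def make_projection(n_embd, n_embd_out, seed=0):
    """Hypothetical weight matrix: n_embd_out rows, n_embd columns."""
    rng = random.Random(seed)
    return [[rng.uniform(-1.0, 1.0) for _ in range(n_embd)]
            for _ in range(n_embd_out)]

def embed(hidden, weight):
    """Map each per-token vector of width n_embd to width n_embd_out."""
    return [[sum(w * h for w, h in zip(row, vec)) for row in weight]
            for vec in hidden]

# Illustrative sizes only; these are assumptions, not the model's real values.
n_embd, n_embd_out, n_tokens = 8, 3, 4

hidden = [[1.0] * n_embd for _ in range(n_tokens)]
out = embed(hidden, make_projection(n_embd, n_embd_out))

# One output vector per token, each of width n_embd_out.
print(len(out), len(out[0]))
```

The point is only the shape change: a consumer of the embeddings has to size its buffers by `n_embd_out` rather than `n_embd`.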

To be used in https://github.com/ggml-org/llama.cpp/pull/18641.

To convert, use:

```shell
python3 convert_hf_to_gguf.py /path/to/LFM2.5-Audio-1.5B/audio_detokenizer
```

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <redacted>

* Formatting

* Rework check for attention layers

* Add LFM2 SWA model support

* Address PR feedback

* Set vocab to none

* Move helper function definitions to cpp file

---------

Co-authored-by: Sigbjørn Skjæret <redacted>

convert_hf_to_gguf.py
src/llama-graph.cpp
src/llama-graph.h
src/llama-model.cpp
src/llama-model.h
src/models/lfm2.cpp
src/models/models.h