]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
model : add tokenizer from LFM2.5-Audio-1.5B (#19687)
authorTarek Dakhran <redacted>
Thu, 19 Feb 2026 08:54:48 +0000 (09:54 +0100)
committerGitHub <redacted>
Thu, 19 Feb 2026 08:54:48 +0000 (09:54 +0100)
commit8004f3a8d1f9e66ed534bc46f98c3a16c27f8c4a
treec9fb8c9d319cb8a3e2c86e8db889bd88d0f8d7d4
parenteacb4b67a2905d1f53e5f55033f81c4824b49e64
model : add tokenizer from LFM2.5-Audio-1.5B (#19687)

* model : Add tokenizer from LFM2.5-Audio-1.5B

[LFM2.5-Audio-1.5B](https://huggingface.co/LiquidAI/LFM2.5-Audio-1.5B) introduced lightweight audio tokenizer.

Tokenizer based on LFM2 architecture and acts as "embedding" model with
different input `n_embd` and output `n_embd_out`.

To be used in https://github.com/ggml-org/llama.cpp/pull/18641.

To convert use

```shell
python3 convert_hf_to_gguf.py /path/to/LFM2.5-Audio-1.5B/audio_detokenizer
```

* Update convert_hf_to_gguf.py

Co-authored-by: Sigbjørn Skjæret <redacted>
* Formatting

* Rework check for attention layers

* Add LFM2 SWA model support

* Address PR feedback

* Set vocab to none

* Move helper function definitions to cpp file

---------

Co-authored-by: Sigbjørn Skjæret <redacted>
convert_hf_to_gguf.py
src/llama-graph.cpp
src/llama-graph.h
src/llama-model.cpp
src/llama-model.h
src/models/lfm2.cpp
src/models/models.h