git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
gemma2: add sliding window mask (#8227)
author Xuan Son Nguyen <redacted>
Mon, 1 Jul 2024 16:48:34 +0000 (18:48 +0200)
committer GitHub <redacted>
Mon, 1 Jul 2024 16:48:34 +0000 (18:48 +0200)
commit 49122a873f54615626d1b49a2a39013ed4be98d5
tree 26edc7f818cf4b3a2e15e419c801004c243b305f
parent 0ddeff10230b88f1fa9866bbe5fe0d71ba2323a0

* gemma2: add sliding window mask

* fix data_swa uninitialized

* better naming

* add co-author

Co-authored-by: Arlo Phoenix <redacted>

* replace list with single tensor

* update

* llama : minor styling

* convert : add sanity check for query_pre_attn_scalar

* fix small typo in README

---------

Co-authored-by: Arlo Phoenix <redacted>
Co-authored-by: Georgi Gerganov <redacted>
README.md
convert-hf-to-gguf.py
gguf-py/gguf/constants.py
gguf-py/gguf/gguf_writer.py
src/llama.cpp
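
For readers unfamiliar with the term in the commit title, the following is a minimal sketch of a sliding window attention mask. It is not the code from this commit. It assumes the common definition in which a query at position i may attend to a key at position j only when j <= i (causal) and i - j < window. The function name `sliding_window_mask` and its parameters are hypothetical and chosen for illustration only.

```python
import math


def sliding_window_mask(n_tokens: int, window: int) -> list[list[float]]:
    """Build an additive attention mask: 0.0 = visible, -inf = hidden.

    Hypothetical helper for illustration; not taken from llama.cpp.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    mask = []
    for i in range(n_tokens):  # i: query position
        row = []
        for j in range(n_tokens):  # j: key position
            # Visible only if the key is not in the future (causal)
            # and lies within the last `window` positions.
            visible = j <= i and (i - j) < window
            row.append(0.0 if visible else -math.inf)
        mask.append(row)
    return mask


if __name__ == "__main__":
    # Print a small mask: "." marks visible positions, "x" hidden ones.
    for row in sliding_window_mask(n_tokens=5, window=2):
        print(" ".join("." if v == 0.0 else "x" for v in row))
```

The mask is additive, so it would be summed with the attention scores before the softmax; hidden positions then receive zero weight. With a window at least as large as the sequence, the result reduces to an ordinary causal mask.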