llama : Support llama 4 text-only (#12791)
author     Xuan-Son Nguyen <redacted>
Mon, 7 Apr 2025 21:06:44 +0000 (23:06 +0200)
committer  GitHub <redacted>
Mon, 7 Apr 2025 21:06:44 +0000 (23:06 +0200)
commit     1466621e738779eefe1bb672e17dc55d63d166bb
tree       414f66ff30d3c00e121c6db75dd7e009480b0dca
parent     82974011f312057b446c27267105bd7ad3810599
llama : Support llama 4 text-only (#12791)

* llama4 conversion

* initial support, no chat template

* clean up a bit

* fix tokenizer conversion

* correct hparams

* try this

* fix shexp

* ffn_inp_normed

* chat template

* clean up model conversion

* add_bos

* add scale_before_ffn

* fix order

* weight_before_ffn

* llm_graph_input_attn_temp

* add chunk attn mask

* build_inp_attn_scale()

* add comment about ggml_repeat

* clarify comments

* fix build
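
For context on the items above: llm_graph_input_attn_temp / build_inp_attn_scale() feed a per-position attention temperature into the graph, and the chunk attention mask restricts each token to its local chunk of the context. The following is a minimal standalone sketch of both ideas, not the code added by this commit; the floor_scale = 8192, attn_scale = 0.1, and chunk size = 8192 values are assumptions taken from Llama 4's published configuration.

// Minimal sketch (not the commit's code) of the two Llama 4 attention inputs
// referenced above. Hyperparameters are assumptions from the model config:
// floor_scale = 8192, attn_scale = 0.1, attention chunk size = 8192.
#include <cmath>
#include <cstdio>

// Per-position attention temperature: grows logarithmically with position.
static float attn_temp(int pos, float floor_scale = 8192.0f, float attn_scale = 0.1f) {
    return std::log(std::floor((pos + 1.0f) / floor_scale) + 1.0f) * attn_scale + 1.0f;
}

// Chunked causal mask: token i may attend to token j only if j is not in the
// future and both positions fall in the same chunk of n_chunk tokens.
static bool chunk_mask(int i, int j, int n_chunk = 8192) {
    return j <= i && (i / n_chunk) == (j / n_chunk);
}

int main() {
    for (int pos : {0, 8192, 65536, 1048576}) {
        std::printf("pos=%8d  attn_temp=%.4f\n", pos, attn_temp(pos));
    }
    std::printf("chunk_mask(10000,  100) = %d\n", chunk_mask(10000,  100)); // different chunks -> 0
    std::printf("chunk_mask(10000, 9000) = %d\n", chunk_mask(10000, 9000)); // same chunk      -> 1
    return 0;
}
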
17 files changed:
convert_hf_to_gguf.py
convert_hf_to_gguf_update.py
gguf-py/gguf/constants.py
gguf-py/gguf/gguf_writer.py
include/llama.h
models/ggml-vocab-llama4.gguf.inp [new file with mode: 0644]
models/ggml-vocab-llama4.gguf.out [new file with mode: 0644]
src/llama-arch.cpp
src/llama-arch.h
src/llama-chat.cpp
src/llama-chat.h
src/llama-graph.cpp
src/llama-graph.h
src/llama-hparams.h
src/llama-model.cpp
src/llama-model.h
src/llama-vocab.cpp