git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Douglas Hanley <redacted>
	Fri, 21 Jun 2024 05:38:22 +0000 (00:38 -0500)
committer	GitHub <redacted>
	Fri, 21 Jun 2024 05:38:22 +0000 (08:38 +0300)
commit	80ea089d771f0c2d97afa8bead80ded412f600d7
tree	25c04a967b5913ffdc00d1a851dcfbeb9ab37a37
parent	0e64591e8290037db6412665a56354b789a0597e

llama : allow pooled embeddings on any model (#7477)

* create append_pooling operation; allow to specify attention_type; add last token pooling; update examples

* find result_norm/result_embd tensors properly; update output allocation logic

* only use embd output for pooling_type NONE

* get rid of old causal_attn accessor

* take out attention_type; add in llama_set_embeddings

* bypass logits when doing non-NONE pooling
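
For readers unfamiliar with the pooling modes the message refers to, here is a minimal sketch of what mean, CLS (first token), and last-token pooling compute over a matrix of per-token embeddings. It is a plain CPU illustration only: the `pooling` enum and `pool_embeddings` function are hypothetical names invented for this example and are not part of llama.h, and the commit itself appends pooling to the compute graph rather than looping on the host like this.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical enum for illustration; it mirrors the pooling modes
// discussed in the commit message, not the actual llama.h definitions.
enum class pooling { mean, cls, last };

// Reduce a row-major [n_tokens x n_embd] matrix of per-token embeddings
// to a single n_embd-sized vector for the whole sequence.
std::vector<float> pool_embeddings(const std::vector<float> & tok,
                                   size_t n_embd, pooling type) {
    assert(n_embd > 0 && !tok.empty() && tok.size() % n_embd == 0);
    const size_t n_tokens = tok.size() / n_embd;

    std::vector<float> out(n_embd, 0.0f);
    switch (type) {
        case pooling::mean:
            // average every embedding dimension over all tokens
            for (size_t t = 0; t < n_tokens; ++t) {
                for (size_t i = 0; i < n_embd; ++i) {
                    out[i] += tok[t * n_embd + i];
                }
            }
            for (float & v : out) {
                v /= static_cast<float>(n_tokens);
            }
            break;
        case pooling::cls:
            // take the embedding of the first token
            for (size_t i = 0; i < n_embd; ++i) {
                out[i] = tok[i];
            }
            break;
        case pooling::last:
            // take the embedding of the final token, which for a causal
            // model is the only position that has attended to the whole input
            for (size_t i = 0; i < n_embd; ++i) {
                out[i] = tok[(n_tokens - 1) * n_embd + i];
            }
            break;
    }
    return out;
}
```

With no pooling at all (pooling type NONE in the message), the caller would instead receive the full per-token matrix unchanged, which is why that case is handled separately from the pooled ones.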

Packaging of ggml-org/llama.cpp

common/common.cpp
examples/embedding/embedding.cpp
examples/gritlm/gritlm.cpp
examples/retrieval/retrieval.cpp
llama.cpp
llama.h