llama : support batched embeddings (#5466)
author     Douglas Hanley <redacted>
           Tue, 13 Feb 2024 12:06:58 +0000 (06:06 -0600)
committer  GitHub <redacted>
           Tue, 13 Feb 2024 12:06:58 +0000 (14:06 +0200)
commit     03bf161eb6dea6400ee49c6dc6b69bdcfa9fd3fc
tree       49320ac8aca35d2ba8162c2a280924bacbd7e06b
parent     ad014bba97ef6ef6c3e2f78b2fc463e91ae94579
llama : support batched embeddings (#5466)

* batched embedding: pool outputs by sequence id; update the embedding example accordingly (see the usage sketch after this message)

* bring back non-causal attention

* embd : minor improvements

* llama : minor

---------

Co-authored-by: Georgi Gerganov <redacted>
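
A usage sketch, not part of the commit itself: the core idea is that several prompts can be packed into a single llama_batch with distinct seq_id tags, decoded once, and read back as one pooled embedding per sequence. The C++ below follows the shape of the updated examples/embedding/embedding.cpp, assuming an embedding model with a pooling layer and the llama.h API roughly as it stood around this commit (llama_backend_init(bool), cparams.embedding, llama_get_embeddings_ith); some of these names changed in later versions, and the model path and token ids are placeholders.

#include "llama.h"

#include <cstdio>
#include <vector>

int main() {
    llama_backend_init(false); // numa = false; signature as of this commit

    llama_model_params mparams = llama_model_default_params();
    llama_model * model = llama_load_model_from_file("model.gguf", mparams); // placeholder path

    llama_context_params cparams = llama_context_default_params();
    cparams.embedding = true; // request embeddings instead of logits
    llama_context * ctx = llama_new_context_with_model(model, cparams);

    // two toy sequences of already-tokenized prompts (token ids are placeholders)
    std::vector<std::vector<llama_token>> prompts = { { 1, 2, 3 }, { 1, 4, 5, 6 } };

    // pack all sequences into a single batch, tagging each token with its seq_id
    llama_batch batch = llama_batch_init(512, 0, 1);
    for (int32_t s = 0; s < (int32_t) prompts.size(); ++s) {
        for (size_t i = 0; i < prompts[s].size(); ++i) {
            const int32_t j = batch.n_tokens;
            batch.token   [j]    = prompts[s][i];
            batch.pos     [j]    = (llama_pos) i;
            batch.n_seq_id[j]    = 1;
            batch.seq_id  [j][0] = s;
            batch.logits  [j]    = true;
            batch.n_tokens++;
        }
    }

    // one decode call handles all sequences
    if (llama_decode(ctx, batch) < 0) {
        fprintf(stderr, "decode failed\n");
        return 1;
    }

    // outputs are pooled by sequence id: one n_embd-sized vector per sequence
    const int n_embd = llama_n_embd(model);
    for (int32_t s = 0; s < (int32_t) prompts.size(); ++s) {
        const float * emb = llama_get_embeddings_ith(ctx, s);
        printf("seq %d: n_embd = %d, emb[0] = %f\n", s, n_embd, emb[0]);
    }

    llama_batch_free(batch);
    llama_free(ctx);
    llama_free_model(model);
    llama_backend_free();
    return 0;
}
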
convert-hf-to-gguf.py
examples/embedding/embedding.cpp
gguf-py/gguf/constants.py
gguf-py/gguf/gguf_writer.py
llama.cpp
llama.h