git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
model: support arch `DbrxForCausalLM` (#6515)
author Pierrick Hymbert <redacted>
Sat, 13 Apr 2024 09:33:52 +0000 (11:33 +0200)
committer GitHub <redacted>
Sat, 13 Apr 2024 09:33:52 +0000 (11:33 +0200)
commit 4bd0f93e4ab4fe6682e7d0241c1bdec1397e954a
tree da912ccbf957473fb5aa6868c9cd73f0fcc42e63
parent ab9a3240a9da941fdef5cd4a25f2b97c2f5a67aa
model: support arch `DbrxForCausalLM` (#6515)

* model: dbrx convert to gguf
#6344

* llama: support dbrx
#6344

* doc: dbrx: add the model as supported

* scripts: get-wikitext-2 add unzip

* llama: increase maximum experts allowed

* llama: factorize moe graph implementation between grok, mixtral and dbrx

---------

Co-authored-by: Megha Agarwal <redacted>
README.md
convert-hf-to-gguf.py
examples/eval-callback/eval-callback.cpp
gguf-py/gguf/constants.py
gguf-py/gguf/tensor_mapping.py
llama.cpp
scripts/get-wikitext-2.sh