git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	fairydreaming <redacted>
	Thu, 23 May 2024 09:49:53 +0000 (11:49 +0200)
committer	GitHub <redacted>
	Thu, 23 May 2024 09:49:53 +0000 (11:49 +0200)
commit	9b82476ee9e73065a759f8bcc4cf27ec7ab2ed8c
tree	d4881d12bc7e60750f90e642e3fabbdf4029fc53	tree
parent	a61a94e543e3c6877c087e80fca27a0313ce5fd5	commit \| diff

Add missing inference support for GPTNeoXForCausalLM (Pythia and GPT-NeoX base models) (#7461)

* convert-hf : add conversion of bloom-style qkv tensor to gpt-style qkv (code borrowed from BloomModel)

* llama : add inference support for LLM_ARCH_GPTNEOX

* llama : add model types for every Pythia variant and GPT-NeoX

Co-authored-by: Stanisław Szymczyk <redacted>

convert-hf-to-gguf.py		diff \| blob \| history
llama.cpp		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom