ggml : full ALiBi support (#7192)
author Georgi Gerganov <redacted>
Sat, 11 May 2024 07:32:41 +0000 (10:32 +0300)
committer GitHub <redacted>
Sat, 11 May 2024 07:32:41 +0000 (10:32 +0300)
commit 9cb317f77e53067f7a138cc89ef7657148eae8e6
tree 3ba1d2d80d1d7c8b4ab01f6396a3febaae26e91b
parent e849648888a11de13aaaa4cb2eda3f5a9c7b444d
ggml : full ALiBi support (#7192)

* ggml : full ALiBi support

* ggml : update ggml_soft_max_ext() CUDA, SYCL

* ggml : ggml_flash_attn_ext() support ALiBi (CPU)

* ggml : ggml_flash_attn_ext() support ALiBi (Metal)

* ggml : fix warning

* ggml : ggml_flash_attn_ext() support ALiBi (CUDA)

ggml-ci

* ggml : fix assert message

* vulkan : add dev notes

* ggml : require mask when using ALiBi

ggml-ci

* convert : fix conversion for Refact models
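For context, the ALiBi mechanism that this commit folds into `ggml_soft_max_ext()` and `ggml_flash_attn_ext()` adds a linear, per-head position bias to the attention scores before the softmax. The sketch below is a minimal standalone illustration of that math (the standard ALiBi slope schedule from Press et al.), not the ggml implementation; the function names are hypothetical:

```python
import math

def alibi_slopes(n_heads):
    # Standard ALiBi slope schedule for a power-of-two head count:
    # m_h = 2^(-8h/n) for h = 1..n. Heads with larger h decay faster.
    return [2.0 ** (-8.0 * h / n_heads) for h in range(1, n_heads + 1)]

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    es = [math.exp(x - m) for x in xs]
    s = sum(es)
    return [e / s for e in es]

def alibi_softmax_row(scores, slope, query_pos):
    # Add the linear distance penalty before normalizing:
    # biased[j] = score[j] + slope * (j - query_pos),
    # so keys farther behind the query are penalized more.
    biased = [s + slope * (j - query_pos) for j, s in enumerate(scores)]
    return softmax(biased)
```

Fusing this bias into the softmax (rather than keeping a separate `ggml_alibi` op, whose CUDA files are deleted below) lets the bias be applied in-register inside the fused softmax and flash-attention kernels instead of materializing a biased score matrix.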
16 files changed:
convert-hf-to-gguf.py
ggml-cuda.cu
ggml-cuda/alibi.cu [deleted file]
ggml-cuda/alibi.cuh [deleted file]
ggml-cuda/fattn.cu
ggml-cuda/softmax.cu
ggml-kompute.cpp
ggml-metal.m
ggml-metal.metal
ggml-sycl.cpp
ggml-vulkan.cpp
ggml.c
ggml.h
gguf-py/gguf/tensor_mapping.py
llama.cpp
tests/test-backend-ops.cpp