git.djapps.eu Git - pkg/ggml/sources/ggml/commit

author	Georgi Gerganov <redacted>
	Sun, 14 May 2023 08:23:02 +0000 (11:23 +0300)
committer	Georgi Gerganov <redacted>
	Sun, 14 May 2023 12:18:34 +0000 (15:18 +0300)
commit	fe48e22fd65ec4e0b3eb15a0809d6f85d1d6dee8
tree	198bc0835b5fa35190146fbe0077640ec0bc4418
parent	effcfa62da543e71affe6c39b78d0064f0c5d71d

ggml : new Q4 and Q5 quantization formats + backward ops

sync llama.cpp

- bump GGML_QNT_VERSION -> 1
- increase ggml object overhead size from 256 to 512 in examples
- drop Q4_2 support
- tensor backend support CUDA

14 files changed:

examples/common-ggml.cpp
examples/dolly-v2/main.cpp
examples/gpt-2/main.cpp
examples/gpt-j/main.cpp
examples/gpt-neox/main.cpp
examples/mnist/main.cpp
examples/starcoder/main.cpp
examples/whisper/quantize.cpp
examples/whisper/whisper.cpp
include/ggml/ggml.h
src/ggml-cuda.cu
src/ggml-cuda.h
src/ggml-opencl.c
src/ggml.c