git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

overview / pkg / ggml / sources / whisper.cpp / commit

summary | shortlog | log | commit | commitdiff | tree
(parent: 08e8414)

author	Ivan <redacted>
	Tue, 24 Sep 2024 00:14:24 +0000 (03:14 +0300)
committer	Georgi Gerganov <redacted>
	Tue, 24 Sep 2024 16:45:08 +0000 (19:45 +0300)
commit	2fc1d20f9ee2e66c199ec104e73e8c3dd3e57312
tree	fc4745ed57f7cbac3a823ec28770b9eb1ee6e8ee	tree
parent	08e8414f277a1a559d52dd5a474f777353ec61fc	commit \| diff

cuda: add q8_0->f32 cpy operation (llama/9571)

llama: enable K-shift for quantized KV cache
It will fail on unsupported backends or quant types.

ggml/src/ggml-cuda.cu		diff \| blob \| history
ggml/src/ggml-cuda/cpy.cu		diff \| blob \| history

Packaging of ggerganov/whisper.cpp