]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: faster q8_0 -> f16 dequantization (llama/4895)
authorJohannes Gäßler <redacted>
Fri, 12 Jan 2024 19:38:54 +0000 (20:38 +0100)
committerGeorgi Gerganov <redacted>
Sat, 13 Jan 2024 22:11:44 +0000 (00:11 +0200)
commit12490f4398f38e1b5ded7a5c01d035f41388c8f2
tree6113e9d26c4e6ce4d1f68d1d9367fa5ab9a4580f
parentdb078a9ba8aeb575a34ef5e648fa09e3f79c89a8
CUDA: faster q8_0 -> f16 dequantization (llama/4895)
ggml-cuda.cu