git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: faster q8_0 -> f16 dequantization (#4895)
author Johannes Gäßler <redacted>
Fri, 12 Jan 2024 19:38:54 +0000 (20:38 +0100)
committer GitHub <redacted>
Fri, 12 Jan 2024 19:38:54 +0000 (20:38 +0100)
commit 3fe81781e3bf98b8e44946240a19f3a6ad08a11a
tree a54105fc0dea1b131c358d291f6498bb08f7c61f
parent e7e4df031b9e29d4b55a4e0b0295187f6b213db1
ggml-cuda.cu