]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
CUDA: faster dequantize kernels for Q4_0 and Q4_1 (#4938)
authorKawrakow <redacted>
Mon, 15 Jan 2024 05:48:06 +0000 (07:48 +0200)
committerGitHub <redacted>
Mon, 15 Jan 2024 05:48:06 +0000 (07:48 +0200)
commit4a3156de2fac9a8ee4279de7804d4e352dcfe121
treee927e6fb6769d3447590c2f92bc34d432b83a858
parenta836c8f534ab789b02da149fbdaf7735500bff74
CUDA: faster dequantize kernels for Q4_0 and Q4_1 (#4938)

Co-authored-by: Iwan Kawrakow <redacted>
ggml-cuda.cu