]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (llama/12000)
authorGian-Carlo Pascutto <redacted>
Sat, 22 Feb 2025 08:43:24 +0000 (09:43 +0100)
committerGeorgi Gerganov <redacted>
Tue, 25 Feb 2025 11:33:09 +0000 (13:33 +0200)
commit71efb7582ba4d9e5045f15514e65d35357021fe5
treeb10b269b80e0f8c36dbb215f5fe57f992d37e69c
parent76e8f910a197c64172ac9fcf54e1f84799bba5d3
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (llama/12000)
src/ggml-cuda/cpy.cu
src/ggml-cuda/ggml-cuda.cu