]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (llama/12000)
authorGian-Carlo Pascutto <redacted>
Sat, 22 Feb 2025 08:43:24 +0000 (09:43 +0100)
committerGeorgi Gerganov <redacted>
Thu, 27 Feb 2025 06:55:36 +0000 (08:55 +0200)
commit98dab49b9aec703ba2ac13e7e86be6122a4adbed
tree835aa089531015489ef32286d3a6767dead28859
parentb1385e9aa97a356a85a55fade422f26bc064552f
cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion support. (llama/12000)
ggml/src/ggml-cuda/cpy.cu
ggml/src/ggml-cuda/ggml-cuda.cu