]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: revise q8_1 data layout for mul_mat_q (llama/7824)
authorJohannes Gäßler <redacted>
Sun, 9 Jun 2024 07:42:25 +0000 (09:42 +0200)
committerGeorgi Gerganov <redacted>
Sun, 16 Jun 2024 15:19:48 +0000 (18:19 +0300)
commit760497e1abd683f2c61bb408696257bfb0d4f901
tree3a4fa802ee4726096b31f7ad900ea8805c44a20b
parentb172e7714c418bfb97998f2b8b97cad745c1a771
CUDA: revise q8_1 data layout for mul_mat_q (llama/7824)
ggml-cuda.cu
ggml-cuda/mmq.cu
ggml-cuda/mmq.cuh
ggml-cuda/quantize.cu
ggml-cuda/quantize.cuh