]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8 (llama/7681)
authorJohannes Gäßler <redacted>
Sat, 1 Jun 2024 13:47:04 +0000 (15:47 +0200)
committerGeorgi Gerganov <redacted>
Sun, 16 Jun 2024 15:19:48 +0000 (18:19 +0300)
commita16137d13dac76230fe7a24ba3719c4f67694155
tree6471bfe64593f68775158bc2f0505df36c1cefc0
parent5582039d0a7f1454449e42e7c12c698ea4358dfb
CUDA: fix Pascal FA, deq. KV to FP16 for batch > 8 (llama/7681)
ggml-cuda/fattn-common.cuh
ggml-cuda/fattn-tile-f16.cu
ggml-cuda/fattn-tile-f32.cu
ggml-cuda/fattn-vec-f16.cuh
ggml-cuda/fattn-vec-f32.cuh
ggml-cuda/fattn-wmma-f16.cuh
ggml-cuda/fattn.cu