]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: generalize FP16 fattn vec kernel (llama/7061)
authorJohannes Gäßler <redacted>
Thu, 9 May 2024 12:32:02 +0000 (14:32 +0200)
committerGeorgi Gerganov <redacted>
Mon, 13 May 2024 08:02:26 +0000 (11:02 +0300)
commit4be936b88ba64faa027fca89af5e2cfcaa64e926
tree2c2e7ae20a6eb7ac2f376ad4af07fc49cb0e0279
parent26c550f77287158d14f3e8b15486cec86ee8a42d
CUDA: generalize FP16 fattn vec kernel (llama/7061)

* CUDA: generalize FP16 fattn vec kernel

* disable unsupported head sizes for AMD in test

* try AMD fix

* fix batch size 2-8

* partially revert changes
ggml-cuda/common.cuh
ggml-cuda/fattn.cu