git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
enable fp16/fast_fp16/bf16_mma on PH1 (llama/17551)
author R0CKSTAR <redacted>
Fri, 28 Nov 2025 13:08:29 +0000 (21:08 +0800)
committer Georgi Gerganov <redacted>
Fri, 12 Dec 2025 15:53:12 +0000 (17:53 +0200)
commit c372bdbb3c6bf897d41aedfeaac1f57e55247f66
tree fa295a60aeea0753d2b78f908f93f4d7a5595363
parent 90ca4e0a07d2050306f0ce44751193bef1d8d173

* [MUSA] enable fp16/fast_fp16/bf16_mma on PH1

Signed-off-by: Xiaodong Ye <redacted>

* Update ggml/src/ggml-cuda/fattn-vec.cuh

Co-authored-by: Johannes Gäßler <redacted>

* Update ggml/src/ggml-cuda/fattn-vec.cuh

Co-authored-by: Johannes Gäßler <redacted>

* Update ggml/src/ggml-cuda/fattn-tile.cuh

Co-authored-by: Johannes Gäßler <redacted>

* Address review comments

Signed-off-by: Xiaodong Ye <redacted>

---------

Signed-off-by: Xiaodong Ye <redacted>
Co-authored-by: Johannes Gäßler <redacted>
ggml/src/ggml-cuda/common.cuh
ggml/src/ggml-cuda/cpy.cu
ggml/src/ggml-cuda/fattn-tile.cuh
ggml/src/ggml-cuda/fattn-vec.cuh
ggml/src/ggml-cuda/mma.cuh