git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: add bf16 and f32 support to cublas_mul_mat_batched (llama/14361)
author Aman Gupta <redacted>
Sat, 28 Jun 2025 17:30:53 +0000 (01:30 +0800)
committer Georgi Gerganov <redacted>
Tue, 1 Jul 2025 14:54:53 +0000 (17:54 +0300)
commit b900ee424c67a14fd3f650553e6445b83186e39b
tree fd308949e8ce2824fc365fa6211425229d287cdb
parent f641a4c410832a29f4389043cf932811de8e377a

* CUDA: add bf16 and f32 support to cublas_mul_mat_batched

* Review: add type traits and make function more generic

* Review: make check more explicit, add back comments, and fix formatting

* Review: fix formatting, remove useless type conversion, fix naming for bools
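
The "type traits" step in the list above can be illustrated with a minimal sketch. It is not the code from this commit: every name in it (elem_type, data_kind, compute_kind, batched_traits, batch_bytes, batch_bytes_runtime) is hypothetical, the enums are stand-ins for the real ggml and cuBLAS types so that the example compiles without the CUDA toolkit, and the compute-kind chosen for each element type is an assumption. The idea is that one trait specialization per element type lets a single templated function cover f32, f16 and bf16 instead of one hard-coded path.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-ins for the element, data and compute enums that a
// real CUDA build would take from ggml and cuBLAS.
enum class elem_type    { F32, F16, BF16 };
enum class data_kind    { R_32F, R_16F, R_16BF };
enum class compute_kind { C_32F, C_16F };

// Primary template is left undefined, so an unsupported type fails to compile.
template <elem_type T> struct batched_traits;

template <> struct batched_traits<elem_type::F32> {
    static constexpr data_kind    data    = data_kind::R_32F;
    static constexpr compute_kind compute = compute_kind::C_32F;
    static constexpr std::size_t  size    = 4; // bytes per element
};

template <> struct batched_traits<elem_type::F16> {
    static constexpr data_kind    data    = data_kind::R_16F;
    static constexpr compute_kind compute = compute_kind::C_16F; // assumption
    static constexpr std::size_t  size    = 2;
};

template <> struct batched_traits<elem_type::BF16> {
    static constexpr data_kind    data    = data_kind::R_16BF;
    static constexpr compute_kind compute = compute_kind::C_32F; // assumption
    static constexpr std::size_t  size    = 2;
};

// Generic helper: the same body serves all three element types.
template <elem_type T>
constexpr std::size_t batch_bytes(std::size_t rows, std::size_t cols, std::size_t batches) {
    return rows * cols * batches * batched_traits<T>::size;
}

// Runtime dispatch from an enum value to the matching template instantiation.
inline std::size_t batch_bytes_runtime(elem_type t, std::size_t rows,
                                       std::size_t cols, std::size_t batches) {
    switch (t) {
        case elem_type::F32:  return batch_bytes<elem_type::F32>(rows, cols, batches);
        case elem_type::F16:  return batch_bytes<elem_type::F16>(rows, cols, batches);
        case elem_type::BF16: return batch_bytes<elem_type::BF16>(rows, cols, batches);
    }
    assert(false && "unhandled element type");
    return 0;
}
```

With this layout, supporting another element type means adding one trait specialization and one switch case; the generic helper itself does not change.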
ggml/src/ggml-cuda/convert.cu
ggml/src/ggml-cuda/convert.cuh
ggml/src/ggml-cuda/ggml-cuda.cu