git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
cuda : add f32 to bf16 copy op (llama/12806)
author Sigbjørn Skjæret <redacted>
Tue, 8 Apr 2025 21:21:31 +0000 (23:21 +0200)
committer Georgi Gerganov <redacted>
Thu, 24 Apr 2025 17:39:16 +0000 (20:39 +0300)
commit 79f23d9132a984fd7f84741e5c3e9343adee7553
tree b914df4caae9a0238563dab8c5031e05a81df581
parent ee2cbeeb740fee02ac0919c709c398ffc2025775

This allows BF16 KV-cache on CUDA.
ggml/src/ggml-cuda/cpy.cu
ggml/src/ggml-cuda/ggml-cuda.cu
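
For readers unfamiliar with the conversion behind an f32 to bf16 copy, here is a minimal host-side C++ sketch of what such a copy has to do per element. It is not the code from this commit: the function names (`f32_to_bf16_sketch`, `bf16_to_f32_sketch`, `cpy_f32_to_bf16_sketch`) are hypothetical, and the rounding mode (round to nearest, ties to even) is an assumption about typical bf16 conversion rather than something the commit message states. A real CUDA kernel would run the per-element step in parallel across threads.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical helper, not ggml's implementation.
// bf16 keeps the sign, the 8 exponent bits and the top 7 mantissa bits of an
// IEEE-754 float, i.e. the upper 16 bits of its 32-bit pattern.
static uint16_t f32_to_bf16_sketch(float x) {
    uint32_t bits;
    std::memcpy(&bits, &x, sizeof(bits));  // reinterpret without aliasing issues

    // NaN: truncate and force a mantissa bit so the result stays a NaN.
    if ((bits & 0x7fffffffu) > 0x7f800000u) {
        return static_cast<uint16_t>((bits >> 16) | 0x0040u);
    }

    // Round to nearest, ties to even (assumed rounding mode): add 0x7fff plus
    // the lowest bit that will be kept, then drop the low 16 bits.
    bits += 0x7fffu + ((bits >> 16) & 1u);
    return static_cast<uint16_t>(bits >> 16);
}

// Hypothetical inverse: widening bf16 to f32 is exact, the low bits are zero.
static float bf16_to_f32_sketch(uint16_t h) {
    uint32_t bits = static_cast<uint32_t>(h) << 16;
    float x;
    std::memcpy(&x, &bits, sizeof(x));
    return x;
}

// Hypothetical contiguous copy: converts n floats from src into bf16 values
// stored as raw 16-bit patterns in dst.
static void cpy_f32_to_bf16_sketch(const float* src, uint16_t* dst, size_t n) {
    for (size_t i = 0; i < n; ++i) {
        dst[i] = f32_to_bf16_sketch(src[i]);
    }
}
```

Because bf16 keeps the full f32 exponent range and only shortens the mantissa, a cache stored this way takes half the memory of an f32 one while still covering the same range of magnitudes, at the cost of precision.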