git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
cuda : add f32 to bf16 copy op (#12806)
author Sigbjørn Skjæret <redacted>
Tue, 8 Apr 2025 21:21:31 +0000 (23:21 +0200)
committer GitHub <redacted>
Tue, 8 Apr 2025 21:21:31 +0000 (23:21 +0200)
commit 7538246e7ce0606694c38055cc2fc9f60535be6c
tree 7188f7c57b086f08e7d0c13b76577eccbbffa28d
parent b32efad2bc42460637c3a364c9554ea8217b3d7f

This allows the KV cache to be stored as BF16 on CUDA.
ggml/src/ggml-cuda/cpy.cu
ggml/src/ggml-cuda/ggml-cuda.cu
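
For readers unfamiliar with the conversion behind an f32 to bf16 copy, here is a minimal host-side C++ sketch of the arithmetic. It is not the kernel added by this commit: the names `f32_to_bf16_bits`, `bf16_bits_to_f32`, and `copy_f32_to_bf16` are hypothetical, and the choice of round-to-nearest-even with quieted NaNs is an assumption about typical behavior, not something the commit message states.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical helper (not from the commit): convert one f32 to raw bf16 bits.
// bf16 keeps the sign bit, the 8 exponent bits, and the top 7 mantissa bits of f32.
static inline uint16_t f32_to_bf16_bits(float x) {
    uint32_t u;
    std::memcpy(&u, &x, sizeof u);  // bit-cast without violating aliasing rules
    if ((u & 0x7fffffffu) > 0x7f800000u) {
        // NaN: truncate and set the quiet bit so the result cannot collapse to Inf.
        return static_cast<uint16_t>((u >> 16) | 0x0040u);
    }
    // Assumed rounding mode: round to nearest, ties to even.
    // Adding 0x7fff plus the lowest kept bit carries into the kept bits when needed.
    u += 0x7fffu + ((u >> 16) & 1u);
    return static_cast<uint16_t>(u >> 16);
}

// Hypothetical helper: widen raw bf16 bits back to f32 (exact, no rounding).
static inline float bf16_bits_to_f32(uint16_t h) {
    uint32_t u = static_cast<uint32_t>(h) << 16;
    float x;
    std::memcpy(&x, &u, sizeof x);
    return x;
}

// Hypothetical element-wise copy over contiguous buffers. A GPU kernel would
// typically assign one element (or a small group) to each thread instead of looping.
static void copy_f32_to_bf16(const float * src, uint16_t * dst, size_t n) {
    assert(n == 0 || (src != nullptr && dst != nullptr));
    for (size_t i = 0; i < n; ++i) {
        dst[i] = f32_to_bf16_bits(src[i]);
    }
}
```

Because bf16 shares the f32 exponent range, the copy loses only mantissa precision, which is why it suits cache tensors whose values are produced in f32 and stored in a 16-bit type.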