]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute...
authorWallentri <redacted>
Sat, 14 Mar 2026 07:43:13 +0000 (10:43 +0300)
committerGeorgi Gerganov <redacted>
Sun, 15 Mar 2026 19:50:13 +0000 (21:50 +0200)
commit7dfc929b5784b9d122d0d67cd96515effc18f00d
tree51cea414fd34c09a73f88c74bb8e9a53e2348205
parent8872f41bebf6b005d22f92aa38c33e2efa3c2ae3
Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type (llama/19959)

* Update ggml-cuda.cu

* Update ggml-cuda.cu

* Update build.md

* Update build.md

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Johannes Gäßler <redacted>
* Update ggml-cuda.cu

* Update build.md

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Johannes Gäßler <redacted>
* Update build.md

* Update ggml-cuda.cu

* Update ggml-cuda.cu

---------

Co-authored-by: Johannes Gäßler <redacted>
src/ggml-cuda/ggml-cuda.cu