]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute...
authorWallentri <redacted>
Sat, 14 Mar 2026 07:43:13 +0000 (10:43 +0300)
committerGitHub <redacted>
Sat, 14 Mar 2026 07:43:13 +0000 (15:43 +0800)
commitf2c0dfb7394b3abb5a5afd1c2a94f621bb64236f
tree4b71940d7de75a88ca8412e1c58fa58b91065633
parent9789c4ecdc01d571331c14e5197514b53839de4b
Use fp32 in cuBLAS V100 to avoid overflows, env variables to override cuBLAS compute type (#19959)

* Update ggml-cuda.cu

* Update ggml-cuda.cu

* Update build.md

* Update build.md

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Johannes Gäßler <redacted>
* Update ggml-cuda.cu

* Update build.md

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Johannes Gäßler <redacted>
* Update build.md

* Update ggml-cuda.cu

* Update ggml-cuda.cu

---------

Co-authored-by: Johannes Gäßler <redacted>
docs/build.md
ggml/src/ggml-cuda/ggml-cuda.cu