]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
ggml-cuda : perform cublas fp16 matrix multiplication as fp16 (#3370)
authorslaren <redacted>
Thu, 28 Sep 2023 10:08:28 +0000 (12:08 +0200)
committerGitHub <redacted>
Thu, 28 Sep 2023 10:08:28 +0000 (13:08 +0300)
commitda0400344be12074e67dcabc565140289cf7efaa
tree1f668aaaaf54c33699494a786d4249fc8ca08591
parente519621010cac02c6fec0f8f3b16cda0591042c0
ggml-cuda : perform cublas fp16 matrix multiplication as fp16 (#3370)

* ggml-cuda : perform cublas fp16 matrix multiplication as fp16

* try to fix rocm build

* restrict fp16 mat mul to volta and up
ggml-cuda.cu