]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
Improve cuBLAS performance by dequantizing on the GPU (#1065)
authorslaren <redacted>
Thu, 20 Apr 2023 01:14:14 +0000 (03:14 +0200)
committerGitHub <redacted>
Thu, 20 Apr 2023 01:14:14 +0000 (03:14 +0200)
commit02d6988121510c067e06d498a273a351a888f5b9
tree98c6204ad4f3db40bc49595bb7705e8bcd699e5d
parent834695fe3a3ed2a962e774c9615e3f7b41d360a8
Improve cuBLAS performance by dequantizing on the GPU (#1065)
CMakeLists.txt
Makefile
ggml-cuda.cu [new file with mode: 0644]
ggml-cuda.h [new file with mode: 0644]
ggml.c