]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
OpenCL Token Generation Acceleration (#1459)
author0cc4m <redacted>
Mon, 22 May 2023 21:33:24 +0000 (23:33 +0200)
committerGitHub <redacted>
Mon, 22 May 2023 21:33:24 +0000 (00:33 +0300)
commit2e6cd4b02549e343bef3768e6b946f999c82e823
tree70ce2c5dcb9beaac230dfa23d531f6e195d12975
parent7e4ea5beff567f53be92f75f9089e6f11fa5dabd
OpenCL Token Generation Acceleration (#1459)

* Move back to C++ for OpenCL

* Refactor OpenCL code to work more like the CUDA code, add missing functions

* Deduplicate dequant kernels

* Add OpenCL compile options

* Use compile args for preprocessing constants

* Restore default platform + device selection by id behavior

---------

Co-authored-by: Johannes Gäßler <redacted>
Co-authored-by: Henri Vasserman <redacted>
CMakeLists.txt
Makefile
ggml-opencl.c [deleted file]
ggml-opencl.cpp [new file with mode: 0644]
ggml-opencl.h
ggml.c
ggml.h
llama.cpp