]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
sycl: quantize and reorder the input to q8_1 when reorder is enabled (#13826)
authorAtharva Dubey <redacted>
Mon, 2 Jun 2025 09:12:20 +0000 (10:12 +0100)
committerGitHub <redacted>
Mon, 2 Jun 2025 09:12:20 +0000 (10:12 +0100)
commit663445b0deb21fb602176da030d4154197a4fca6
treeb5c2c17d4ccefb419f1fcf0120cebd2b91ba77c9
parent7675c555a13c9f473249e59a54db35032ce8e0fc
sycl: quantize and reorder the input to q8_1 when reorder is enabled (#13826)

* [WIP]: fuse q8 quantization and reorder

* wip2: fuse q8 quantization and reorder

* working q8 reorder commit

* restored common.hpp

* remove debug prints

* remove unnecessary headers and remove trailing whitespace

* Update ggml/src/ggml-sycl/ggml-sycl.cpp

Co-authored-by: Alberto Cabrera Pérez <redacted>
---------

Co-authored-by: Alberto Cabrera Pérez <redacted>
ggml/src/ggml-sycl/ggml-sycl.cpp
ggml/src/ggml-sycl/mmvq.cpp
ggml/src/ggml-sycl/vecdotq.hpp