]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
sycl: quantize and reorder the input to q8_1 when reorder is enabled (llama/13826)
authorAtharva Dubey <redacted>
Mon, 2 Jun 2025 09:12:20 +0000 (10:12 +0100)
committerGeorgi Gerganov <redacted>
Tue, 10 Jun 2025 09:40:33 +0000 (12:40 +0300)
commitef2a79d2b83d92953690efec23060668677e530e
treec18a345d5da66bd3677bd5c9957c2afa72739a2c
parent9589645e721ebd10dfe743e41eff314006c0d855
sycl: quantize and reorder the input to q8_1 when reorder is enabled (llama/13826)

* [WIP]: fuse q8 quantization and reorder

* wip2: fuse q8 quantization and reorder

* working q8 reorder commit

* restored common.hpp

* remove debug prints

* remove unnecessary headers and remove trailing whitespace

* Update ggml/src/ggml-sycl/ggml-sycl.cpp

Co-authored-by: Alberto Cabrera Pérez <redacted>
---------

Co-authored-by: Alberto Cabrera Pérez <redacted>
ggml/src/ggml-sycl/ggml-sycl.cpp
ggml/src/ggml-sycl/mmvq.cpp
ggml/src/ggml-sycl/vecdotq.hpp