]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
Optimize mul_mat for Q4_0 on Intel GPU (llama/12035)
authorNeo Zhang Jianyu <redacted>
Mon, 24 Feb 2025 14:33:23 +0000 (22:33 +0800)
committerGeorgi Gerganov <redacted>
Tue, 25 Feb 2025 11:33:09 +0000 (13:33 +0200)
commit0beb02b81aa503bafdaebec74506fba7a433c7c3
tree6ab2eac64faf01892199cb7cc0e2aff001a469b0
parent8be36cae6efe7dcac5d939f675bcf952527f86f7
Optimize mul_mat for Q4_0 on Intel GPU (llama/12035)

* opt performance by reorder for Intel GPU

* detect hw type and save opt feature, and print opt feature

* correct name

* support optimize graph once when compute graph, record the opt status in tensor->extra, make CI passed

* add env variable GGML_SYCL_DISABLE_OPT for debug

* use syclex::architecture replace the custom hw define, update the guide for GGML_SYCL_DISABLE_OPT

* add performance data

* mv getrows functions to separeted files

* fix global variables

---------

Co-authored-by: arthw <redacted>
12 files changed:
src/ggml-sycl/CMakeLists.txt
src/ggml-sycl/common.cpp
src/ggml-sycl/common.hpp
src/ggml-sycl/convert.cpp
src/ggml-sycl/convert.hpp
src/ggml-sycl/dequantize.hpp
src/ggml-sycl/dmmv.cpp
src/ggml-sycl/getrows.cpp [new file with mode: 0644]
src/ggml-sycl/getrows.hpp [new file with mode: 0644]
src/ggml-sycl/ggml-sycl.cpp
src/ggml-sycl/sycl_hw.cpp [new file with mode: 0644]
src/ggml-sycl/sycl_hw.hpp [new file with mode: 0644]