]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
[SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035)
authorNeo Zhang Jianyu <redacted>
Mon, 24 Feb 2025 14:33:23 +0000 (22:33 +0800)
committerGitHub <redacted>
Mon, 24 Feb 2025 14:33:23 +0000 (22:33 +0800)
commit08d5986290cc42d2c52739e046642b8252f97e4b
treec3ff560c972dc200e77d3dfe053df06440bd6d89
parent651adf4b6675339465179b81b194d98cc14704d6
[SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035)

* opt performance by reorder for Intel GPU

* detect hw type and save opt feature, and print opt feature

* correct name

* support optimize graph once when compute graph, record the opt status in tensor->extra, make CI passed

* add env variable GGML_SYCL_DISABLE_OPT for debug

* use syclex::architecture replace the custom hw define, update the guide for GGML_SYCL_DISABLE_OPT

* add performance data

* mv getrows functions to separeted files

* fix global variables

---------

Co-authored-by: arthw <redacted>
14 files changed:
docs/backend/SYCL.md
examples/sycl/run-llama2.sh
ggml/src/ggml-sycl/CMakeLists.txt
ggml/src/ggml-sycl/common.cpp
ggml/src/ggml-sycl/common.hpp
ggml/src/ggml-sycl/convert.cpp
ggml/src/ggml-sycl/convert.hpp
ggml/src/ggml-sycl/dequantize.hpp
ggml/src/ggml-sycl/dmmv.cpp
ggml/src/ggml-sycl/getrows.cpp [new file with mode: 0644]
ggml/src/ggml-sycl/getrows.hpp [new file with mode: 0644]
ggml/src/ggml-sycl/ggml-sycl.cpp
ggml/src/ggml-sycl/sycl_hw.cpp [new file with mode: 0644]
ggml/src/ggml-sycl/sycl_hw.hpp [new file with mode: 0644]