git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit


author	Neo Zhang Jianyu <redacted>
	Mon, 24 Feb 2025 14:33:23 +0000 (22:33 +0800)
committer	GitHub <redacted>
	Mon, 24 Feb 2025 14:33:23 +0000 (22:33 +0800)
commit	08d5986290cc42d2c52739e046642b8252f97e4b
tree	c3ff560c972dc200e77d3dfe053df06440bd6d89
parent	651adf4b6675339465179b81b194d98cc14704d6

[SYCL] Optimize mul_mat for Q4_0 on Intel GPU (#12035)

* optimize performance by reordering data for Intel GPU

* detect the hardware type, then save and print the optimization feature

* correct name

* optimize the graph only once when computing it, record the optimization status in tensor->extra, and make CI pass

* add environment variable GGML_SYCL_DISABLE_OPT for debugging

* use syclex::architecture to replace the custom hardware definitions, and update the guide for GGML_SYCL_DISABLE_OPT

* add performance data

* move getrows functions to separate files

* fix global variables

---------

Co-authored-by: arthw <redacted>
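
The reorder and the GGML_SYCL_DISABLE_OPT switch mentioned in the commit message can be illustrated with a minimal host-side sketch. It is not the code from this commit. It assumes the usual Q4_0 block layout (32 weights per block, one 16-bit scale followed by 16 bytes of packed 4-bit values) and assumes the reorder turns the array of blocks into two contiguous regions, all packed values first and all scales after. The names reorder_q4_0 and opt_disabled are hypothetical, and treating any non-zero value of the variable as "disabled" is also an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <cstring>
#include <vector>

// Assumed Q4_0 layout: 32 weights per block.
constexpr std::size_t QK4_0 = 32;

struct block_q4_0 {
    std::uint16_t d;              // per-block scale, kept as raw fp16 bits in this sketch
    std::uint8_t  qs[QK4_0 / 2];  // 32 weights packed as 4-bit values
};
static_assert(sizeof(block_q4_0) == 18, "unexpected padding in block_q4_0");

// Hypothetical helper: convert an array of blocks into a buffer that holds
// every block's packed values contiguously, followed by every block's scale.
std::vector<std::uint8_t> reorder_q4_0(const std::vector<block_q4_0>& blocks) {
    const std::size_t n        = blocks.size();
    const std::size_t qs_bytes = n * (QK4_0 / 2);
    std::vector<std::uint8_t> out(qs_bytes + n * sizeof(std::uint16_t));

    for (std::size_t i = 0; i < n; ++i) {
        // Packed values of block i go to the front region.
        std::memcpy(out.data() + i * (QK4_0 / 2), blocks[i].qs, QK4_0 / 2);
        // Scale of block i goes to the tail region.
        std::memcpy(out.data() + qs_bytes + i * sizeof(std::uint16_t),
                    &blocks[i].d, sizeof(std::uint16_t));
    }

    // The reordered buffer has exactly the same size as the original blocks.
    assert(out.size() == n * sizeof(block_q4_0));
    return out;
}

// Hypothetical helper: returns true when GGML_SYCL_DISABLE_OPT is set to a
// non-zero integer (assumed semantics for this sketch).
bool opt_disabled() {
    const char* v = std::getenv("GGML_SYCL_DISABLE_OPT");
    return v != nullptr && std::atoi(v) != 0;
}
```

A caller would check opt_disabled() once and apply reorder_q4_0 only when it returns false. Keeping the packed values contiguous lets neighbouring work-items read adjacent bytes, which is the usual motivation for this kind of layout change on GPUs.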

14 files changed:

docs/backend/SYCL.md
examples/sycl/run-llama2.sh
ggml/src/ggml-sycl/CMakeLists.txt
ggml/src/ggml-sycl/common.cpp
ggml/src/ggml-sycl/common.hpp
ggml/src/ggml-sycl/convert.cpp
ggml/src/ggml-sycl/convert.hpp
ggml/src/ggml-sycl/dequantize.hpp
ggml/src/ggml-sycl/dmmv.cpp
ggml/src/ggml-sycl/getrows.cpp	[new file with mode: 0644]
ggml/src/ggml-sycl/getrows.hpp	[new file with mode: 0644]
ggml/src/ggml-sycl/ggml-sycl.cpp
ggml/src/ggml-sycl/sycl_hw.cpp	[new file with mode: 0644]
ggml/src/ggml-sycl/sycl_hw.hpp	[new file with mode: 0644]

Packaging of ggml-org/llama.cpp
