git.djapps.eu Git - pkg/ggml/sources/ggml/commit
Fix WARP_SIZE=16 bug of Intel GPU (llama/8266)
author luoyu-intel <redacted>
Fri, 5 Jul 2024 05:06:13 +0000 (05:06 +0000)
committer Georgi Gerganov <redacted>
Mon, 8 Jul 2024 10:03:28 +0000 (13:03 +0300)
commit ea98f39dedcd0391c75ca4f1c5a35d5becea8dbc
tree bac4c2ee2c413c255a81e21931c0f5aee037a3c1
parent c2b13bbef9ab16b96f8a0e609b28e5b67b547202

* fix group_norm unit test

* split softmax

* fix softmax

* add concat support condition

* revert debug code

* move QK_WARP_SIZE to presets.hpp
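
The commit message does not describe the failure itself. As a rough illustration only, the sketch below shows one generic way a hardcoded warp (sub-group) width can go wrong: an XOR-butterfly reduction, of the kind often used for the max and sum steps of softmax, produces a full result only when the width it assumes matches the number of lanes that actually cooperate. This is a plain C++ simulation on the CPU, not SYCL and not code from this commit; `butterfly_max`, `ramp`, and `assumed_width` are hypothetical names introduced here.

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Hypothetical CPU simulation of an XOR-butterfly max reduction across the
// lanes of one sub-group. `assumed_width` stands in for a compile-time
// WARP_SIZE-style constant; `lanes.size()` is the number of lanes that really
// take part. Illustration only, not code from the commit.
std::vector<float> butterfly_max(std::vector<float> lanes, std::size_t assumed_width) {
    const std::size_t n = lanes.size();
    for (std::size_t mask = assumed_width / 2; mask > 0; mask >>= 1) {
        // Snapshot the values so every lane reads its partner's value from
        // before this step, as a simultaneous exchange would.
        const std::vector<float> prev = lanes;
        for (std::size_t lane = 0; lane < n; ++lane) {
            const std::size_t partner = lane ^ mask;
            // In this simulation a partner outside the group is simply skipped.
            if (partner < n) {
                lanes[lane] = std::max(prev[lane], prev[partner]);
            }
        }
    }
    return lanes;
}

// Helper for the illustration: lane i holds the value i.
std::vector<float> ramp(std::size_t n) {
    std::vector<float> v(n);
    for (std::size_t i = 0; i < n; ++i) {
        v[i] = static_cast<float>(i);
    }
    return v;
}
```

With 32 lanes and an assumed width of 32, every lane ends up holding the group-wide maximum. With 32 lanes and an assumed width of 16, each half of the group reduces separately, so lanes 0 to 15 never see the values held by lanes 16 to 31. Keeping the width in one shared constant (the last bullet moves `QK_WARP_SIZE` into presets.hpp) is one way to keep kernels and launch parameters in agreement, though whether that was the motivation here is an assumption.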
src/CMakeLists.txt
src/ggml-sycl.cpp
src/ggml-sycl/backend.hpp
src/ggml-sycl/dmmv.cpp
src/ggml-sycl/norm.cpp
src/ggml-sycl/presets.hpp
src/ggml-sycl/softmax.cpp [new file with mode: 0644]
src/ggml-sycl/softmax.hpp [new file with mode: 0644]