git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
Fix WARP_SIZE=16 bug of Intel GPU (llama/8266)
author luoyu-intel <redacted>
Fri, 5 Jul 2024 05:06:13 +0000 (05:06 +0000)
committer Georgi Gerganov <redacted>
Mon, 8 Jul 2024 11:53:55 +0000 (14:53 +0300)
commit 29a2739d279c871b0a0ec5fb00586cd158aab0e7
tree aa0217be8f9c0159151e1d5c8674a365aee34928
parent ee6d17f6b43013b898fb00552193e9be7bc47dc4

* fix group_norm ut

* split softmax

* fix softmax

* add concat support condition

* revert debug code

* move QK_WARP_SIZE to presets.hpp
ggml/src/CMakeLists.txt
ggml/src/ggml-sycl.cpp