]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
Vulkan Optimizations and Fixes (#8959)
author0cc4m <redacted>
Wed, 14 Aug 2024 16:32:53 +0000 (18:32 +0200)
committerGitHub <redacted>
Wed, 14 Aug 2024 16:32:53 +0000 (18:32 +0200)
commit5fd89a70ead34d1a17015ddecad05aaa2490ca46
tree6da95da58a500f02ff042862563611be49eaaafd
parent98a532d474c73d3494a5353024cb6a4fbbabbb35
Vulkan Optimizations and Fixes (#8959)

* Optimize Vulkan REPEAT performance

* Use Vulkan GLSL fused multiply-add instruction where possible

* Add GGML_VULKAN_PERF option to output performance data per operator

* Rework and fix Vulkan descriptor set and descriptor pool handling

* Fix float32 concat f16 shader validation error

* Add Vulkan GROUP_NORM eps parameter

* Fix validation error with transfer queue memory barrier flags

* Remove trailing whitespaces
16 files changed:
Makefile
ggml/CMakeLists.txt
ggml/src/CMakeLists.txt
ggml/src/ggml-vulkan.cpp
ggml/src/vulkan-shaders/concat.comp
ggml/src/vulkan-shaders/mul_mat_vec.comp
ggml/src/vulkan-shaders/mul_mat_vec_nc.comp
ggml/src/vulkan-shaders/mul_mat_vec_p021.comp
ggml/src/vulkan-shaders/mul_mat_vec_q2_k.comp
ggml/src/vulkan-shaders/mul_mat_vec_q3_k.comp
ggml/src/vulkan-shaders/mul_mat_vec_q4_k.comp
ggml/src/vulkan-shaders/mul_mat_vec_q5_k.comp
ggml/src/vulkan-shaders/mul_mat_vec_q6_k.comp
ggml/src/vulkan-shaders/mul_mm.comp
ggml/src/vulkan-shaders/repeat.comp [new file with mode: 0644]
ggml/src/vulkan-shaders/vulkan-shaders-gen.cpp