]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: Hybrid waitForFences/getFenceStatus to reduce fence latency (llama/12630)
authorJeff Bolz <redacted>
Fri, 4 Apr 2025 05:54:35 +0000 (00:54 -0500)
committerGeorgi Gerganov <redacted>
Thu, 24 Apr 2025 17:39:16 +0000 (20:39 +0300)
commit76231bda566a1ebeb7254b93974bf953073b764e
tree3836836a491ec5b301a7611d12e1665423be75bc
parent785437c253769560c2e386b2f18021795c64140f
vulkan: Hybrid waitForFences/getFenceStatus to reduce fence latency (llama/12630)

There seems to be a bubble waking up from waitForFences, which costs a few
percent performance and also increased variance in performance. This change
inserts an "almost_ready" fence when the graph is about 80% complete and we
waitForFences for the almost_ready fence and then spin (with _mm_pauses) waiting
for the final fence to be signaled.
ggml/src/ggml-vulkan/ggml-vulkan.cpp