git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
vulkan: optimize UMA buffer operations and fix driver hangs (#16059)
author Giuseppe Scrivano <redacted>
Sun, 21 Sep 2025 06:31:55 +0000 (08:31 +0200)
committer GitHub <redacted>
Sun, 21 Sep 2025 06:31:55 +0000 (08:31 +0200)
commit 1eeb523c3e0c7ffbd59469f5463dcbdecba3535e
tree f51ed71dc78055cf0135809a4a9b13af3edc7849
parent 5bb4a3edec297e74b0f7bd4ed5d0fdd12e28d858

* vulkan: optimize UMA buffer operations and fix driver hangs

The previous implementation was blocking the GPU for extended periods,
causing the i915 driver to reset the context due to the hangcheck
protection.

[32628.443070] i915 0000:00:02.0: [drm] GPU HANG: ecode 12:1:85dffffb, in llama-server [194114]
[32628.443091] i915 0000:00:02.0: [drm] llama-server[194114] context reset due to GPU hang

* vulkan: implement deferred_memset on UMA
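
As a rough illustration of what a deferred memset on unified memory (UMA) could look like, the sketch below records fill requests and applies them later on the CPU through a host-visible pointer, instead of dispatching GPU work for each one. It is not the code from this commit: `PendingMemset`, `UmaContext`, and `flush` are hypothetical names, and the assumption that the buffer is host-mapped and that the flush happens once earlier GPU work has finished is mine, not stated in the message.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical record of one pending fill; not a type from ggml-vulkan.cpp.
struct PendingMemset {
    void*        dst;    // host-visible pointer into a UMA buffer (assumed mapped)
    std::uint8_t value;  // byte value to write
    std::size_t  size;   // number of bytes to fill
};

// Hypothetical per-context queue of deferred fills.
struct UmaContext {
    std::vector<PendingMemset> pending;

    // Record the fill without touching memory yet.
    void deferred_memset(void* dst, std::uint8_t value, std::size_t size) {
        assert(dst != nullptr || size == 0);
        pending.push_back({dst, value, size});
    }

    // Apply all recorded fills on the CPU, in submission order.
    // Returns how many fills were applied.
    std::size_t flush() {
        for (const PendingMemset& m : pending) {
            std::memset(m.dst, m.value, m.size);
        }
        const std::size_t applied = pending.size();
        pending.clear();
        return applied;
    }
};
```

In this sketch the caller decides when `flush()` is safe to run, for example after waiting on whatever synchronization guards earlier writes to the same buffer. The actual change in ggml/src/ggml-vulkan/ggml-vulkan.cpp may order or batch these operations differently.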

---------

Signed-off-by: Giuseppe Scrivano <redacted>
ggml/src/ggml-vulkan/ggml-vulkan.cpp