git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
vulkan: Update topk_moe fusion to handle gpt's late softmax (llama/16656)
author Jeff Bolz <redacted>
Wed, 29 Oct 2025 13:44:29 +0000 (08:44 -0500)
committer Georgi Gerganov <redacted>
Sun, 9 Nov 2025 21:38:03 +0000 (23:38 +0200)
commit 35a3fda24075c106f4d4595508cb3c1eebd5f0aa
tree ed11ad39e5fd58384fffb5ccb3efb43026da0367
parent bc944bddc81ce3a262f531192c8e604fdd97fe61
* vulkan: Update topk_moe fusion to handle gpt's late softmax

Based on #16649.

* Add ggml_check_edges

* Add sync logging to show fusion effects

* handle clamp added in #16655

* Update ggml/src/ggml-impl.h

Co-authored-by: Diego Devesa <redacted>
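
For readers unfamiliar with the term, the following is a minimal CPU-side sketch of what "late softmax" routing is assumed to mean here: the top-k experts are picked from the raw logits and softmax is applied only to the selected values, as opposed to applying softmax over all experts first and then selecting. This is an illustration only, not the Vulkan shader or the ggml graph touched by this commit; `routing`, `route_early_softmax`, `route_late_softmax` and the helpers are hypothetical names introduced for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <numeric>
#include <vector>

// Numerically stable softmax over a vector of logits.
static std::vector<float> softmax(const std::vector<float> & x) {
    const float max_val = *std::max_element(x.begin(), x.end());
    std::vector<float> out(x.size());
    float sum = 0.0f;
    for (size_t i = 0; i < x.size(); ++i) {
        out[i] = std::exp(x[i] - max_val);
        sum += out[i];
    }
    for (float & v : out) {
        v /= sum;
    }
    return out;
}

// Indices of the k largest values, ordered from largest to smallest.
static std::vector<int> topk_indices(const std::vector<float> & x, int k) {
    assert(k >= 0 && (size_t) k <= x.size());
    std::vector<int> idx(x.size());
    std::iota(idx.begin(), idx.end(), 0);
    std::partial_sort(idx.begin(), idx.begin() + k, idx.end(),
                      [&](int a, int b) { return x[a] > x[b]; });
    idx.resize(k);
    return idx;
}

// Hypothetical result type: selected expert ids and their routing weights.
struct routing {
    std::vector<int>   ids;
    std::vector<float> weights;
};

// Early softmax (assumed ordering): softmax over all experts, then top-k.
// The selected weights generally sum to less than 1 unless renormalized.
static routing route_early_softmax(const std::vector<float> & logits, int k) {
    const std::vector<float> probs = softmax(logits);
    routing r;
    r.ids = topk_indices(probs, k);
    for (int id : r.ids) {
        r.weights.push_back(probs[id]);
    }
    return r;
}

// Late softmax (assumed ordering): top-k on raw logits, then softmax over
// only the k selected logits, so the weights sum to 1 by construction.
static routing route_late_softmax(const std::vector<float> & logits, int k) {
    routing r;
    r.ids = topk_indices(logits, k);
    std::vector<float> selected;
    for (int id : r.ids) {
        selected.push_back(logits[id]);
    }
    r.weights = softmax(selected);
    return r;
}
```

Because softmax is monotonic, both orderings select the same experts; they differ only in the weights. Dividing the early-softmax weights by their sum reproduces the late-softmax weights, which is one reason the two variants can plausibly share a fused kernel.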
ggml/src/ggml-impl.h
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/topk_moe.comp