git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

author	Jeff Bolz <redacted>
	Mon, 8 Sep 2025 18:10:07 +0000 (13:10 -0500)
committer	Georgi Gerganov <redacted>
	Sat, 20 Sep 2025 10:42:52 +0000 (13:42 +0300)
commit	c29cd54818e1c1dce5f675a3d4810fff566082ba
tree	f62f98d260c9ecb4d7d9164a342337217b41d052
parent	70ee808f3d4d95772e7be990f5c33c43ac3f6e7a

vulkan: sort graph to allow more parallel execution (llama/15850)

* vulkan: sort graph to allow more parallel execution

Add a backend proc that lets a backend modify the graph. The
Vulkan implementation looks at which nodes depend on each other
and greedily reorders them so that nodes that don't depend on
each other are grouped together. It only reorders the nodes; it
doesn't change the contents of any of them.
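
The greedy grouping described above can be illustrated with a minimal sketch. This is not the code from ggml-vulkan.cpp: the `Node` type and the `greedy_reorder` function are hypothetical stand-ins, and the sketch assumes the input is already in a valid execution order (every dependency index is smaller than the index of the node that uses it). It returns a permutation of node indices only, mirroring the "reorder, don't modify" constraint.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for a graph node: only its dependencies matter here.
struct Node {
    std::vector<int> deps; // indices of earlier nodes whose outputs this node reads
};

// Returns a new execution order (a permutation of node indices).
// Nodes emitted in the same pass do not depend on each other, so a backend
// could run each pass without synchronizing between its members.
std::vector<int> greedy_reorder(const std::vector<Node> & nodes) {
    const int n = static_cast<int>(nodes.size());
    std::vector<char> placed(n, 0); // 1 once a node belongs to a finished group
    std::vector<int>  order;
    order.reserve(nodes.size());

    for (int first = 0; first < n; ++first) {
        if (placed[first]) {
            continue;
        }
        // Start a new group with the earliest node not yet placed. All of its
        // dependencies have smaller indices and are therefore already placed.
        std::vector<int> group = { first };

        // Greedily pull in later nodes whose dependencies all live in
        // finished groups, i.e. nodes independent of the current group.
        for (int j = first + 1; j < n; ++j) {
            if (placed[j]) {
                continue;
            }
            bool independent = true;
            for (int d : nodes[j].deps) {
                assert(d >= 0 && d < j); // input must be topologically ordered
                if (!placed[d]) {
                    independent = false;
                    break;
                }
            }
            if (independent) {
                group.push_back(j);
            }
        }

        // Close the group: only now do its members count as placed.
        for (int g : group) {
            placed[g] = 1;
            order.push_back(g);
        }
    }
    return order;
}
```

For a graph where node 1 reads node 0, node 3 reads node 2, and node 4 reads nodes 1 and 3, the sketch moves node 2 next to node 0 and node 3 next to node 1, so the two independent chains advance side by side.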

With #15489, this reduces the number of synchronizations needed.

* call optimize_graph per-split
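
As a rough illustration of calling an optional per-backend hook once per split, the sketch below uses hypothetical types (`Graph`, `BackendInterface`, `Split`) and a hypothetical driver function `optimize_splits`; none of these are the real ggml declarations. It assumes that a backend without an implementation leaves the hook null, in which case its split is left untouched.

```cpp
#include <vector>

// Hypothetical stand-ins; these are not the real ggml types.
struct Graph {
    std::vector<int> nodes; // node identifiers in execution order
};

struct BackendInterface {
    // Optional hook: may reorder the graph in place.
    // Assumption: nullptr means the backend does no graph optimization.
    void (*optimize_graph)(Graph & graph);
};

struct Split {
    const BackendInterface * backend; // backend that will execute this split
    Graph                    graph;   // sub-graph assigned to that backend
};

// Gives each split's backend a chance to reorder its own sub-graph.
// Returns how many splits had a hook to call.
int optimize_splits(std::vector<Split> & splits) {
    int n_called = 0;
    for (Split & s : splits) {
        if (s.backend != nullptr && s.backend->optimize_graph != nullptr) {
            s.backend->optimize_graph(s.graph);
            ++n_called;
        }
    }
    return n_called;
}
```

Calling the hook per split rather than on the whole graph keeps each backend's reordering confined to the nodes it will actually run.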

ggml/src/ggml-backend-impl.h
ggml/src/ggml-backend.cpp
ggml/src/ggml-blas/ggml-blas.cpp
ggml/src/ggml-cann/ggml-cann.cpp
ggml/src/ggml-cpu/ggml-cpu.cpp
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-metal/ggml-metal.m
ggml/src/ggml-opencl/ggml-opencl.cpp
ggml/src/ggml-rpc/ggml-rpc.cpp
ggml/src/ggml-sycl/ggml-sycl.cpp
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-webgpu/ggml-webgpu.cpp
ggml/src/ggml-zdnn/ggml-zdnn.cpp