git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit

author	Changyeon Kim <redacted>
	Tue, 29 Oct 2024 08:52:56 +0000 (17:52 +0900)
committer	Georgi Gerganov <redacted>
	Fri, 15 Nov 2024 13:21:04 +0000 (15:21 +0200)
commit	307712a90378c6b4641746b9a2c1f0f341c03ca1
tree	0b68a76c07bace02229346b2cf1c085656440a5b	tree
parent	fbc9a05ddf352f9a6e6cf53cea187fda87d64a57	commit \| diff

ggml: Add POOL2D OP for GPU acceleration to the Vulkan backend in the MobileVLM model. (llama/9763)

* ggml: Add POOL2D OP for GPU ACC to the Vulkan.

- The MobileVLM model now supports inference acceleration through GPU by utilizing the Vulkan backend.
- A GGML_OP_POOL_2D shader has been added. (Pooling)
- The encoding performance of the CLIP model improved from 2.8s on the CPU to 0.7s on the GPU.

Signed-off-by: Changyeon Kim <redacted>
* [fix] Correct the incorrect order of the parameters.

fix casting to int.

Signed-off-by: Changyeon Kim <redacted>
---------

Signed-off-by: Changyeon Kim <redacted>

ggml/src/ggml-vulkan.cpp		diff \| blob \| history
ggml/src/vulkan-shaders/pool2d.comp	[new file with mode: 0644]	blob
ggml/src/vulkan-shaders/vulkan-shaders-gen.cpp		diff \| blob \| history