]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
metal : single allocation of encode_async block (llama/9747)
authorPaul Tsochantaris <redacted>
Mon, 7 Oct 2024 12:26:31 +0000 (13:26 +0100)
committerGeorgi Gerganov <redacted>
Fri, 1 Nov 2024 08:19:05 +0000 (10:19 +0200)
commit80753d4da8cd161eb8d87bf8df90c777d537e906
tree883eb3f104771b651b2d842877b4893c6a7839df
parent8f9bdca4c4cfec309295e2c416c743c060c72ebf
metal : single allocation of encode_async block (llama/9747)

* Single allocation of encode_async block with non-ARC capture in ggml-metal.m

* Moving Block_release to the deallocation code

* Release encode block when re-setting encoding buffer count if needed

* Update ggml/src/ggml-metal.m

---------

Co-authored-by: Georgi Gerganov <redacted>
ggml/src/ggml-metal.m