]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
ggml : use dynamic thread scheduling for matrix multiplication (llama/6915)
authorkunnis <redacted>
Wed, 15 May 2024 17:59:12 +0000 (12:59 -0500)
committerGeorgi Gerganov <redacted>
Sun, 16 Jun 2024 15:19:48 +0000 (18:19 +0300)
commit7178cceeaab0fa18cdeda4cb8f3b128b93f39237
treec5e28d64186c652a2246e91d13d555c985034eb3
parent8d55ccdb8cafe5a5e9b5b8ed5b6ce0e9a6a642af
ggml : use dynamic thread scheduling for matrix multiplication (llama/6915)

* Just reordering some structs.

* Adding in the calls to mm_pause

* Passing around the state

* Renaming and moving a bunch of variables around.

* Extracting the logic to it's own function.

* Moving some variable definitions into the chunk function.

* Moving some variables around

* moving src1_cont inside

* Moving row_size

* adding the current_chunk

* Reorg the code.

* Formatting to match the orig patch

* starting to setup the chunking variables

* Starting the buildup of the loop

* The yield shouldn't be necessary.

* adding the looping structure based on the chunk configuration.

* Add in the re-chunking code.

* Making it much more likely to rechunk.

* disable resizing if numa is enabled.

* Updating comments with what we've learned.

* Fix formatting

* Couple more formatting fixes.

* More style fixes.

* Fix Warnings

* Going with unused because there's conditional logic that needs it.

* Update ggml.c

* Update ggml.c

---------
ggml.c