]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
ggml-cpu : add chunking support to mul_mat_id (llama/11666)
authorDiego Devesa <redacted>
Thu, 13 Feb 2025 00:02:38 +0000 (01:02 +0100)
committerGeorgi Gerganov <redacted>
Thu, 27 Feb 2025 06:55:36 +0000 (08:55 +0200)
commitd2c5154bb5aa1810ee8998fd5980f61e6f973bfe
tree9815310fd6235ce839b9ee003e33d75da1f9ab65
parent4fac43fe00c4734c9c5844be2d566fe4806646f2
ggml-cpu : add chunking support to mul_mat_id (llama/11666)

* ggml-cpu : add chunking support to mul_mat_id

* allocate chunk counter in wdata
parallelize src1 quantization by column to allows parallelization even when there is only one row

* disable for arm

* cleanup

* better way to disable for arm

* fix uninitialized counter when using 1 thread only

* revert test-backend-ops changes
ggml/src/ggml-cpu/ggml-cpu.c