]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
ggml: fix cuda kernel launch configuration for k_compute_batched_ptrs to support...
authorleejet <redacted>
Sun, 26 Oct 2025 18:13:31 +0000 (02:13 +0800)
committerGeorgi Gerganov <redacted>
Sun, 9 Nov 2025 21:38:03 +0000 (23:38 +0200)
commit4f4246dcb4657db3ba220502b237cddd7d68cf4c
treea6d9180a249b8a50ca041847b0bd767bad2c09c3
parent9f75cc7eef7a985c89b95b8767a0337a5bed9750
ggml: fix cuda kernel launch configuration for k_compute_batched_ptrs to support large batch (llama/16744)

* fix k_compute_batched_ptrs

* add backend ops test

* Update ggml/src/ggml-cuda/ggml-cuda.cu

Co-authored-by: Johannes Gäßler <redacted>
* reduce the batch size

---------

Co-authored-by: Johannes Gäßler <redacted>
ggml/src/ggml-cuda/ggml-cuda.cu