]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA: use shared mem for ssm_conv (llama/20128)
authorAman Gupta <redacted>
Fri, 6 Mar 2026 15:09:59 +0000 (23:09 +0800)
committerGeorgi Gerganov <redacted>
Mon, 16 Mar 2026 11:10:15 +0000 (13:10 +0200)
commitd2d235f4679b3177e4c662d589d973550e06ab2c
tree6c8a9314d77b68fe33d65f8c35565303155bc773
parent596b655dbd8ebbd88de1a8857ebd5318c867ccad
CUDA: use shared mem for ssm_conv (llama/20128)

* CUDA: use shared mem for ssm_conv

* fuse silu + ssm_conv

* fuse unary + mul

* enable for fp16

* formatting

Co-authored-by: Johannes Gäßler <redacted>
---------

Co-authored-by: Johannes Gäßler <redacted>
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/ssm-conv.cu
ggml/src/ggml-cuda/ssm-conv.cuh
ggml/src/ggml-cuda/unary.cu
ggml/src/ggml-cuda/unary.cuh