]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
metal : optimize MoE for large batches (llama/13388)
authorGeorgi Gerganov <redacted>
Fri, 9 May 2025 12:14:56 +0000 (15:14 +0300)
committerGeorgi Gerganov <redacted>
Tue, 13 May 2025 10:02:19 +0000 (13:02 +0300)
commit878a093fe0cec5ed25abe80281b957fcf03ff6eb
tree119028abb0e3eb4cd7f164a1562d98541966f05b
parentbdfb7fc0c02b2be2abf05aaec8adc0c0f249b0a3
metal : optimize MoE for large batches (llama/13388)

ggml-ci
src/ggml-metal/ggml-metal-impl.h
src/ggml-metal/ggml-metal.m
src/ggml-metal/ggml-metal.metal
src/ggml.c