]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
metal : optimize MoE for large batches (#13388)
authorGeorgi Gerganov <redacted>
Fri, 9 May 2025 12:14:56 +0000 (15:14 +0300)
committerGitHub <redacted>
Fri, 9 May 2025 12:14:56 +0000 (15:14 +0300)
commit611aa914ef4231fab5d1ad04773c42e119ae2d2e
tree48ae2088d0b54f3f95827010a978968a2a0ba094
parent0cf6725e9f9a164c39f7a87214d60342f7f946d8
metal : optimize MoE for large batches (#13388)

ggml-ci
ggml/src/ggml-metal/ggml-metal-impl.h
ggml/src/ggml-metal/ggml-metal.m
ggml/src/ggml-metal/ggml-metal.metal
ggml/src/ggml.c