git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B (#13386)

common/arg.cpp
common/common.cpp
common/common.h
ggml/include/ggml-backend.h
ggml/src/ggml-backend.cpp
include/llama.h
src/llama-context.cpp
src/llama-cparams.h
tests/test-opt.cpp
tools/llama-bench/llama-bench.cpp
tools/mtmd/clip.cpp

Packaging of ggml-org/llama.cpp