]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
metal : copy kernels for quant to F32/F16 conversions (#12017)
authorGian-Carlo Pascutto <redacted>
Tue, 25 Feb 2025 09:27:58 +0000 (10:27 +0100)
committerGitHub <redacted>
Tue, 25 Feb 2025 09:27:58 +0000 (11:27 +0200)
commit58d07a8043a1395177cf77b3e4f388e34182ae64
treeeefe85e9f08f61ccd82662c3c456ed984c159b9a
parent34a846b5847a18d133b360074f1fb485b2632b2d
metal : copy kernels for quant to F32/F16 conversions (#12017)

metal: use dequantize_q templates

---------

Co-authored-by: Georgi Gerganov <redacted>
ggml/src/ggml-metal/ggml-metal.m
ggml/src/ggml-metal/ggml-metal.metal