]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154)
authorRémy O <redacted>
Thu, 6 Mar 2025 01:26:10 +0000 (02:26 +0100)
committerGeorgi Gerganov <redacted>
Fri, 7 Mar 2025 13:19:59 +0000 (15:19 +0200)
commit15ab85c92d308fc5f501f2e465750bd5472f0c84
tree970ea8c2db35520d68ba615760a23f307febe52d
parent5ce9f195e8d1706ecdd07499a0799d59df763ae7
ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions (llama/12154)

* ggml-cpu: Faster IQ1 mul_mat_vec on AVX2 using BMI2 instructions

* cmake: Add GGML_BMI2 build option

* ggml: enable BMI2 on relevant CPU variants

* ggml-cpu: include BMI2 in backend score

* ggml-cpu: register BMI2 in ggml_backend_cpu_get_features

* ggml-cpu: add __BMI2__ define when using MSVC
CMakeLists.txt
include/ggml-cpu.h
src/CMakeLists.txt
src/ggml-cpu/CMakeLists.txt
src/ggml-cpu/cpu-feats-x86.cpp
src/ggml-cpu/ggml-cpu-quants.c
src/ggml-cpu/ggml-cpu.c
src/ggml-cpu/ggml-cpu.cpp