]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
kompute : improve backend to pass test_backend_ops (llama/10542)
authorSergio López <redacted>
Thu, 28 Nov 2024 11:51:38 +0000 (12:51 +0100)
committerGeorgi Gerganov <redacted>
Sun, 8 Dec 2024 18:14:35 +0000 (20:14 +0200)
commit42099a9342077ce70545d6a17c298cd273bb486d
tree0924ccff5467bbe4542f6bfc257fc87c2391970b
parent90dd5fca9c00a89b57f175a9c4491886aac538dc
kompute : improve backend to pass test_backend_ops (llama/10542)

* kompute: op_unary: reject unsupported parameters

Signed-off-by: Sergio Lopez <redacted>
* kompute: softmax: implement ALiBi support

Signed-off-by: Sergio Lopez <redacted>
* kompute: rope: implement neox and phi3 support

Signed-off-by: Sergio Lopez <redacted>
* kompute: op_mul_mat_q4_k permutted support

Signed-off-by: Sergio Lopez <redacted>
* kompute: op_mul_mat_[q4_0|q4_1|q8_0] permutted support

Signed-off-by: Sergio Lopez <redacted>
* kompute: op_mul_mat_f16 permutted support

Signed-off-by: Sergio Lopez <redacted>
* kompute: op_mul_mat_q6_k permutted support

Signed-off-by: Sergio Lopez <redacted>
---------

Signed-off-by: Sergio Lopez <redacted>
16 files changed:
ggml/src/ggml-kompute/CMakeLists.txt
ggml/src/ggml-kompute/ggml-kompute.cpp
ggml/src/ggml-kompute/kompute-shaders/common.comp
ggml/src/ggml-kompute/kompute-shaders/op_mul_mat_f16.comp
ggml/src/ggml-kompute/kompute-shaders/op_mul_mat_q4_k.comp
ggml/src/ggml-kompute/kompute-shaders/op_mul_mat_q6_k.comp
ggml/src/ggml-kompute/kompute-shaders/op_mul_mv_q_n.comp
ggml/src/ggml-kompute/kompute-shaders/op_mul_mv_q_n_pre.comp
ggml/src/ggml-kompute/kompute-shaders/op_rope_f16.comp [deleted file]
ggml/src/ggml-kompute/kompute-shaders/op_rope_f32.comp [deleted file]
ggml/src/ggml-kompute/kompute-shaders/op_rope_neox_f16.comp [new file with mode: 0644]
ggml/src/ggml-kompute/kompute-shaders/op_rope_neox_f32.comp [new file with mode: 0644]
ggml/src/ggml-kompute/kompute-shaders/op_rope_norm_f16.comp [new file with mode: 0644]
ggml/src/ggml-kompute/kompute-shaders/op_rope_norm_f32.comp [new file with mode: 0644]
ggml/src/ggml-kompute/kompute-shaders/op_softmax.comp
ggml/src/ggml-kompute/kompute-shaders/rope_common.comp