]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit
llama: Add support for RWKV v7 architecture (#12412)
authorMolly Sophia <redacted>
Mon, 17 Mar 2025 23:27:50 +0000 (07:27 +0800)
committerGitHub <redacted>
Mon, 17 Mar 2025 23:27:50 +0000 (07:27 +0800)
commit7dfad387e3f6ac98d383ded2d175eb59736a3993
tree5a29b2c9766b2001f89a2a4b99fff1a33dfd1cf4
parent60c902926c928f9c2cd6390ce411876f92feeaf3
llama: Add support for RWKV v7 architecture (#12412)

* ggml: Add op l2_norm

Signed-off-by: Molly Sophia <redacted>
* ggml: Add op rwkv_wkv7

Signed-off-by: Molly Sophia <redacted>
* llama: Add support for RWKV7 and ARWKV7 models

Signed-off-by: Molly Sophia <redacted>
* llama: fix inference with RWKV6Qwen2

Signed-off-by: Molly Sophia <redacted>
* llama: add more (a)rwkv7 variants in size

Signed-off-by: Molly Sophia <redacted>
* Apply code-format changes

Signed-off-by: Molly Sophia <redacted>
* fix MUSA build

Signed-off-by: Molly Sophia <redacted>
* llama: fix shape error with rwkv using llama-parallel

Signed-off-by: Molly Sophia <redacted>
---------

Signed-off-by: Molly Sophia <redacted>
36 files changed:
convert_hf_to_gguf.py
ggml/include/ggml.h
ggml/src/ggml-cpu/ggml-cpu.c
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/norm.cu
ggml/src/ggml-cuda/norm.cuh
ggml/src/ggml-cuda/wkv.cu [new file with mode: 0644]
ggml/src/ggml-cuda/wkv.cuh [new file with mode: 0644]
ggml/src/ggml-cuda/wkv6.cu [deleted file]
ggml/src/ggml-cuda/wkv6.cuh [deleted file]
ggml/src/ggml-metal/ggml-metal-impl.h
ggml/src/ggml-metal/ggml-metal.m
ggml/src/ggml-metal/ggml-metal.metal
ggml/src/ggml-sycl/backend.hpp
ggml/src/ggml-sycl/ggml-sycl.cpp
ggml/src/ggml-sycl/norm.cpp
ggml/src/ggml-sycl/norm.hpp
ggml/src/ggml-sycl/wkv.cpp [new file with mode: 0644]
ggml/src/ggml-sycl/wkv.hpp [new file with mode: 0644]
ggml/src/ggml-sycl/wkv6.cpp [deleted file]
ggml/src/ggml-sycl/wkv6.hpp [deleted file]
ggml/src/ggml-vulkan/ggml-vulkan.cpp
ggml/src/ggml-vulkan/vulkan-shaders/l2_norm.comp [new file with mode: 0644]
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp
ggml/src/ggml-vulkan/vulkan-shaders/wkv7.comp [new file with mode: 0644]
ggml/src/ggml.c
gguf-py/gguf/constants.py
gguf-py/gguf/gguf_writer.py
gguf-py/gguf/tensor_mapping.py
src/llama-arch.cpp
src/llama-arch.h
src/llama-hparams.h
src/llama-model.cpp
src/llama-model.h
src/llama-quant.cpp
tests/test-backend-ops.cpp