git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Daniel Bevenius <redacted>
	Tue, 13 Feb 2024 13:15:42 +0000 (14:15 +0100)
committer	GitHub <redacted>
	Tue, 13 Feb 2024 13:15:42 +0000 (15:15 +0200)
commit	263978904c7472db1865409a7ff1129599f6a40b
tree	9c6f6f7732f474c74a9a1eafe8b52bcd8936d221
parent	cf45252a7cfcb998bade46a886e20477cecc538a

finetune : rename feed-forward tensors (w1/w2/w3) (#4839)

* finetune: rename feed-forward tensors (w1/w2/w3)

This commit renames the feed-forward tensors w1, w2 and w3 to ffn_gate,
ffn_down and ffn_up respectively.

The motivation for this change is to make it easier to understand the
purpose of the tensors. This also seems to be in line with the names
used in the llama_layer struct in llama.cpp.

Signed-off-by: Daniel Bevenius <redacted>

* train-text-from-scratch: rename ff tensors

This commit renames the feed-forward tensors w1, w2 and w3 to ffn_gate,
ffn_down and ffn_up respectively.

The motivation for this change is to make it easier to understand the
purpose of the tensors. This also seems to be in line with the names
used in the llama_layer struct in llama.cpp.

Signed-off-by: Daniel Bevenius <redacted>
---------

Signed-off-by: Daniel Bevenius <redacted>
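
For readers unfamiliar with the old names, the following is a minimal sketch of what the three renamed tensors do, assuming the usual LLaMA-style gated feed-forward block (SiLU-gated, no biases). The `Matrix` and `FeedForward` types and the `matvec`, `silu` and `feed_forward` helpers are hypothetical and written only for illustration; they are not the code from finetune.cpp or llama.cpp, which build this computation out of ggml ops.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical row-major matrix (rows x cols), for illustration only.
struct Matrix {
    std::size_t rows;
    std::size_t cols;
    std::vector<float> data;
    float at(std::size_t r, std::size_t c) const { return data[r * cols + c]; }
};

// Hypothetical stand-in for the feed-forward tensors of one layer.
struct FeedForward {
    Matrix ffn_gate; // formerly w1, shape n_ff x n_embd
    Matrix ffn_down; // formerly w2, shape n_embd x n_ff
    Matrix ffn_up;   // formerly w3, shape n_ff x n_embd
};

// Plain matrix-vector product y = m * x.
static std::vector<float> matvec(const Matrix & m, const std::vector<float> & x) {
    std::vector<float> y(m.rows, 0.0f);
    for (std::size_t r = 0; r < m.rows; ++r) {
        for (std::size_t c = 0; c < m.cols; ++c) {
            y[r] += m.at(r, c) * x[c];
        }
    }
    return y;
}

// SiLU activation: v * sigmoid(v).
static float silu(float v) {
    return v / (1.0f + std::exp(-v));
}

// Assumed gated feed-forward: ffn_down * (silu(ffn_gate * x) * (ffn_up * x)).
static std::vector<float> feed_forward(const FeedForward & ff, const std::vector<float> & x) {
    std::vector<float> gate = matvec(ff.ffn_gate, x); // gating branch
    std::vector<float> up   = matvec(ff.ffn_up, x);   // projects up to n_ff
    for (std::size_t i = 0; i < gate.size(); ++i) {
        gate[i] = silu(gate[i]) * up[i];              // element-wise gating
    }
    return matvec(ff.ffn_down, gate);                 // projects back down to n_embd
}
```

Read this way, the new names describe the role of each tensor: ffn_gate controls how much of the ffn_up projection passes through, and ffn_down maps the result back to the embedding size.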

examples/finetune/README.md
examples/finetune/finetune.cpp
examples/train-text-from-scratch/train-text-from-scratch.cpp