git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	Eddie-Wang <redacted>
	Sun, 23 Jun 2024 18:27:57 +0000 (02:27 +0800)
committer	GitHub <redacted>
	Sun, 23 Jun 2024 18:27:57 +0000 (21:27 +0300)
commit	e112b610a1a75cb7fa8351e1a933e2e7a755a5ce
tree	e67c7781a37063f59a7ccd728492fe33c6373df0	tree
parent	6a2f298bd784403c5c33eebb822217ec5d9b5590	commit \| diff

llama : add support for BitnetForCausalLM (#7931)

* hf bitnet v1

* hf bitnet e2e v2

* finish bitnet e2e

* finish f16 hf bitnet e2e

* remove unsed

* finish bitnet i2 e2e

* move i2s to quantize v1

* move i2 to quantize

* clean code

* clean code 2

* fix codestyle

* fix code

* fix

* fix code

* fix merge

* remove unused

* change table name

* fix whitespace

* delete redundant

* i2_s to absmax

* finish i2_s/i8_s vec_dot x86 simd

* i2s->q22

* fix code

* remove block scale

* add dequantize

* fix seq

* update avx2

* remove q2_2

* remove q22_grid

* fix whitespace

* reuse llm_build_kv

* fix bo

---------

Co-authored-by: root <redacted>

convert-hf-to-gguf.py		diff \| blob \| history
gguf-py/gguf/constants.py		diff \| blob \| history
gguf-py/gguf/tensor_mapping.py		diff \| blob \| history
llama.cpp		diff \| blob \| history