ggml : generalize `quantize_fns` for simpler FP16 handling (#1237)
author    Stephan Walter <redacted>
          Wed, 5 Jul 2023 16:13:06 +0000 (16:13 +0000)
committer GitHub <redacted>
          Wed, 5 Jul 2023 16:13:06 +0000 (19:13 +0300)
commit    1b107b8550dced48dc5f41184640061354226b96
tree      a09a4c33c865828cd753c19af71c580f98735be5
parent    8567c76b5326e862be0755a8dc1dd988223fcae3
ggml : generalize `quantize_fns` for simpler FP16 handling (#1237)

* Generalize quantize_fns for simpler FP16 handling

* Remove call to ggml_cuda_mul_mat_get_wsize

* ci : disable FMA for mac os actions

---------

Co-authored-by: Georgi Gerganov <redacted>
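
The sketch below illustrates the general idea behind "generalize quantize_fns for simpler FP16 handling": a per-type table of function pointers through which FP16 conversion is dispatched the same way as (de)quantization, so callers no longer need a special case for the F16 type. All names here (`type_traits`, `to_float`, `from_float`, `MY_TYPE_*`) are hypothetical stand-ins, not the actual ggml structs or functions, and the toy half-precision converters only handle normal numbers; real ggml uses its own `ggml_fp16_to_fp32`/`ggml_fp32_to_fp16` helpers or F16C intrinsics.

```c
/* Minimal sketch (assumed names, not the real ggml API): a per-type trait
 * table so FP16 is handled like any other convertible type. */
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>
#include <string.h>

enum my_type { MY_TYPE_F32, MY_TYPE_F16, MY_TYPE_COUNT };

typedef void (*to_float_fn)  (const void * src, float * dst, size_t n);
typedef void (*from_float_fn)(const float * src, void * dst, size_t n);

typedef struct {
    const char *  name;
    size_t        elem_size;   /* bytes per element (or per block for quantized types) */
    to_float_fn   to_float;
    from_float_fn from_float;
} type_traits;

/* F32 "conversion" is just a copy */
static void f32_to_float  (const void * src, float * dst, size_t n) { memcpy(dst, src, n*sizeof(float)); }
static void f32_from_float(const float * src, void * dst, size_t n) { memcpy(dst, src, n*sizeof(float)); }

/* toy FP16 converters via bit manipulation (normal numbers only) */
static float half_to_float(uint16_t h) {
    uint32_t sign = (uint32_t)(h >> 15) << 31;
    uint32_t exp  = (h >> 10) & 0x1f;
    uint32_t man  = h & 0x3ff;
    uint32_t bits = sign | ((exp + 112) << 23) | (man << 13);
    float f; memcpy(&f, &bits, sizeof(f)); return f;
}
static uint16_t float_to_half(float f) {
    uint32_t bits; memcpy(&bits, &f, sizeof(bits));
    uint16_t sign = (uint16_t)((bits >> 16) & 0x8000);
    int32_t  exp  = (int32_t)((bits >> 23) & 0xff) - 127 + 15;
    uint32_t man  = (bits >> 13) & 0x3ff;
    if (exp <= 0)  return sign;              /* flush tiny values to zero */
    if (exp >= 31) return sign | 0x7c00;     /* overflow to infinity      */
    return (uint16_t)(sign | (exp << 10) | man);
}
static void f16_to_float(const void * src, float * dst, size_t n) {
    const uint16_t * s = (const uint16_t *) src;
    for (size_t i = 0; i < n; ++i) dst[i] = half_to_float(s[i]);
}
static void f16_from_float(const float * src, void * dst, size_t n) {
    uint16_t * d = (uint16_t *) dst;
    for (size_t i = 0; i < n; ++i) d[i] = float_to_half(src[i]);
}

/* one table entry per type: callers dispatch through the table instead of
 * special-casing F16 */
static const type_traits traits[MY_TYPE_COUNT] = {
    [MY_TYPE_F32] = { "f32", sizeof(float),    f32_to_float, f32_from_float },
    [MY_TYPE_F16] = { "f16", sizeof(uint16_t), f16_to_float, f16_from_float },
};

int main(void) {
    float    in[4] = { 1.0f, -2.5f, 0.375f, 100.0f }, out[4];
    uint16_t packed[4];

    const type_traits * tt = &traits[MY_TYPE_F16];
    tt->from_float(in, packed, 4);   /* fp32 -> fp16 through the generic interface */
    tt->to_float(packed, out, 4);    /* fp16 -> fp32 through the same interface    */

    for (int i = 0; i < 4; ++i) printf("%s: %g -> %g\n", tt->name, in[i], out[i]);
    return 0;
}
```

A table like this is also why the separate workspace-size query for the CUDA path (`ggml_cuda_mul_mat_get_wsize`) could be dropped in this commit: once every source type exposes uniform conversion entry points, the generic dispatch code no longer needs a bespoke call for that case.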
.github/workflows/build.yml
examples/quantize-stats/quantize-stats.cpp
ggml.c
ggml.h
llama.cpp
pocs/vdot/q8dot.cpp
pocs/vdot/vdot.cpp
tests/test-quantize-fns.cpp
tests/test-quantize-perf.cpp