sync : llama.cpp (ggml_scale, ggml_row_size, ggml_mul_mat_set_prec) (#662)
author Georgi Gerganov <redacted>
Fri, 22 Dec 2023 15:53:50 +0000 (17:53 +0200)
committer GitHub <redacted>
Fri, 22 Dec 2023 15:53:50 +0000 (17:53 +0200)
commit 845d01bab3a20a01889476c2b53e003517d06b77
tree 800820119e0b815abfacee0a4645cef3569fc263
parent c80e07e9d4724392aaf02cdf32d1a1fb7228bea9

* sync : llama.cpp (ggml_scale, ggml_row_size, ggml_mul_mat_set_prec)

ggml-ci

* ggml : add comment about backward GGML_OP_DIAG_MASK_INF (#4203)

* llama : fix platforms without mmap (#4578)

* llama : fix platforms without mmap

* win32 : limit prefetch size to the file size

* fix win32 error clobber, unnecessary std::string in std::runtime_error

* ggml-alloc : fix ggml_tallocr_is_own

* whisper : minor

* ggml : cuda jetson + arm quants warnings

ggml-ci

---------

Co-authored-by: Herman Semenov <redacted>
Co-authored-by: slaren <redacted>
33 files changed:
examples/dolly-v2/main.cpp
examples/gpt-2/main-alloc.cpp
examples/gpt-2/main-backend.cpp
examples/gpt-2/main-batched.cpp
examples/gpt-2/main-ctx.cpp
examples/gpt-2/main.cpp
examples/gpt-j/main.cpp
examples/gpt-neox/main.cpp
examples/mnist/main.cpp
examples/mpt/main.cpp
examples/replit/main.cpp
examples/sam/main.cpp
examples/starcoder/main.cpp
examples/starcoder/starcoder-mmap.cpp
examples/whisper/whisper.cpp
include/ggml/ggml-backend.h
include/ggml/ggml.h
src/ggml-alloc.c
src/ggml-backend-impl.h
src/ggml-backend.c
src/ggml-cuda.cu
src/ggml-metal.h
src/ggml-metal.m
src/ggml-metal.metal
src/ggml-quants.c
src/ggml.c
tests/test-backend-ops.cpp
tests/test-conv1d.cpp
tests/test-conv2d.cpp
tests/test-grad0.cpp
tests/test-mul-mat.cpp
tests/test-quantize-perf.cpp
tests/test0.c