]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
ggml-backend v2 : add ggml_backend_sched (#586)
authorslaren <redacted>
Mon, 30 Oct 2023 20:28:09 +0000 (21:28 +0100)
committerGitHub <redacted>
Mon, 30 Oct 2023 20:28:09 +0000 (22:28 +0200)
commit08d748b2de5cf6c44d3926a0ec5da0df9e187c91
treebf71cae67416ce06c571cac4da69cdf2e446f2e5
parent05ff36f1487080043bfad0b5656d98d46ebad28d
ggml-backend v2 : add ggml_backend_sched (#586)

* ggml-backend-v2 wip

* fix metal build

* ggml-alloc : use a real backend buffer in measure mode

* backend sched : ignore view ops to reduce the number of splits

* dynamic ggml_cgraph wip

* dyn graphs : remove n_tasks from ggml_cplan

* dyn graphs : update ggml_graph_import

* reset hash table in ggml_build_forward

* ggml-alloc : split into tensor and graph allocators

* add ggml_backend_sched_set_node_backend

* remove ggml_build_forward_ctx, ggml_build_backward_ctx
add ggml_opt_params::graph_size
add ggml_new_graph_custom, ggml_graph_overhead_custom
add ggml_graph_clear

* update examples and tests, fix issues

* update more examples

* update gpt-2/main-backend.cpp from master

* ggml : fix copmile warning

* ci : update yolo, fix mnist, use gpt-2-backend

* ggml : fix uninit warning

* ci : switch to gpt-2-backend2

ggml-ci

* metal : skip noops early to avoid warnings from ggml_metal_get_buffer

---------

Co-authored-by: Georgi Gerganov <redacted>
42 files changed:
ci/run.sh
examples/dolly-v2/main.cpp
examples/gpt-2/CMakeLists.txt
examples/gpt-2/main-alloc.cpp [new file with mode: 0644]
examples/gpt-2/main-backend.cpp [new file with mode: 0644]
examples/gpt-2/main-batched.cpp
examples/gpt-2/main-ctx.cpp [new file with mode: 0644]
examples/gpt-2/main.cpp
examples/gpt-j/main.cpp
examples/gpt-neox/main.cpp
examples/mnist/main-cnn.cpp
examples/mnist/main-cpu.cpp
examples/mnist/main-mtl.cpp
examples/mnist/main.cpp
examples/mpt/main.cpp
examples/replit/main.cpp
examples/sam/main.cpp
examples/starcoder/main.cpp
examples/starcoder/starcoder-mmap.cpp
examples/whisper/whisper.cpp
examples/yolo/yolov3-tiny.cpp
include/ggml/ggml-alloc.h
include/ggml/ggml-backend.h
include/ggml/ggml.h
src/CMakeLists.txt
src/ggml-alloc.c
src/ggml-backend-impl.h [new file with mode: 0644]
src/ggml-backend.c
src/ggml-cuda.cu
src/ggml-impl.h [new file with mode: 0644]
src/ggml-metal.m
src/ggml.c
tests/test-blas0.c
tests/test-conv-transpose.c
tests/test-customop.c
tests/test-grad0.cpp
tests/test-mul-mat0.c
tests/test-opt.cpp
tests/test-pool.c
tests/test-rel-pos.c
tests/test-xpos.c
tests/test1.c