git.djapps.eu Git - pkg/ggml/sources/ggml/log

]> git.djapps.eu Git - pkg/ggml/sources/ggml/log

overview / pkg / ggml / sources / ggml / log

commit | commitdiff | tree

Jiahao Li [Sun, 6 Aug 2023 07:22:25 +0000 (15:22 +0800)]

cmake : add avx compile options on Windows (synced from llama.cpp) (#435)

commit | commitdiff | tree

Borislav Stanimirov [Thu, 3 Aug 2023 08:03:49 +0000 (11:03 +0300)]

cmake : fix msvc cuda build - no need to specify stdlib (#428)

commit | commitdiff | tree

Borislav Stanimirov [Thu, 3 Aug 2023 08:03:24 +0000 (11:03 +0300)]

tests : fixed windows build (#426)

commit | commitdiff | tree

slaren [Tue, 1 Aug 2023 10:15:46 +0000 (12:15 +0200)]

Merge pull request #422 from ggerganov/ggml-map-custom-new

ggml : add ggml_map_custom1-3

commit | commitdiff | tree

slaren [Mon, 31 Jul 2023 18:48:52 +0000 (20:48 +0200)]

test-customop.c : more comments

commit | commitdiff | tree

slaren [Mon, 31 Jul 2023 09:48:44 +0000 (11:48 +0200)]

fix formatting

commit | commitdiff | tree

slaren [Sun, 30 Jul 2023 15:59:51 +0000 (17:59 +0200)]

ggml : add ggml_map_custom1-3

commit | commitdiff | tree

johnson442 [Wed, 26 Jul 2023 05:14:56 +0000 (06:14 +0100)]

starcoder : fix unused variable warnings (#416)

commit | commitdiff | tree

Georgi Gerganov [Tue, 25 Jul 2023 15:28:22 +0000 (18:28 +0300)]

ggml : sync llama.cpp (#415)

- faster graph build
- inference speed-ups across GPU backends
- activation functions relax constraints

ggml-ci

commit | commitdiff | tree

klosax [Tue, 25 Jul 2023 15:20:14 +0000 (17:20 +0200)]

readme : add chatglm example (#414)

commit | commitdiff | tree

Georgi Gerganov [Mon, 24 Jul 2023 11:24:53 +0000 (14:24 +0300)]

tests : remove unnecessary funcs

commit | commitdiff | tree

Borislav Stanimirov [Mon, 24 Jul 2023 10:46:22 +0000 (13:46 +0300)]

starcoder : fix windows build, fix cast warnings (#411)

commit | commitdiff | tree

Andrei [Sun, 23 Jul 2023 19:44:54 +0000 (15:44 -0400)]

cmake : add metal kernel to target resources when built as shared library (#403)

* Add GGML_METAL option to compile metal support.

* Add metal option

* Add metal sources

* Add current source directory

* Fix typo

commit | commitdiff | tree

Ivan Zdane [Sun, 23 Jul 2023 19:44:13 +0000 (15:44 -0400)]

ggml : refactor unary ops (#405)

* Add gitignore rule for temporary vim files

* ggml: refactor implementation of unary ops

* backends : adapt to ggml_unary_op

* ggml : fix assert number of ops

* ggml : hide ggml_set_unary_op from public API

---------

Co-authored-by: izdane <redacted>
Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

goerch [Sun, 23 Jul 2023 16:35:43 +0000 (18:35 +0200)]

ggml : add coverage measurement for Clang, increase test coverage, F16 ggml_sum (#377)

* First shot at adding clang/llvm coverage analysis

* Fix for compiler dependency

* Reducing dimensions in test-opt

* cmake : try to fix test coverage build + CI

* cmake : fix CMAKE option + CI

* Adding some tests for half precision floating point tests

* Adding missing tests for unary operations

* Some more tests for unary operations

* Fix syntax error.

* Fix bug in relu derivative computation

* Revert testing change

* ggml : style fixes

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Jiahao Li [Sun, 23 Jul 2023 15:17:06 +0000 (23:17 +0800)]

ggml : add get_no_alloc api corresponding to set_no_alloc (#402)

commit | commitdiff | tree

Cebtenzzre [Sun, 23 Jul 2023 15:16:04 +0000 (11:16 -0400)]

ggml : build with -Wmissing-prototypes, fix warnings (#398)

A few function declarations were missing from the API. This warning flag
helps find these problems.

commit | commitdiff | tree

Reza Rezvan [Sun, 23 Jul 2023 15:12:47 +0000 (17:12 +0200)]

common : fix param parsing (#391)

commit | commitdiff | tree

taher [Sun, 23 Jul 2023 14:54:27 +0000 (07:54 -0700)]

ggml : add vector scaling using Accelerate (#380)

* Added vector scaling using Accelerate

* added missing elif

commit | commitdiff | tree

Georgi Gerganov [Sun, 23 Jul 2023 14:51:29 +0000 (17:51 +0300)]

ggml : sync llama.cpp (#409)

* ggml : sync llama.cpp

ggml-ci

* ggml : fix nullptr derefs in backward

* ci : add mnist test, import/export graph

* add op_params to ggml_graph_export/import

ggml-ci

* mnist : export/import op_params for testing purposes

* mnist : fix f32 model generation test + instructions

ggml-ci

* ci : install python deps even for low-perf builds

ggml-ci

---------

Co-authored-by: Diego Devesa <redacted>

commit | commitdiff | tree

Tom Bailey [Sun, 23 Jul 2023 14:51:13 +0000 (23:51 +0900)]

readme : fix gpt-neox docs (#410)

* Fix gpt neox example directory reference

* Fix gpt neox bin directory references

commit | commitdiff | tree

fitzsim [Sun, 23 Jul 2023 13:46:57 +0000 (09:46 -0400)]

cmake : add PPC64 support (#392)

* cmake : add CMake support for ppc64

* readme : mention ppc64 VSX support

commit | commitdiff | tree

Jiahao Li [Sun, 23 Jul 2023 13:43:09 +0000 (21:43 +0800)]

cmake : sync metal compile scripts from llama.cpp (#407)

commit | commitdiff | tree

Georgi Gerganov [Sat, 22 Jul 2023 10:37:55 +0000 (13:37 +0300)]

readme : add minigpt4 link

commit | commitdiff | tree

Georgi Gerganov [Sat, 22 Jul 2023 06:23:34 +0000 (09:23 +0300)]

ci : avoid hardcoded number of threads

commit | commitdiff | tree

Georgi Gerganov [Tue, 18 Jul 2023 11:40:53 +0000 (14:40 +0300)]

ci : update to be able to run locally

commit | commitdiff | tree

Georgi Gerganov [Tue, 18 Jul 2023 08:10:40 +0000 (11:10 +0300)]

ggml : sync llama.cpp (fix PERF + CUDA dup cont support)

commit | commitdiff | tree

Georgi Gerganov [Mon, 17 Jul 2023 14:17:07 +0000 (17:17 +0300)]

ci : disable wget progress

commit | commitdiff | tree

Georgi Gerganov [Mon, 17 Jul 2023 13:35:06 +0000 (16:35 +0300)]

ci : use models-mnt

commit | commitdiff | tree

Georgi Gerganov [Sun, 16 Jul 2023 17:55:06 +0000 (20:55 +0300)]

ci : integrate with ggml-org/ci (#393)

* ci : add initial scripts

* ci : remove setup.sh

* run : add deps

* run : fix

* ci : remove deps

* ci : ignore test-opt

* ci : fix ctest

* ci : induce error in ci-1

* ci : try to capture exit codes

* ci : try fix pipefail

* ci : output time of commands

* ci : disable test-opt for release as well

* ci : generate README.md

* ci : update README.md

* ci : fix header

* ci : utilize gg_printf

row 0
row 1 testing adafa

asdfdsa

* ci : move out the README.md header generation

row 0
row 1 dsfkdjs

adslfkaj ska
test

* ci : fix

row 0
row 1 dsfkdjs

adslfkaj ska
test

* ci : induce error

* ci : fix error

remove assert(false)

* ci : fix ctest summary

* ci : fix comment

minor

test test

* ci : fix gg_printf usage

* ci : switch ci-1 to Release

* ci : try to simplify

* ci : induce error

* Revert "ci : induce error"

This reverts commit a9cef1eeb174764a0a1eb5b13753a7637b10f9dd.

* ci : induce error

* ci : fix pipefail + status

* ci : try to fix pipefail

* ci : fix output

* ci : fix return codes

* ci : test

* ci : fix test

* ci : add gpt-2 ci

* ci : fix gpt-2 test

* ci : gpt-2 seed

* ci : fix checks

* ci : time gpt-2

* ci : fix gpt-2 output

* ci : try to fix duplicated output

* ci : try fix duplicated output

* ci : cat gpt-2 output

* ci : finally fix double output

* ci : try to add "set -x"

* ci : fix model var

* ci : append logs

* ci : add mpt

* ci : fix model dir creation

* ci : fix mpt convert

* ci : add mpt config.json

* ci : add configuration_mpt.py

* ci : don't run test-opt on low perf systems

* ci : do not run test-opt in Debug

commit | commitdiff | tree

Georgi Gerganov [Sat, 15 Jul 2023 11:31:18 +0000 (14:31 +0300)]

ggml : sync llama.cpp (CUDA add/mul bcast + Metal fix + custom RoPE)

commit | commitdiff | tree

johnson442 [Fri, 14 Jul 2023 17:58:51 +0000 (18:58 +0100)]

starcoder : mmap (and gpu) example (#338)

* Add basic mmap & GPU offload starcoder example

* starcode-mmap : adapt to new ggml API

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Johannes Gäßler [Fri, 14 Jul 2023 17:55:34 +0000 (19:55 +0200)]

cuda : fix CUDA arch: OFF -> 52;61 (#389)

commit | commitdiff | tree

Georgi Gerganov [Fri, 14 Jul 2023 13:58:10 +0000 (16:58 +0300)]

ggml : sync llama.cpp (fix CUDA build, faster Metal)

close #379

commit | commitdiff | tree

Jiahao Li [Fri, 14 Jul 2023 12:10:59 +0000 (20:10 +0800)]

cuda : support GLM-style RoPE (#383)

commit | commitdiff | tree

Georgi Gerganov [Fri, 14 Jul 2023 12:02:12 +0000 (15:02 +0300)]

ggml : retire custom ggml_2d impl in favor of general (#352)

commit | commitdiff | tree

David Miller [Fri, 14 Jul 2023 11:38:47 +0000 (04:38 -0700)]

ggml : general conv_2d CPU implementation (#352)

* Conv2d s0 == s1 == 1, d0 == d1 == 1, variable padding

* Mark unused varibles

* Support variable strides

* Handle all non-kernel-width convolutions with same general conv2d

* General 2d Conv

* Remove old function

* Group functions

* Rearrange

* General Conv2d implementation

* Clean up whitespace

commit | commitdiff | tree

Georgi Gerganov [Fri, 14 Jul 2023 08:13:55 +0000 (11:13 +0300)]

ggml : fix mul_mat src1 indexing when not contiguous and non-F32 (#386)

commit | commitdiff | tree

Georgi Gerganov [Fri, 14 Jul 2023 08:03:55 +0000 (11:03 +0300)]

ggml : fix mul_mat src1 indexing when src1 is not contiguous (#386)

commit | commitdiff | tree

Georgi Gerganov [Wed, 12 Jul 2023 17:52:37 +0000 (20:52 +0300)]

ggml : apply mul_mat broadcast fix (sync llama.cpp)

commit | commitdiff | tree

Georgi Gerganov [Wed, 12 Jul 2023 17:38:49 +0000 (20:38 +0300)]

cuda : sync llama.cpp

commit | commitdiff | tree

M. Yusuf Sarıgöz [Wed, 12 Jul 2023 17:39:14 +0000 (20:39 +0300)]

ggml : broadcast mul_mat + conv batch support (#325)

* ggml : make src0 broadcast-able into src1 for ggml_mul_mat

* WIP: batch inference support for conv_2d

* Impl batch inference support cov_2d

* Correct offset in Conv2D

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Georgi Gerganov [Wed, 12 Jul 2023 11:18:20 +0000 (14:18 +0300)]

ggml : pool naming changes + coding style fixes

commit | commitdiff | tree

Jiahao Li [Wed, 12 Jul 2023 10:52:47 +0000 (18:52 +0800)]

ggml : add CUDA gelu op (#374)

commit | commitdiff | tree

Borislav Stanimirov [Wed, 12 Jul 2023 10:43:30 +0000 (13:43 +0300)]

ggml : basic implementation of 1d and 2d pools (#375)

pools and tests

no lenet

commit | commitdiff | tree

Georgi Gerganov [Wed, 12 Jul 2023 08:39:10 +0000 (11:39 +0300)]

ggml : revert CUDA broadcasting changes until fixed

commit | commitdiff | tree

Georgi Gerganov [Tue, 11 Jul 2023 19:19:42 +0000 (22:19 +0300)]

ggml : remove unused function (#328)

commit | commitdiff | tree

Georgi Gerganov [Tue, 11 Jul 2023 19:18:48 +0000 (22:18 +0300)]

ggml : fix compile warnings after #328

commit | commitdiff | tree

Arjun [Tue, 11 Jul 2023 19:11:45 +0000 (00:41 +0530)]

ggml : add callback to abort ggml_graph_compute() (#328)

* mechanism to abort ggml_graph_compute

* use pthread_cancel

* forgot to commit ggml.h

* static always_false()

Co-authored-by: Georgi Gerganov <redacted>
* accept callback data

* proper function prototype

* return exit status

* remove pthread_cancel and join every thread

* put abort_callback onto cplan

* cplan abort_callback in ggml.c

* make sure all threads abort

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Jiahao Li [Tue, 11 Jul 2023 18:12:57 +0000 (02:12 +0800)]

ggml : use a single kernel for CUDA mul op (#373)

commit | commitdiff | tree

Borislav Stanimirov [Tue, 11 Jul 2023 18:10:40 +0000 (21:10 +0300)]

tests : ifdef for #pragma GCC (#370)

commit | commitdiff | tree

Georgi Gerganov [Tue, 11 Jul 2023 18:10:14 +0000 (21:10 +0300)]

readme : add link to ggllm.cpp repo (close #361)

commit | commitdiff | tree

Jiahao Li [Tue, 11 Jul 2023 18:06:05 +0000 (02:06 +0800)]

ggml : broadcast ggml_add() for F32 (#359)

* Support broadcast add for fp32

* Use single kernel for broadcast add

commit | commitdiff | tree

Jiahao Li [Tue, 11 Jul 2023 17:58:02 +0000 (01:58 +0800)]

ggml : support norm op on CUDA (#364)

commit | commitdiff | tree

Tom Jobbins [Tue, 11 Jul 2023 17:54:36 +0000 (18:54 +0100)]

starcoder : fix for ggml-model.bin being saved in wrong directory + use argparse (#363)

* Fix for starcoder ggml-model.bin being saved in wrong directory. Modernise by using argparse.

* Make sure output directory exists

commit | commitdiff | tree

goerch [Tue, 11 Jul 2023 17:39:54 +0000 (19:39 +0200)]

ggml : fix ggml_set_xxx (#354)

`ggml_set_i32`/`ggml_set_f32` are in line with `ggml_set_i32_1d`/`ggml_set_f32_1d` then.

commit | commitdiff | tree

Daulet Zhanguzin [Tue, 11 Jul 2023 17:26:22 +0000 (10:26 -0700)]

ggml : fix Alibi implementation (#351)

* correct Alibi implementation

* update f16 too

commit | commitdiff | tree

Georgi Gerganov [Tue, 11 Jul 2023 16:36:52 +0000 (19:36 +0300)]

ggml : sync llama.cpp (fix for #341)

commit | commitdiff | tree

Georgi Gerganov [Mon, 10 Jul 2023 19:05:13 +0000 (22:05 +0300)]

ggml : fix docs about element access (close #348)

commit | commitdiff | tree

the-crypt-keeper [Mon, 10 Jul 2023 18:41:58 +0000 (14:41 -0400)]

starcoder : add <|end_of_turn|> token handling in order to support openchat/opencoderplus (#343)

* Add <|end_of_turn|> token handling to support openchat/opencoderplus

* The opencoder EOT occurs inside the prompt, so we should only break if the model actually generated it

---------

Co-authored-by: Mike <redacted>

commit | commitdiff | tree

Sam Spilsbury [Mon, 10 Jul 2023 18:40:29 +0000 (21:40 +0300)]

pkg-config : fix typo in includedir (#367)

commit | commitdiff | tree

Georgi Gerganov [Mon, 10 Jul 2023 18:40:05 +0000 (21:40 +0300)]

ggml : sync llama.cpp (changes to ggml_graph_compute() API) (#368)

commit | commitdiff | tree

Georgi Gerganov [Thu, 6 Jul 2023 16:41:18 +0000 (19:41 +0300)]

ggml : minor indentation

commit | commitdiff | tree

Borislav Stanimirov [Thu, 6 Jul 2023 07:24:39 +0000 (10:24 +0300)]

ggml : restore GGML_RESTRICT (#350)

commit | commitdiff | tree

Georgi Gerganov [Wed, 5 Jul 2023 17:38:55 +0000 (20:38 +0300)]

ggml : fix bug introduced in bfc6d42f8c2141383e4f21e4a030688c71560da0

commit | commitdiff | tree

Georgi Gerganov [Wed, 5 Jul 2023 17:38:20 +0000 (20:38 +0300)]

tests : sync from llama.cpp and disable some obsolete tests

commit | commitdiff | tree

Georgi Gerganov [Wed, 5 Jul 2023 17:14:13 +0000 (20:14 +0300)]

ggml : sync llama.cpp (generalize quantize_fns + CUDA improvements)

commit | commitdiff | tree

Andrei [Tue, 4 Jul 2023 19:53:42 +0000 (15:53 -0400)]

cmake : fix public header path for submodules (#342)

commit | commitdiff | tree

Georgi Gerganov [Tue, 4 Jul 2023 17:27:19 +0000 (20:27 +0300)]

whisper : fix wrong variable name from previous commit

commit | commitdiff | tree

Sam Spilsbury [Tue, 4 Jul 2023 17:35:13 +0000 (20:35 +0300)]

build : add pkg-config file (#335)

This makes it easier for other library consumers to find
the library and link to it.

Fixes #334

commit | commitdiff | tree

Sam Spilsbury [Tue, 4 Jul 2023 17:34:28 +0000 (20:34 +0300)]

cmake : install the header file to ggml/ggml.h (#333)

Fixes #332

commit | commitdiff | tree

Georgi Gerganov [Tue, 4 Jul 2023 17:24:22 +0000 (20:24 +0300)]

whisper : sync whisper.cpp (tinydiarize + OpenVINO)

commit | commitdiff | tree

Sam Spilsbury [Tue, 4 Jul 2023 13:30:21 +0000 (16:30 +0300)]

readme : add link to ggml-gobject (#336)

This enables also some bindings to python (through pygi), gjs, vala, csharp, etc. However `ggml-gobject`s main purpose is to make the library a bit more friendly to the desktop platform, eg, by providing asynchronous operation, a DBus service, etc.

commit | commitdiff | tree

Jakob Frick [Tue, 4 Jul 2023 13:26:57 +0000 (14:26 +0100)]

dolly : update error print behavior (#337)

commit | commitdiff | tree

Borislav Stanimirov [Tue, 4 Jul 2023 13:26:29 +0000 (16:26 +0300)]

dolly : disable interactive_port on Windows (#339)

commit | commitdiff | tree

Jakob Frick [Sun, 2 Jul 2023 18:48:02 +0000 (14:48 -0400)]

dolly : add interactive prompt and port mode (#319)

* update basic function to execute prompt

* try to factor our prediciton loop

* update code

* update prompt things

* only render at the end

* add basic server port

* refactor

* fix client file descriptor

* undo common.h style changes

* undo sytle changes to main.cpp

* fix check for interactive port

commit | commitdiff | tree

Georgi Gerganov [Sun, 2 Jul 2023 18:41:23 +0000 (21:41 +0300)]

examples : remove whitespace

commit | commitdiff | tree

Hirochika Matsumoto [Sun, 2 Jul 2023 16:47:47 +0000 (01:47 +0900)]

examples : use GGML_FILE_MAGIC where possible (#323)

commit | commitdiff | tree

sjinzh [Sun, 2 Jul 2023 16:36:53 +0000 (00:36 +0800)]

zig : add tests codes using zig (#315)

* update build.zig

* zig : add tests by zig

* zig : add tests codes using zig

* zig : add tests codes using zig

commit | commitdiff | tree

Hugo Rosenkranz-Costa [Sun, 2 Jul 2023 16:05:24 +0000 (18:05 +0200)]

mpt : convert model weights part by part to save memory (#314)

* mpt : update conversion script to load model weights part by part

* mpt : add usage README

commit | commitdiff | tree

Borislav Stanimirov [Sun, 2 Jul 2023 15:54:16 +0000 (18:54 +0300)]

ggml : generalize interface for 1d and 2d convolutions (#313)

* conv_1d wip

* conv_1d opt

* conv_1d done

* conv_1 improve alias func name

* conv_2d wip

* conv size to separate func

* conv2d done

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

Georgi Gerganov [Sun, 2 Jul 2023 15:53:42 +0000 (18:53 +0300)]

ggml : disable ggml_rope_back for ChatGLM

commit | commitdiff | tree

Georgi Gerganov [Sun, 2 Jul 2023 15:33:41 +0000 (18:33 +0300)]

ggml : remove tensor ptr from export for now (close #267)

Not used for now

commit | commitdiff | tree

Georgi Gerganov [Sun, 2 Jul 2023 15:26:26 +0000 (18:26 +0300)]

ggml : fix enum order for TANH (#316)

commit | commitdiff | tree

PAB [Sun, 2 Jul 2023 15:25:37 +0000 (17:25 +0200)]

ggml : add `ELU`, `TANH`, `ARGMAX` (#316)

* add: `elu` activation

* add: `tanh` activation

* add: `argmax`

* ggml : rearrange ops - put "tanh" after "step"

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

goerch [Sun, 2 Jul 2023 15:13:23 +0000 (17:13 +0200)]

ggml : add GGML_TENSOR_LOCALS helper macros (#309)

* [WIP] ref #292

* Further code reduction

* ggml : minor style fixes

* ggml : hide op locals in source file

---------

Co-authored-by: Georgi Gerganov <redacted>

commit | commitdiff | tree

the-crypt-keeper [Sun, 2 Jul 2023 14:52:52 +0000 (10:52 -0400)]

starcoder : add repeat penalty (#311)

* implement repeat penalty processing for starcoder

* show effective parameters at starcoder startup

---------

Co-authored-by: Mike Ravkine <redacted>

commit | commitdiff | tree

Georgi Gerganov [Sun, 2 Jul 2023 14:33:57 +0000 (17:33 +0300)]

ggml : sync latest llama.cpp (ggml_task_type changes + GPU backends)

commit | commitdiff | tree

sjinzh [Mon, 26 Jun 2023 20:48:31 +0000 (04:48 +0800)]

zig : add tests by zig (#307)

* update build.zig

* zig : add tests by zig

commit | commitdiff | tree

Jiahao Li [Mon, 26 Jun 2023 20:47:31 +0000 (04:47 +0800)]

ggml : support ChatGLM-style RoPE (#305)

commit | commitdiff | tree

Georgi Gerganov [Mon, 26 Jun 2023 20:26:37 +0000 (23:26 +0300)]

ggml : increase max name size to 48