git.djapps.eu Git - pkg/ggml/sources/ggml/commit
examples : sample replit + MPT inference (#145)
author Lukas Möller <redacted>
Wed, 17 May 2023 19:58:21 +0000 (21:58 +0200)
committer GitHub <redacted>
Wed, 17 May 2023 19:58:21 +0000 (22:58 +0300)
commit 1d6a133b098e3b038a7e2c544941b5288ca76bc8
tree 3816f06744b2225d8cc83033253058db01221820
parent 74705055853f7922e9622bdd0a1ebde2b8f57431

* Add replit model

* Add unigram tokenization support

* Remove debug log

* Port alibi attn bias fix

* Remove torch input

* Fix hardcoded path

* Remove unsupported hyperparams

* Add mpt

* Add replit quantization script

* Remove debug print

* Add quantization support to mpt

* Reformat

* Remove trailing return type

* Implement stylistic changes

* use f16 in k/v memory calculations for replit/mpt

* Update context size calculation

* Add clip_qkv and alibi_bias_max support

* fix clamping implementation, remove implicit conversions

* Fix qkv if condition

* Fix replit context size calculation

* Potentially fix gcc compilation error

* Fix warning

* Adjust object overhead

* Remove dead code
examples/CMakeLists.txt
examples/mpt/CMakeLists.txt [new file with mode: 0644]
examples/mpt/convert-h5-to-ggml.py [new file with mode: 0644]
examples/mpt/main.cpp [new file with mode: 0644]
examples/mpt/quantize.cpp [new file with mode: 0644]
examples/replit/CMakeLists.txt [new file with mode: 0644]
examples/replit/convert-h5-to-ggml.py [new file with mode: 0644]
examples/replit/main.cpp [new file with mode: 0644]
examples/replit/quantize.cpp [new file with mode: 0644]
include/ggml/ggml.h
src/ggml.c