git.djapps.eu Git - pkg/ggml/sources/ggml/commit
mpt : utf-8 support, perplexity testing, repeat penalty sampling (#184)
author klosax <redacted>
Wed, 24 May 2023 07:27:36 +0000 (09:27 +0200)
committer GitHub <redacted>
Wed, 24 May 2023 07:27:36 +0000 (10:27 +0300)
commit 9276285583ee322973064e8aaff0400eb7321604
tree c2666e9c4b9561635f69c734465b06477f094bcd
parent 6e064f69aa99e940108dbcfb3c18a3e5f0f964a5

* common: utf-8 decoder, reverted gpt_tokenize utf-8 convert

* Update common.h

* main: decode utf-8 tokens on load

* mpt import: bug fix

* common: style fixes

* common: style fix

* Update common.h

* common: revert gpt_tokenize utf-8 convert

* Update common.cpp

* Update common.cpp

* Update common.cpp

* Add perplexity to mpt

* Update CMakeLists: perplexity

* mpt-perplexity: fixes

* Update perplexity.cpp

* common: add sampling with repeat penalty

* mpt-main: add repeat penalty sampling, add commandline parameters

* Update common.h

* mpt-main: style fixes

* Update perplexity.cpp

* Delete perplexity.cpp

* mpt: move perplexity to main

* mpt: move perplexity to main

* common.cpp: Use codecvt utf-8 converter

* main.cpp: Use codecvt utf-8 converter

* mpt : code style changes

---------

Co-authored-by: Georgi Gerganov <redacted>
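
For readers unfamiliar with the perplexity testing mentioned in the message, the quantity is usually defined as the exponential of the mean negative log-likelihood of the evaluated tokens. The sketch below illustrates only that definition; the helper name `perplexity` and its input (the probability the model assigned to each actual next token) are assumptions for illustration and are not taken from this commit's code.

```cpp
#include <cmath>
#include <vector>

// Hypothetical helper, not the code from this commit.
// token_probs[i] is the probability the model gave to the i-th actual next token.
// Perplexity = exp( -(1/N) * sum_i log(p_i) ). Expects a non-empty input.
double perplexity(const std::vector<double> & token_probs) {
    double nll = 0.0; // accumulated negative log-likelihood
    for (double p : token_probs) {
        nll -= std::log(p);
    }
    return std::exp(nll / static_cast<double>(token_probs.size()));
}
```

A model that assigns probability 0.25 to every correct token has a perplexity of about 4 under this definition, so lower values mean the model found the text less surprising.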
examples/common.cpp
examples/common.h
examples/mpt/convert-h5-to-ggml.py
examples/mpt/main.cpp
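
The message says that sampling with a repeat penalty was added to the common code. As a rough illustration of the general technique, and not a copy of the function in `examples/common.cpp`, the sketch below rescales the logits of recently generated tokens before sampling. The name `apply_repeat_penalty`, its signature, and the choice to penalize each recent token once are all assumptions.

```cpp
#include <cstddef>
#include <cstdint>
#include <unordered_set>
#include <vector>

// Hypothetical helper, not the function added by this commit.
// Makes recently seen tokens less likely: positive logits are divided by the
// penalty and negative logits are multiplied by it, so both move downward
// when repeat_penalty > 1.
std::vector<float> apply_repeat_penalty(std::vector<float> logits,
                                        const std::vector<int32_t> & last_tokens,
                                        float repeat_penalty) {
    // Deduplicate so a token that occurs several times is penalized only once.
    const std::unordered_set<int32_t> seen(last_tokens.begin(), last_tokens.end());
    for (int32_t id : seen) {
        if (id < 0 || static_cast<std::size_t>(id) >= logits.size()) {
            continue; // ignore ids outside the vocabulary
        }
        if (logits[id] < 0.0f) {
            logits[id] *= repeat_penalty;
        } else {
            logits[id] /= repeat_penalty;
        }
    }
    return logits;
}
```

The adjusted logits would then go through the usual top-k / top-p filtering and softmax. A penalty of 1.0 leaves the logits unchanged, which gives a simple way to switch the feature off.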