]>
git.djapps.eu Git - pkg/ggml/sources/ggml/commit
examples : sample replit + MPT inference (#145)
* Add replit model
* Add unigram tokenization support
* Remove debug log
* Port alibi attn bias fix
* Remove torch input
* Fix hardcoded path
* Remove unsupported hyperparams
* Add mpt
* Add replit quantization script
* Remove debug print
* Add quantization support to mpt
* Reformat
* Remove trailing return type
* Implement stylistic changes
* use f16 in k/v memory calculations for replit/mpt
* Update context size calculation
* Add clip_qkv and alibi_bias_max support
* fix clamping implementation, remove implicit conversions
* Fix qkv if condition
* Fix replit context size calculation
* Potentially fix gcc compilation error
* Fix warning
* Adjust object overhead
* Remove dead code