]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/log
pkg/ggml/sources/llama.cpp
2 years agoCMake build in Release by default (#75)
Georgi Gerganov [Mon, 13 Mar 2023 19:22:15 +0000 (21:22 +0200)]
CMake build in Release by default (#75)

2 years agoUpdate contribution section, hot topics, limitations, etc.
Georgi Gerganov [Mon, 13 Mar 2023 17:21:51 +0000 (19:21 +0200)]
Update contribution section, hot topics, limitations, etc.

2 years agoPrint system information
Georgi Gerganov [Mon, 13 Mar 2023 17:15:08 +0000 (19:15 +0200)]
Print system information

2 years agoInitial support for CMake (#75)
Sebastián A [Mon, 13 Mar 2023 17:12:33 +0000 (14:12 -0300)]
Initial support for CMake (#75)

2 years agoAdd NetBSD support. (#90)
Thomas Klausner [Mon, 13 Mar 2023 16:40:54 +0000 (17:40 +0100)]
Add NetBSD support. (#90)

2 years agoUse fprintf for diagnostic output (#48)
Pavol Rusnak [Mon, 13 Mar 2023 16:39:56 +0000 (17:39 +0100)]
Use fprintf for diagnostic output (#48)

keep printf only for printing model output

one can now use ./main ... 2>dev/null to suppress any diagnostic output

2 years agoUse vdotq_s32 to improve performance (#67)
Georgi Gerganov [Mon, 13 Mar 2023 16:36:44 +0000 (18:36 +0200)]
Use vdotq_s32 to improve performance (#67)

* 10% performance boost on ARM

* Back to original change

2 years agoReduce model loading time (#43)
uint256_t [Mon, 13 Mar 2023 16:33:43 +0000 (01:33 +0900)]
Reduce model loading time (#43)

* Use buffering

* Use vector

* Minor

---------

Co-authored-by: Georgi Gerganov <redacted>
2 years agoFix UTF-8 handling (including colors) (#79)
Val Kharitonov [Mon, 13 Mar 2023 16:24:18 +0000 (12:24 -0400)]
Fix UTF-8 handling (including colors) (#79)

2 years agoAdd quantize script for batch quantization (#92)
Pavol Rusnak [Mon, 13 Mar 2023 16:15:20 +0000 (17:15 +0100)]
Add quantize script for batch quantization (#92)

* Add quantize script for batch quantization

* Indentation

* README for new quantize.sh

* Fix script name

* Fix file list on Mac OS

---------

Co-authored-by: Georgi Gerganov <redacted>
2 years agoAdd initial contribution guidelines
Georgi Gerganov [Mon, 13 Mar 2023 07:42:26 +0000 (09:42 +0200)]
Add initial contribution guidelines

2 years agoGate signal support on being on a unixoid system. (#74)
Matvey Soloviev [Mon, 13 Mar 2023 03:08:01 +0000 (04:08 +0100)]
Gate signal support on being on a unixoid system. (#74)

2 years agoFix token count accounting
Matvey Soloviev [Sun, 12 Mar 2023 23:35:51 +0000 (00:35 +0100)]
Fix token count accounting

2 years agoRevert "10% performance boost on ARM"
Georgi Gerganov [Sun, 12 Mar 2023 23:28:08 +0000 (01:28 +0200)]
Revert "10% performance boost on ARM"

This reverts commit 113a9e83ebc0f788f861394437087bf3ca0e019b.

There are some reports for illegal instruction.
Moved this stuff to vdotq_s32 branch until resolve

2 years agoCheck for vdotq_s32 availability
Georgi Gerganov [Sun, 12 Mar 2023 23:21:03 +0000 (01:21 +0200)]
Check for vdotq_s32 availability

2 years agoAmmend to previous commit - forgot to update non-QRDMX branch
Georgi Gerganov [Sun, 12 Mar 2023 23:05:24 +0000 (01:05 +0200)]
Ammend to previous commit - forgot to update non-QRDMX branch

2 years ago10% performance boost on ARM
Georgi Gerganov [Sun, 12 Mar 2023 22:56:10 +0000 (00:56 +0200)]
10% performance boost on ARM

2 years agoFix color getting reset before prompt output done (#65)
Matvey Soloviev [Sun, 12 Mar 2023 22:07:34 +0000 (23:07 +0100)]
Fix color getting reset before prompt output done (#65)

(cherry picked from commit 7eb2987619feee04c40eff69b604017d09919cb6)

2 years agoUpdate README.md
Georgi Gerganov [Sun, 12 Mar 2023 21:39:01 +0000 (23:39 +0200)]
Update README.md

2 years agoAdd interactive mode (#61)
Matvey Soloviev [Sun, 12 Mar 2023 21:13:28 +0000 (22:13 +0100)]
Add interactive mode (#61)

* Initial work on interactive mode.

* Improve interactive mode. Make rev. prompt optional.

* Update README to explain interactive mode.

* Fix OS X build

2 years agoFix typo in README (#45)
Marc Köhlbrugge [Sun, 12 Mar 2023 20:30:08 +0000 (03:30 +0700)]
Fix typo in README (#45)

2 years agoAllow using prompt files (#59)
Ben Garney [Sun, 12 Mar 2023 20:28:36 +0000 (13:28 -0700)]
Allow using prompt files (#59)

2 years agoAdd back top_k (#56)
beiller [Sun, 12 Mar 2023 20:23:15 +0000 (16:23 -0400)]
Add back top_k (#56)

* Add back top_k

* Update utils.cpp

* Update utils.h

---------

Co-authored-by: Bill Hamilton <redacted>
Co-authored-by: Georgi Gerganov <redacted>
2 years agoWindows fixes (#31)
Sebastián A [Sun, 12 Mar 2023 20:15:00 +0000 (17:15 -0300)]
Windows fixes (#31)

* Apply fixes suggested to build on windows

Issue: https://github.com/ggerganov/llama.cpp/issues/22

* Remove unsupported VLAs

* MSVC: Remove features that are only available on MSVC C++20.

* Fix zero initialization of the other fields.

* Change the use of vector for stack allocations.

2 years agoUpdate README.md
Georgi Gerganov [Sun, 12 Mar 2023 20:09:26 +0000 (22:09 +0200)]
Update README.md

2 years agoAdd CI (#60)
Georgi Gerganov [Sun, 12 Mar 2023 20:08:24 +0000 (22:08 +0200)]
Add CI (#60)

2 years agoRevert "weights_only" arg - this causing more trouble than help
Georgi Gerganov [Sun, 12 Mar 2023 18:59:01 +0000 (20:59 +0200)]
Revert "weights_only" arg - this causing more trouble than help

2 years agopython/pytorch compat notes (#44)
Oleksandr Nikitin [Sun, 12 Mar 2023 12:16:33 +0000 (14:16 +0200)]
python/pytorch compat notes (#44)

2 years agoAdd repetition penalty (#20)
beiller [Sun, 12 Mar 2023 09:27:42 +0000 (05:27 -0400)]
Add repetition penalty (#20)

* Adding repeat penalization

* Update utils.h

* Update utils.cpp

* Numeric fix

Should probably still scale by temp even if penalized

* Update comments, more proper application

I see that numbers can go negative so a fix from a referenced commit

* Minor formatting

---------

Co-authored-by: Georgi Gerganov <redacted>
2 years agoClarify meaning of hacking
Georgi Gerganov [Sun, 12 Mar 2023 07:03:25 +0000 (09:03 +0200)]
Clarify meaning of hacking

2 years agoREADME: add "Supported platforms" + update hot topics
Georgi Gerganov [Sun, 12 Mar 2023 06:41:54 +0000 (08:41 +0200)]
README: add "Supported platforms" + update hot topics

2 years agouse weights_only in conversion script (#32)
deepdiffuser [Sun, 12 Mar 2023 06:36:35 +0000 (22:36 -0800)]
use weights_only in conversion script (#32)

this restricts malicious weights from executing arbitrary code by restricting the unpickler to only loading tensors, primitive types, and dictionaries

2 years agoAdd LICENSE (#21)
Pavol Rusnak [Sun, 12 Mar 2023 06:36:03 +0000 (07:36 +0100)]
Add LICENSE (#21)

2 years agoUpdate README.md
Georgi Gerganov [Sat, 11 Mar 2023 23:26:32 +0000 (01:26 +0200)]
Update README.md

2 years agoFix a typo in model name (#16)
Juraj Bednar [Sat, 11 Mar 2023 17:32:20 +0000 (18:32 +0100)]
Fix a typo in model name (#16)

2 years agoUpdate README.md
Georgi Gerganov [Sat, 11 Mar 2023 16:10:18 +0000 (18:10 +0200)]
Update README.md

2 years agoAdd AVX2 support for x86 architectures thanks to @Const-me !
Georgi Gerganov [Sat, 11 Mar 2023 15:58:18 +0000 (17:58 +0200)]
Add AVX2 support for x86 architectures thanks to @Const-me !

2 years agoFix un-initialized FP16 tables on x86 (#15, #2)
Georgi Gerganov [Sat, 11 Mar 2023 15:40:14 +0000 (17:40 +0200)]
Fix un-initialized FP16 tables on x86 (#15, #2)

2 years agoBump memory buffer
Georgi Gerganov [Sat, 11 Mar 2023 10:44:21 +0000 (12:44 +0200)]
Bump memory buffer

2 years agoUpdate README.md
Georgi Gerganov [Sat, 11 Mar 2023 10:31:21 +0000 (12:31 +0200)]
Update README.md

2 years ago.gitignore models/
Georgi Gerganov [Sat, 11 Mar 2023 10:26:46 +0000 (12:26 +0200)]
.gitignore models/

2 years agoUpdate Makefile var + add comment
Georgi Gerganov [Sat, 11 Mar 2023 10:26:16 +0000 (12:26 +0200)]
Update Makefile var + add comment

2 years agoUpdate README.md
Georgi Gerganov [Sat, 11 Mar 2023 09:34:25 +0000 (11:34 +0200)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Sat, 11 Mar 2023 09:34:11 +0000 (11:34 +0200)]
Update README.md

2 years agoSupport all LLaMA models + change Q4_0 quantization storage
Georgi Gerganov [Sat, 11 Mar 2023 08:47:09 +0000 (10:47 +0200)]
Support all LLaMA models + change Q4_0 quantization storage

2 years agoInclude Python dependencies in README (#6)
Simon Willison [Sat, 11 Mar 2023 05:47:26 +0000 (21:47 -0800)]
Include Python dependencies in README (#6)

2 years agoUpdate README.md
Georgi Gerganov [Fri, 10 Mar 2023 23:30:47 +0000 (01:30 +0200)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Fri, 10 Mar 2023 23:22:58 +0000 (01:22 +0200)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Fri, 10 Mar 2023 23:18:10 +0000 (01:18 +0200)]
Update README.md

2 years agoAdd missing headers for memcpy and assert (#3)
Jean-Michaël Celerier [Fri, 10 Mar 2023 23:04:06 +0000 (18:04 -0500)]
Add missing headers for memcpy and assert (#3)

2 years agoUpdate README.md
Georgi Gerganov [Fri, 10 Mar 2023 22:55:22 +0000 (00:55 +0200)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Fri, 10 Mar 2023 22:51:46 +0000 (00:51 +0200)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Fri, 10 Mar 2023 22:09:19 +0000 (00:09 +0200)]
Update README.md

2 years agoUpdate README.md
Georgi Gerganov [Fri, 10 Mar 2023 21:53:11 +0000 (23:53 +0200)]
Update README.md

2 years agoFix a bug in the rope calculation
Georgi Gerganov [Fri, 10 Mar 2023 21:46:39 +0000 (23:46 +0200)]
Fix a bug in the rope calculation

2 years agoUpdate README.md
Georgi Gerganov [Fri, 10 Mar 2023 19:52:27 +0000 (21:52 +0200)]
Update README.md

2 years agoFinal touches
Georgi Gerganov [Fri, 10 Mar 2023 19:50:46 +0000 (21:50 +0200)]
Final touches

2 years agoCreate README.md
Georgi Gerganov [Fri, 10 Mar 2023 19:47:46 +0000 (21:47 +0200)]
Create README.md

2 years agoInitial release
Georgi Gerganov [Fri, 10 Mar 2023 18:40:58 +0000 (20:40 +0200)]
Initial release