git.djapps.eu Git - pkg/ggml/sources/llama.cpp/log
Marc Köhlbrugge [Sun, 12 Mar 2023 20:30:08 +0000 (03:30 +0700)]
Fix typo in README (#45)

Ben Garney [Sun, 12 Mar 2023 20:28:36 +0000 (13:28 -0700)]
Allow using prompt files (#59)

beiller [Sun, 12 Mar 2023 20:23:15 +0000 (16:23 -0400)]
Add back top_k (#56)

* Add back top_k

* Update utils.cpp

* Update utils.h

---------

Co-authored-by: Bill Hamilton <redacted>
Co-authored-by: Georgi Gerganov <redacted>

Sebastián A [Sun, 12 Mar 2023 20:15:00 +0000 (17:15 -0300)]
Windows fixes (#31)

* Apply fixes suggested to build on windows

Issue: https://github.com/ggerganov/llama.cpp/issues/22

* Remove unsupported VLAs

* MSVC: Remove features that are only available on MSVC C++20.

* Fix zero initialization of the other fields.

* Change the use of vector for stack allocations.

Georgi Gerganov [Sun, 12 Mar 2023 20:09:26 +0000 (22:09 +0200)]
Update README.md

Georgi Gerganov [Sun, 12 Mar 2023 20:08:24 +0000 (22:08 +0200)]
Add CI (#60)

Georgi Gerganov [Sun, 12 Mar 2023 18:59:01 +0000 (20:59 +0200)]
Revert "weights_only" arg - this causing more trouble than help

Oleksandr Nikitin [Sun, 12 Mar 2023 12:16:33 +0000 (14:16 +0200)]
python/pytorch compat notes (#44)

beiller [Sun, 12 Mar 2023 09:27:42 +0000 (05:27 -0400)]
Add repetition penalty (#20)

* Adding repeat penalization

* Update utils.h

* Update utils.cpp

* Numeric fix

Should probably still scale by temp even if penalized

* Update comments, more proper application

I see that numbers can go negative, so this applies a fix from a referenced commit.

* Minor formatting
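
The notes above (still scale by temperature when penalized, and handle negative values) can be illustrated with a small sketch. This is not necessarily the code from #20: the function name `penalized_logits`, its parameters, and the default values are hypothetical, and the sign handling is one common way to make the penalty always lower a repeated token's score.

```python
def penalized_logits(logits, recent_tokens, repeat_penalty=1.3, temp=0.8):
    """Return temperature-scaled logits with a repetition penalty applied.

    Hypothetical helper for illustration; names and defaults are assumptions.
    """
    scale = 1.0 / temp
    recent = set(recent_tokens)
    out = []
    for token_id, logit in enumerate(logits):
        # Temperature scaling is applied to every token, penalized or not.
        scaled = logit * scale
        if token_id in recent:
            # Dividing a negative logit by a penalty > 1 would raise it,
            # so negative values are multiplied instead.
            if scaled < 0.0:
                scaled *= repeat_penalty
            else:
                scaled /= repeat_penalty
        out.append(scaled)
    return out
```

With this shape, a recently seen token always ends up with a score no higher than it would have had without the penalty, whatever the sign of its logit.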

---------

Co-authored-by: Georgi Gerganov <redacted>

Georgi Gerganov [Sun, 12 Mar 2023 07:03:25 +0000 (09:03 +0200)]
Clarify meaning of hacking

Georgi Gerganov [Sun, 12 Mar 2023 06:41:54 +0000 (08:41 +0200)]
README: add "Supported platforms" + update hot topics

deepdiffuser [Sun, 12 Mar 2023 06:36:35 +0000 (22:36 -0800)]
use weights_only in conversion script (#32)

This prevents malicious weights from executing arbitrary code by restricting the unpickler to loading only tensors, primitive types, and dictionaries.
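
The idea of a restricted unpickler can be sketched with the standard library alone. This is a simplified illustration, not PyTorch's implementation: `SAFE_BUILTINS`, `RestrictedUnpickler`, and `restricted_loads` are hypothetical names, and the allow-list here covers only a few builtin types rather than tensors.

```python
import builtins
import io
import pickle

# Hypothetical allow-list for illustration; a real loader would also permit
# the specific tensor-rebuilding globals it needs and nothing else.
SAFE_BUILTINS = {"dict", "list", "tuple", "set", "frozenset",
                 "int", "float", "str", "bytes", "bool"}


class RestrictedUnpickler(pickle.Unpickler):
    def find_class(self, module, name):
        # Called whenever the pickle stream references a global by name.
        # Anything outside the allow-list is rejected before it can be called.
        if module == "builtins" and name in SAFE_BUILTINS:
            return getattr(builtins, name)
        raise pickle.UnpicklingError(f"global '{module}.{name}' is forbidden")


def restricted_loads(data: bytes):
    """Unpickle `data`, refusing any global that is not allow-listed."""
    return RestrictedUnpickler(io.BytesIO(data)).load()
```

Plain containers and primitives load as usual, while a payload whose `__reduce__` points at a callable such as `eval` raises `pickle.UnpicklingError` instead of running it.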

Pavol Rusnak [Sun, 12 Mar 2023 06:36:03 +0000 (07:36 +0100)]
Add LICENSE (#21)

Georgi Gerganov [Sat, 11 Mar 2023 23:26:32 +0000 (01:26 +0200)]
Update README.md

Juraj Bednar [Sat, 11 Mar 2023 17:32:20 +0000 (18:32 +0100)]
Fix a typo in model name (#16)

Georgi Gerganov [Sat, 11 Mar 2023 16:10:18 +0000 (18:10 +0200)]
Update README.md

Georgi Gerganov [Sat, 11 Mar 2023 15:58:18 +0000 (17:58 +0200)]
Add AVX2 support for x86 architectures thanks to @Const-me !

Georgi Gerganov [Sat, 11 Mar 2023 15:40:14 +0000 (17:40 +0200)]
Fix un-initialized FP16 tables on x86 (#15, #2)

Georgi Gerganov [Sat, 11 Mar 2023 10:44:21 +0000 (12:44 +0200)]
Bump memory buffer

Georgi Gerganov [Sat, 11 Mar 2023 10:31:21 +0000 (12:31 +0200)]
Update README.md

Georgi Gerganov [Sat, 11 Mar 2023 10:26:46 +0000 (12:26 +0200)]
.gitignore models/

Georgi Gerganov [Sat, 11 Mar 2023 10:26:16 +0000 (12:26 +0200)]
Update Makefile var + add comment

Georgi Gerganov [Sat, 11 Mar 2023 09:34:25 +0000 (11:34 +0200)]
Update README.md

Georgi Gerganov [Sat, 11 Mar 2023 09:34:11 +0000 (11:34 +0200)]
Update README.md

Georgi Gerganov [Sat, 11 Mar 2023 08:47:09 +0000 (10:47 +0200)]
Support all LLaMA models + change Q4_0 quantization storage

Simon Willison [Sat, 11 Mar 2023 05:47:26 +0000 (21:47 -0800)]
Include Python dependencies in README (#6)

Georgi Gerganov [Fri, 10 Mar 2023 23:30:47 +0000 (01:30 +0200)]
Update README.md

Georgi Gerganov [Fri, 10 Mar 2023 23:22:58 +0000 (01:22 +0200)]
Update README.md

Georgi Gerganov [Fri, 10 Mar 2023 23:18:10 +0000 (01:18 +0200)]
Update README.md

Jean-Michaël Celerier [Fri, 10 Mar 2023 23:04:06 +0000 (18:04 -0500)]
Add missing headers for memcpy and assert (#3)

Georgi Gerganov [Fri, 10 Mar 2023 22:55:22 +0000 (00:55 +0200)]
Update README.md

Georgi Gerganov [Fri, 10 Mar 2023 22:51:46 +0000 (00:51 +0200)]
Update README.md

Georgi Gerganov [Fri, 10 Mar 2023 22:09:19 +0000 (00:09 +0200)]
Update README.md

Georgi Gerganov [Fri, 10 Mar 2023 21:53:11 +0000 (23:53 +0200)]
Update README.md

Georgi Gerganov [Fri, 10 Mar 2023 21:46:39 +0000 (23:46 +0200)]
Fix a bug in the rope calculation

Georgi Gerganov [Fri, 10 Mar 2023 19:52:27 +0000 (21:52 +0200)]
Update README.md

Georgi Gerganov [Fri, 10 Mar 2023 19:50:46 +0000 (21:50 +0200)]
Final touches

Georgi Gerganov [Fri, 10 Mar 2023 19:47:46 +0000 (21:47 +0200)]
Create README.md

Georgi Gerganov [Fri, 10 Mar 2023 18:40:58 +0000 (20:40 +0200)]
Initial release