git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2023-06-16  Kawrakow  CUDA : faster k-quant dot kernels (#1862)
2023-06-16  Borislav Stanimirov  gitignore : add several entries specific to Visual...
2023-06-15  Johannes Gäßler  Fixed CUDA runtime version check (#1879)
2023-06-15  Georgi Gerganov  cmake : remove whitespaces
2023-06-15  yangli2  examples : add chat-vicuna.sh (#1854)
2023-06-15  Igor Okulist  cmake : set include path for OpenBlas (#1830)
2023-06-15  Frederik Vogel  swift : Package compile breaks due to ggml-metal.metal...
2023-06-15  daboe01  make : add train-text-from-scratch (#1850)
2023-06-15  Srinivas Billa  readme : server compile flag (#1874)
2023-06-15  sandyiscool  make : clean *.so files (#1857)
2023-06-15  Howard Su  Fix the validation of main device (#1872)
2023-06-15  Georgi Gerganov  metal : parallel command buffer encoding (#1860)
2023-06-15  Johannes Gäßler  Better error when using both LoRA + GPU layers (#1861)
2023-06-14  Johannes Gäßler  CUDA full GPU acceleration, KV cache in VRAM (#1827)
2023-06-13  0xspringtime  baby-llama : fix operator!= (#1821)
2023-06-13  xaedes  train : improved training-from-scratch example (#1652)
2023-06-13  Georgi Gerganov  llama : do a warm-up eval at start for better timings...
2023-06-13  Kerfuffle  Allow "quantizing" to f16 and f32 (#1787)
2023-06-12  Kawrakow  Metal implementation for all k_quants (#1807)
2023-06-12  slaren  ci : run when changing only the CUDA sources (#1800)
2023-06-12  Howard Su  Leverage mmap for offloading tensors to GPU (#1597)
2023-06-12  Kawrakow  metal : fix failure to load model (#1817)
2023-06-11  Kerfuffle  Fix issue where interactive mode crashes when input...
2023-06-11  Kyle Liang  Fixed WSL cuda's OOM error (#1594)
2023-06-11  Ryan Landay  Update SHA256SUMS with current hashes for models quanti...
2023-06-10  Georgi Gerganov  cmake : fix Metal build (close #1791)
2023-06-10  Artyom Lebedev  k-quants : GCC12 compilation fix (#1792)
2023-06-10  Andrei  metal : fix issue with ggml-metal.metal path. Closes...
2023-06-10  Aisuko  doc : fix wrong address of BLIS.md (#1772)
2023-06-10  Georgi Gerganov  ggml : force no_alloc == false when creating opt tensor...
2023-06-10  Kawrakow  metal : add Q4_1 implementation (#1785)
2023-06-10  Kerfuffle  llama : support requantizing models instead of only...
2023-06-10  Xingchen Song...  ggml : workaround for missing _mm256_setr_m128i in...
2023-06-10  rankaiyx  make : add SSSE3 compilation use case (#1659)
2023-06-09  Robert Sung...  OpenCL: Add release memory (#1741)
2023-06-09  Johannes Gäßler  Windows nvcc workaround (#1753)
2023-06-09  Georgi Gerganov  metal : fix build "tanhf" -> "tanh"
2023-06-09  AT  metal : add GELU implementation (#1770)
2023-06-09  Kawrakow  metal : faster q4_0 (#1775)
2023-06-08  Kawrakow  metal : add Q2_K implementation (#1762)
2023-06-08  Georgi Gerganov  Revert "ggml : load data into int8x16x4_t using vld4q_s...
2023-06-08  le.chang  ggml : load data into int8x16x4_t using vld4q_s8 on...
2023-06-08  Kawrakow  metal : Q6_K implementation (#1752)
2023-06-08  qingfengfenga  Add llama.cpp docker support for non-latin languages...
2023-06-08  Steven Roussey  ggml : fix fprintf warnings (#1720)
2023-06-08  Georgi Gerganov  clang-tidy : restore dot file from accidental deletion
2023-06-08  Kawrakow  metal : add Q4_K implementation (#1733)
2023-06-08  johnson442  k-quants : add missing compile definition to CMakeLists...
2023-06-07  Georgi Gerganov  k-quants : allow to optionally disable at compile time...
2023-06-07  jacobi petrucciani  flake : update to support metal on m1/m2 (#1724)
2023-06-07  Georgi Gerganov  readme : add June roadmap
2023-06-07  Willy Tarreau  main: add the possibility to open the prompt cache...
2023-06-06  Georgi Gerganov  llama : fix vram_scratch var
2023-06-06  Georgi Gerganov  llama : fix compile warnings
2023-06-06  Johannes Gäßler  Multi GPU support, CUDA refactor, CUDA scratch buffer...
2023-06-06  Georgi Gerganov  metal : add f16 support
2023-06-06  LostRuins  Clblast fixes + enhancements to save VRAM and offload...
2023-06-06  Georgi Gerganov  ggml : fix builds, add ggml-quants-k.o (close #1712...
2023-06-06  Georgi Gerganov  gitignore : add .clang-tidy
2023-06-06  Georgi Gerganov  llama : temporary disable Q6_K output quantization...
2023-06-06  Spencer Sutton  metal : add checks for buffer size (#1706)
2023-06-05  Yuval Peled  docs : add performance troubleshoot + example benchmark...
2023-06-05  Foul-Tarnished  readme : fix typo (#1700)
2023-06-05  mgroeber9110  llama : consistently catch and throw only exceptions...
2023-06-05  kiltyj  metal : use shared buffers between CPU and GPU (#1696)
2023-06-05  grahameth  ggml : fix internal overflow in ggml_time_us on Windows...
2023-06-05  Georgi Gerganov  ci : disable auto tidy (#1705)
2023-06-05  Kawrakow  ggml : add SOTA 2,3,4,5,6 bit k-quantizations (#1684)
2023-06-05  Henri Vasserman  Increase 3B scratch buffers. (#1698)
2023-06-05  Georgi Gerganov  llama : fix Metal KV cache sync (close #1695)
2023-06-04  Georgi Gerganov  readme : update hot topics
2023-06-04  Georgi Gerganov  llama : Metal inference (#1642)
2023-06-04  0cc4m  OpenCL: Fix duplication of layers in VRAM and RAM,...
2023-06-03  Henri Vasserman  Add info about CUDA_VISIBLE_DEVICES (#1682)
2023-06-03  Jiří Podivín  Docker: change to calling convert.py (#1641)
2023-06-03  Evan Jones  Fix prompt cache saving and chat-persistent rollover...
2023-05-30  Henri Vasserman  OpenLLaMA 3B support (#1588)
2023-05-29  Georgi Gerganov  ggml : sync cgraph import / export API
2023-05-29  Georgi Gerganov  ggml : fix bug in ggml_alibi
2023-05-29  DannyDaemonic  Work around for recalculating logits in cached prompts...
2023-05-29  Jiří Podivín  Adding git in container package dependencies (#1621)
2023-05-28  Johannes Gäßler  LLAMA_DEBUG adds debug symbols (#1617)
2023-05-28  Kerfuffle  Only show -ngl option when relevant + other doc/arg...
2023-05-28  Vladimir Zorin  examples : add --alias option to gpt_params to set...
2023-05-28  Howard Su  opencl : no need to allocate cl_mem on heap (#1612)
2023-05-28  Howard Su  opencl : use strstr to check if fp16 supported (#1611)
2023-05-27  apcameron  ggml : add support for the RISCV architecture (#1616)
2023-05-27  Kerfuffle  Include server in releases + other build system cleanup...
2023-05-27  Henri Vasserman  Add documentation about CLBlast (#1604)
2023-05-27  Henri Vasserman  [CI] Fix openblas (#1613)
2023-05-27  Georgi Gerganov  ggml : add ggml_tensor_overhead()
2023-05-27  Henri Vasserman  [CI] CLBlast: Fix directory name (#1606)
2023-05-27  Georgi Gerganov  ggml : sync ggml core (minor additions, e.g. ggml_get_t...
2023-05-26  Kerfuffle  Some improvements to loading the session with --prompt...
2023-05-25  Johannes Gäßler  cuda : performance optimizations (#1530)
2023-05-24  Henri Vasserman  Update CLBlast to 1.6.0 (#1580)
2023-05-24  Evan Jones  readme : add docs for chat-persistent.sh (#1568)
2023-05-24  Senemu  chat-persistent.sh : use bracket expressions in grep...
2023-05-23  Maarten ter...  Fix handling of "invalid property" when creating OpenCL...
2023-05-22  0cc4m  OpenCL Token Generation Acceleration (#1459)