git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2023-06-24  Georgi Gerganov  tests : sync test-grad0 from ggml
2023-06-24  Rowan Hart  flake : fix ggml-metal.metal path and run nixfmt (...
2023-06-24  AN Long  convert : fix invalid params in write_vocab_only (...
2023-06-24  slaren  ggml : improve ggml_graph_dump_dot, add ggml_format_nam...
2023-06-24  Georgi Gerganov  readme : fix whitespaces
2023-06-24  Alberto  readme : fixed termux instructions (#1973)
2023-06-24  Alex Renda  llama : fix top-p sampling to match the canonical defin...
2023-06-24  Didzis Gosko  llama : make model stateless and context stateful ...
2023-06-23  eiery  Add OpenLLaMA instructions to the README (#1954)
2023-06-22  Erik Scholz  rework convert.py to read hyper-parameters from config...
2023-06-21  Johannes Gäßler  cmake: revert CUDA arch default to 52, 61 if f16 (...
2023-06-21  Rahul Vivek...  Fix typo in README.md (#1961)
2023-06-20  Georgi Gerganov  readme : add link to p1
2023-06-20  Xiake Sun  Fix typo (#1949)
2023-06-20  Ettore Di Giacinto  llama : fix params struct slignment (#1936)
2023-06-19  Henri Vasserman  [Fix] Reenable server embedding endpoint (#1937)
2023-06-19  Georgi Gerganov  ggml : fix bug in LBFGS optimizer (found by ggml tests)
2023-06-19  l3utterfly  llama : use aligned memory during ggml_init call from...
2023-06-19  Georgi Gerganov  cmake : fix trailing whitespaces
2023-06-19  Kawrakow  llama : only use Q6_K for output weights if tensor...
2023-06-19  Kawrakow  cuda : faster k-quants on older GPUs (#1930)
2023-06-19  Georgi Gerganov  ggml : sync latest ggml repo (#1924)
2023-06-19  Howard Su  cmake : fix build shared ggml when CUDA is enabled...
2023-06-19  Johannes Gäßler  Convert vector to f16 for dequantize mul mat vec (...
2023-06-18  Johannes Gäßler  Added tokens per second to info prints (#1928)
2023-06-18  Johannes Gäßler  Fixed incorrectly applying RMS norm twice (#1925)
2023-06-18  l3utterfly  ggml : fix bug in ggml_compute_forward_add_q_f32 (...
2023-06-18  Mike  readme : update Android build instructions (#1922)
2023-06-18  Kawrakow  llama : prevent usage of k-quants when tensor size...
2023-06-18  Kawrakow  examples : fix examples/metal (#1920)
2023-06-18  Georgi Gerganov  metal : handle buffers larger than device's maxBufferLe...
2023-06-18  Howard Su  cmake : add CUDA_ARCHITECTURES to new target ggml_stati...
2023-06-17  Georgi Gerganov  make : do not print help for simple example
2023-06-17  Georgi Gerganov  minor : warning fixes
2023-06-17  Johannes Gäßler  Only one CUDA stream per device for async compute ...
2023-06-17  Georgi Gerganov  llama : fix kv_cache `n` init (close #1903)
2023-06-17  DaniAndTheWeb  make : update for latest Arch (#1701)
2023-06-17  Howard Su  ggml : fix warnings under MSVC (#1908)
2023-06-17  Aaron Miller  metal : add norm, cpy f16->f16, alibi kernels (#1823)
2023-06-17  Faez Shakil  exposed modules so that they can be invoked by nix...
2023-06-17  Randall Fitzgerald  Server Example Refactor and Improvements (#1570)
2023-06-17  Jiří Podivín  hooks : setting up flake8 and pre-commit hooks (#1681)
2023-06-17  Gustavo Rocha...  readme : alternative way to build for Android with...
2023-06-17  Kerfuffle  Allow cmake to build ggml as a library (#1896)
2023-06-17  David Yang  train : get raw text instead of page with html (#1905)
2023-06-16  0cc4m  opencl : support k-quants (#1836)
2023-06-16  SuperUserNameMan  examples : add "simple" (#1840)
2023-06-16  Zenix  cmake : add auto detection of BLAS_INCLUDE_DIRS (#1886)
2023-06-16  Johannes Gäßler  llama : fix embd when offloading non-repeating layers...
2023-06-16  FrankHB  Fixed possible macro redefinition (#1892)
2023-06-16  Borislav Stanimirov  build : fix and ignore MSVC warnings (#1889)
2023-06-16  Kawrakow  CUDA : faster k-quant dot kernels (#1862)
2023-06-16  Borislav Stanimirov  gitignore : add several entries specific to Visual...
2023-06-15  Johannes Gäßler  Fixed CUDA runtime version check (#1879)
2023-06-15  Georgi Gerganov  cmake : remove whitespaces
2023-06-15  yangli2  examples : add chat-vicuna.sh (#1854)
2023-06-15  Igor Okulist  cmake : set include path for OpenBlas (#1830)
2023-06-15  Frederik Vogel  swift : Package compile breaks due to ggml-metal.metal...
2023-06-15  daboe01  make : add train-text-from-scratch (#1850)
2023-06-15  Srinivas Billa  readme : server compile flag (#1874)
2023-06-15  sandyiscool  make : clean *.so files (#1857)
2023-06-15  Howard Su  Fix the validation of main device (#1872)
2023-06-15  Georgi Gerganov  metal : parallel command buffer encoding (#1860)
2023-06-15  Johannes Gäßler  Better error when using both LoRA + GPU layers (#1861)
2023-06-14  Johannes Gäßler  CUDA full GPU acceleration, KV cache in VRAM (#1827)
2023-06-13  0xspringtime  baby-llama : fix operator!= (#1821)
2023-06-13  xaedes  train : improved training-from-scratch example (#1652)
2023-06-13  Georgi Gerganov  llama : do a warm-up eval at start for better timings...
2023-06-13  Kerfuffle  Allow "quantizing" to f16 and f32 (#1787)
2023-06-12  Kawrakow  Metal implementation for all k_quants (#1807)
2023-06-12  slaren  ci : run when changing only the CUDA sources (#1800)
2023-06-12  Howard Su  Leverage mmap for offloading tensors to GPU (#1597)
2023-06-12  Kawrakow  metal : fix failure to load model (#1817)
2023-06-11  Kerfuffle  Fix issue where interactive mode crashes when input...
2023-06-11  Kyle Liang  Fixed WSL cuda's OOM error (#1594)
2023-06-11  Ryan Landay  Update SHA256SUMS with current hashes for models quanti...
2023-06-10  Georgi Gerganov  cmake : fix Metal build (close #1791)
2023-06-10  Artyom Lebedev  k-quants : GCC12 compilation fix (#1792)
2023-06-10  Andrei  metal : fix issue with ggml-metal.metal path. Closes...
2023-06-10  Aisuko  doc : fix wrong address of BLIS.md (#1772)
2023-06-10  Georgi Gerganov  ggml : force no_alloc == false when creating opt tensor...
2023-06-10  Kawrakow  metal : add Q4_1 implementation (#1785)
2023-06-10  Kerfuffle  llama : support requantizing models instead of only...
2023-06-10  Xingchen Song...  ggml : workaround for missing _mm256_setr_m128i in...
2023-06-10  rankaiyx  make : add SSSE3 compilation use case (#1659)
2023-06-09  Robert Sung...  OpenCL: Add release memory (#1741)
2023-06-09  Johannes Gäßler  Windows nvcc workaround (#1753)
2023-06-09  Georgi Gerganov  metal : fix build "tanhf" -> "tanh"
2023-06-09  AT  metal : add GELU implementation (#1770)
2023-06-09  Kawrakow  metal : faster q4_0 (#1775)
2023-06-08  Kawrakow  metal : add Q2_K implementation (#1762)
2023-06-08  Georgi Gerganov  Revert "ggml : load data into int8x16x4_t using vld4q_s...
2023-06-08  le.chang  ggml : load data into int8x16x4_t using vld4q_s8 on...
2023-06-08  Kawrakow  metal : Q6_K implementation (#1752)
2023-06-08  qingfengfenga  Add llama.cpp docker support for non-latin languages...
2023-06-08  Steven Roussey  ggml : fix fprintf warnings (#1720)
2023-06-08  Georgi Gerganov  clang-tidy : restore dot file from accidental deletion
2023-06-08  Kawrakow  metal : add Q4_K implementation (#1733)
2023-06-08  johnson442  k-quants : add missing compile definition to CMakeLists...
2023-06-07  Georgi Gerganov  k-quants : allow to optionally disable at compile time...