]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2023-06-19 Georgi Gerganovcmake : fix trailing whitespaces
2023-06-19 Kawrakowllama : only use Q6_K for output weights if tensor...
2023-06-19 Kawrakowcuda : faster k-quants on older GPUs (#1930)
2023-06-19 Georgi Gerganovggml : sync latest ggml repo (#1924)
2023-06-19 Howard Sucmake : fix build shared ggml when CUDA is enabled...
2023-06-19 Johannes GäßlerConvert vector to f16 for dequantize mul mat vec (...
2023-06-18 Johannes GäßlerAdded tokens per second to info prints (#1928)
2023-06-18 Johannes GäßlerFixed incorrectly applying RMS norm twice (#1925)
2023-06-18 l3utterflyggml : fix bug in ggml_compute_forward_add_q_f32 (...
2023-06-18 Mikereadme : update Android build instructions (#1922)
2023-06-18 Kawrakowllama : prevent usage of k-quants when tensor size...
2023-06-18 Kawrakowexamples : fix examples/metal (#1920)
2023-06-18 Georgi Gerganovmetal : handle buffers larger than device's maxBufferLe...
2023-06-18 Howard Sucmake : add CUDA_ARCHITECTURES to new target ggml_stati...
2023-06-17 Georgi Gerganovmake : do not print help for simple example
2023-06-17 Georgi Gerganovminor : warning fixes
2023-06-17 Johannes GäßlerOnly one CUDA stream per device for async compute ...
2023-06-17 Georgi Gerganovllama : fix kv_cache `n` init (close #1903)
2023-06-17 DaniAndTheWebmake : update for latest Arch (#1701)
2023-06-17 Howard Suggml : fix warnings under MSVC (#1908)
2023-06-17 Aaron Millermetal : add norm, cpy f16->f16, alibi kernels (#1823)
2023-06-17 Faez Shakilexposed modules so that they can be invoked by nix...
2023-06-17 Randall FitzgeraldServer Example Refactor and Improvements (#1570)
2023-06-17 Jiří Podivínhooks : setting up flake8 and pre-commit hooks (#1681)
2023-06-17 Gustavo Rocha... readme : alternative way to build for Android with...
2023-06-17 KerfuffleAllow cmake to build ggml as a library (#1896)
2023-06-17 David Yangtrain : get raw text instead of page with html (#1905)
2023-06-16 0cc4mopencl : support k-quants (#1836)
2023-06-16 SuperUserNameManexamples : add "simple" (#1840)
2023-06-16 Zenixcmake : add auto detection of BLAS_INCLUDE_DIRS (#1886)
2023-06-16 Johannes Gäßlerllama : fix embd when offloading non-repeating layers...
2023-06-16 FrankHBFixed possible macro redefinition (#1892)
2023-06-16 Borislav Stanimirovbuild : fix and ignore MSVC warnings (#1889)
2023-06-16 KawrakowCUDA : faster k-quant dot kernels (#1862)
2023-06-16 Borislav Stanimirovgitignore : add several entries specific to Visual...
2023-06-15 Johannes GäßlerFixed CUDA runtime version check (#1879)
2023-06-15 Georgi Gerganovcmake : remove whitespaces
2023-06-15 yangli2examples : add chat-vicuna.sh (#1854)
2023-06-15 Igor Okulistcmake : set include path for OpenBlas (#1830)
2023-06-15 Frederik Vogelswift : Package compile breaks due to ggml-metal.metal...
2023-06-15 daboe01make : add train-text-from-scratch (#1850)
2023-06-15 Srinivas Billareadme : server compile flag (#1874)
2023-06-15 sandyiscoolmake : clean *.so files (#1857)
2023-06-15 Howard SuFix the validation of main device (#1872)
2023-06-15 Georgi Gerganovmetal : parallel command buffer encoding (#1860)
2023-06-15 Johannes GäßlerBetter error when using both LoRA + GPU layers (#1861)
2023-06-14 Johannes GäßlerCUDA full GPU acceleration, KV cache in VRAM (#1827)
2023-06-13 0xspringtimebaby-llama : fix operator!= (#1821)
2023-06-13 xaedestrain : improved training-from-scratch example (#1652)
2023-06-13 Georgi Gerganovllama : do a warm-up eval at start for better timings...
2023-06-13 KerfuffleAllow "quantizing" to f16 and f32 (#1787)
2023-06-12 KawrakowMetal implementation for all k_quants (#1807)
2023-06-12 slarenci : run when changing only the CUDA sources (#1800)
2023-06-12 Howard SuLeverage mmap for offloading tensors to GPU (#1597)
2023-06-12 Kawrakowmetal : fix failure to load model (#1817)
2023-06-11 KerfuffleFix issue where interactive mode crashes when input...
2023-06-11 Kyle LiangFixed WSL cuda's OOM error (#1594)
2023-06-11 Ryan LandayUpdate SHA256SUMS with current hashes for models quanti...
2023-06-10 Georgi Gerganovcmake : fix Metal build (close #1791)
2023-06-10 Artyom Lebedevk-quants : GCC12 compilation fix (#1792)
2023-06-10 Andreimetal : fix issue with ggml-metal.metal path. Closes...
2023-06-10 Aisukodoc : fix wrong address of BLIS.md (#1772)
2023-06-10 Georgi Gerganovggml : force no_alloc == false when creating opt tensor...
2023-06-10 Kawrakowmetal : add Q4_1 implementation (#1785)
2023-06-10 Kerfufflellama : support requantizing models instead of only...
2023-06-10 Xingchen Song... ggml : workaround for missing _mm256_setr_m128i in...
2023-06-10 rankaiyxmake : add SSSE3 compilation use case (#1659)
2023-06-09 Robert Sung... OpenCL: Add release memory (#1741)
2023-06-09 Johannes GäßlerWindows nvcc workaround (#1753)
2023-06-09 Georgi Gerganovmetal : fix build "tanhf" -> "tanh"
2023-06-09 ATmetal : add GELU implementation (#1770)
2023-06-09 Kawrakowmetal : faster q4_0 (#1775)
2023-06-08 Kawrakowmetal : add Q2_K implementation (#1762)
2023-06-08 Georgi GerganovRevert "ggml : load data into int8x16x4_t using vld4q_s...
2023-06-08 le.changggml : load data into int8x16x4_t using vld4q_s8 on...
2023-06-08 Kawrakowmetal : Q6_K implementation (#1752)
2023-06-08 qingfengfengaAdd llama.cpp docker support for non-latin languages...
2023-06-08 Steven Rousseyggml : fix fprintf warnings (#1720)
2023-06-08 Georgi Gerganovclang-tidy : restore dot file from accidental deletion
2023-06-08 Kawrakowmetal : add Q4_K implementation (#1733)
2023-06-08 johnson442k-quants : add missing compile definition to CMakeLists...
2023-06-07 Georgi Gerganovk-quants : allow to optionally disable at compile time...
2023-06-07 jacobi petruccianiflake : update to support metal on m1/m2 (#1724)
2023-06-07 Georgi Gerganovreadme : add June roadmap
2023-06-07 Willy Tarreaumain: add the possibility to open the prompt cache...
2023-06-06 Georgi Gerganovllama : fix vram_scratch var
2023-06-06 Georgi Gerganovllama : fix compile warnings
2023-06-06 Johannes GäßlerMulti GPU support, CUDA refactor, CUDA scratch buffer...
2023-06-06 Georgi Gerganovmetal : add f16 support
2023-06-06 LostRuinsClblast fixes + enhancements to save VRAM and offload...
2023-06-06 Georgi Gerganovggml : fix builds, add ggml-quants-k.o (close #1712...
2023-06-06 Georgi Gerganovgitignore : add .clang-tidy
2023-06-06 Georgi Gerganovllama : temporary disable Q6_K output quantization...
2023-06-06 Spencer Suttonmetal : add checks for buffer size (#1706)
2023-06-05 Yuval Peleddocs : add performance troubleshoot + example benchmark...
2023-06-05 Foul-Tarnishedreadme : fix typo (#1700)
2023-06-05 mgroeber9110llama : consistently catch and throw only exceptions...
2023-06-05 kiltyjmetal : use shared buffers between CPU and GPU (#1696)
2023-06-05 grahamethggml : fix internal overflow in ggml_time_us on Windows...
2023-06-05 Georgi Gerganovci : disable auto tidy (#1705)
next