]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2023-06-28 Johannes GäßlerCUDA GPU acceleration for LoRAs + f16 models (#1970)
2023-06-28 ningshanwutuobangllama : support input embeddings directly (#1910)
2023-06-27 Erik Scholzfix pthreads setaffinity usage on android (#2020)
2023-06-27 Howard Subaby-llama : fix build after ggml_rope change (#2016)
2023-06-26 Georgi Gerganovllama : fix rope usage after ChatGLM change
2023-06-26 Georgi Gerganovggml : add support for ChatGLM RoPE
2023-06-26 Roman Parykinreadme : add Scala 3 bindings repo (#2010)
2023-06-26 David Yangggml : increase max tensor name + clean up compiler...
2023-06-26 Gustavo Rocha... readme : LD_LIBRARY_PATH complement for some Android...
2023-06-26 Georgi Gerganovggml : avoid conv 2d kernel round up
2023-06-26 zrmggml : add NUMA support (#1556)
2023-06-26 Georgi Gerganovk-quants : fix indentation
2023-06-26 katsu560tests : fix quantize perf (#1990)
2023-06-26 katsu560k-quants : add AVX support to dot functions (#1916)
2023-06-26 Georgi Gerganovreadme : add link to new k-quants for visibility
2023-06-26 Kawrakowk-quants : support for super-block size of 64 (#2001)
2023-06-26 Howard SuFix assert when free invalid cuda pointer (#2005)
2023-06-25 Georgi Gerganovreadme : add new roadmap + manifesto
2023-06-25 Georgi Gerganovggml : sync latest ggml (custom operators)
2023-06-25 anon998fix server sampling: top k sampler first (#1977)
2023-06-25 Georgi Gerganovreadme : add Azure CI discussion link
2023-06-25 sjinzhzig : upgrade build system support (#1981)
2023-06-24 Robyn#1869 Fix null reference errors when training from...
2023-06-24 Georgi Gerganovtests : sync test-grad0 from ggml
2023-06-24 Rowan Hartflake : fix ggml-metal.metal path and run nixfmt (...
2023-06-24 AN Longconvert : fix invalid params in write_vocab_only (...
2023-06-24 slarenggml : improve ggml_graph_dump_dot, add ggml_format_nam...
2023-06-24 Georgi Gerganovreadme : fix whitespaces
2023-06-24 Albertoreadme : fixed termux instructions (#1973)
2023-06-24 Alex Rendallama : fix top-p sampling to match the canonical defin...
2023-06-24 Didzis Goskollama : make model stateless and context stateful ...
2023-06-23 eieryAdd OpenLLaMA instructions to the README (#1954)
2023-06-22 Erik Scholzrework convert.py to read hyper-parameters from config...
2023-06-21 Johannes Gäßlercmake: revert CUDA arch default to 52, 61 if f16 (...
2023-06-21 Rahul Vivek... Fix typo in README.md (#1961)
2023-06-20 Georgi Gerganovreadme : add link to p1
2023-06-20 Xiake SunFix typo (#1949)
2023-06-20 Ettore Di Giacintollama : fix params struct slignment (#1936)
2023-06-19 Henri Vasserman[Fix] Reenable server embedding endpoint (#1937)
2023-06-19 Georgi Gerganovggml : fix bug in LBFGS optimizer (found by ggml tests)
2023-06-19 l3utterflyllama : use aligned memory during ggml_init call from...
2023-06-19 Georgi Gerganovcmake : fix trailing whitespaces
2023-06-19 Kawrakowllama : only use Q6_K for output weights if tensor...
2023-06-19 Kawrakowcuda : faster k-quants on older GPUs (#1930)
2023-06-19 Georgi Gerganovggml : sync latest ggml repo (#1924)
2023-06-19 Howard Sucmake : fix build shared ggml when CUDA is enabled...
2023-06-19 Johannes GäßlerConvert vector to f16 for dequantize mul mat vec (...
2023-06-18 Johannes GäßlerAdded tokens per second to info prints (#1928)
2023-06-18 Johannes GäßlerFixed incorrectly applying RMS norm twice (#1925)
2023-06-18 l3utterflyggml : fix bug in ggml_compute_forward_add_q_f32 (...
2023-06-18 Mikereadme : update Android build instructions (#1922)
2023-06-18 Kawrakowllama : prevent usage of k-quants when tensor size...
2023-06-18 Kawrakowexamples : fix examples/metal (#1920)
2023-06-18 Georgi Gerganovmetal : handle buffers larger than device's maxBufferLe...
2023-06-18 Howard Sucmake : add CUDA_ARCHITECTURES to new target ggml_stati...
2023-06-17 Georgi Gerganovmake : do not print help for simple example
2023-06-17 Georgi Gerganovminor : warning fixes
2023-06-17 Johannes GäßlerOnly one CUDA stream per device for async compute ...
2023-06-17 Georgi Gerganovllama : fix kv_cache `n` init (close #1903)
2023-06-17 DaniAndTheWebmake : update for latest Arch (#1701)
2023-06-17 Howard Suggml : fix warnings under MSVC (#1908)
2023-06-17 Aaron Millermetal : add norm, cpy f16->f16, alibi kernels (#1823)
2023-06-17 Faez Shakilexposed modules so that they can be invoked by nix...
2023-06-17 Randall FitzgeraldServer Example Refactor and Improvements (#1570)
2023-06-17 Jiří Podivínhooks : setting up flake8 and pre-commit hooks (#1681)
2023-06-17 Gustavo Rocha... readme : alternative way to build for Android with...
2023-06-17 KerfuffleAllow cmake to build ggml as a library (#1896)
2023-06-17 David Yangtrain : get raw text instead of page with html (#1905)
2023-06-16 0cc4mopencl : support k-quants (#1836)
2023-06-16 SuperUserNameManexamples : add "simple" (#1840)
2023-06-16 Zenixcmake : add auto detection of BLAS_INCLUDE_DIRS (#1886)
2023-06-16 Johannes Gäßlerllama : fix embd when offloading non-repeating layers...
2023-06-16 FrankHBFixed possible macro redefinition (#1892)
2023-06-16 Borislav Stanimirovbuild : fix and ignore MSVC warnings (#1889)
2023-06-16 KawrakowCUDA : faster k-quant dot kernels (#1862)
2023-06-16 Borislav Stanimirovgitignore : add several entries specific to Visual...
2023-06-15 Johannes GäßlerFixed CUDA runtime version check (#1879)
2023-06-15 Georgi Gerganovcmake : remove whitespaces
2023-06-15 yangli2examples : add chat-vicuna.sh (#1854)
2023-06-15 Igor Okulistcmake : set include path for OpenBlas (#1830)
2023-06-15 Frederik Vogelswift : Package compile breaks due to ggml-metal.metal...
2023-06-15 daboe01make : add train-text-from-scratch (#1850)
2023-06-15 Srinivas Billareadme : server compile flag (#1874)
2023-06-15 sandyiscoolmake : clean *.so files (#1857)
2023-06-15 Howard SuFix the validation of main device (#1872)
2023-06-15 Georgi Gerganovmetal : parallel command buffer encoding (#1860)
2023-06-15 Johannes GäßlerBetter error when using both LoRA + GPU layers (#1861)
2023-06-14 Johannes GäßlerCUDA full GPU acceleration, KV cache in VRAM (#1827)
2023-06-13 0xspringtimebaby-llama : fix operator!= (#1821)
2023-06-13 xaedestrain : improved training-from-scratch example (#1652)
2023-06-13 Georgi Gerganovllama : do a warm-up eval at start for better timings...
2023-06-13 KerfuffleAllow "quantizing" to f16 and f32 (#1787)
2023-06-12 KawrakowMetal implementation for all k_quants (#1807)
2023-06-12 slarenci : run when changing only the CUDA sources (#1800)
2023-06-12 Howard SuLeverage mmap for offloading tensors to GPU (#1597)
2023-06-12 Kawrakowmetal : fix failure to load model (#1817)
2023-06-11 KerfuffleFix issue where interactive mode crashes when input...
2023-06-11 Kyle LiangFixed WSL cuda's OOM error (#1594)
2023-06-11 Ryan LandayUpdate SHA256SUMS with current hashes for models quanti...
2023-06-10 Georgi Gerganovcmake : fix Metal build (close #1791)
next