git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-02-20  Charles Xu  ggml-cpu: Add CPU backend support for KleidiAI library...
2025-02-20  Prashant Vithule  ggml: aarch64: implement SVE kernels for q3_K_q8_K...
2025-02-20  Michael Engel  run : add --chat-template-file (#11961)
2025-02-19  Johannes Gäßler  doc: add links to ggml examples [no ci] (#11958)
2025-02-19  Daniel Bevenius  common : add llama.vim preset for Qwen2.5 Coder (#11945)
2025-02-19  Georgi Gerganov  speculative : update default params (#11954)
2025-02-19  Daniel Bevenius  llama : fix indentation in llama-grammar [no ci] (...
2025-02-18  igardev  server : (webui) Enable communication with parent html...
2025-02-18  Olivier Chafik  tool-call: refactor common chat / tool-call api (+...
2025-02-18  Xuan-Son Nguyen  server : add TEI API format for /rerank endpoint (...
2025-02-18  MoonRide303  scripts: corrected encoding when getting chat template...
2025-02-18  xiaobing318  docs : Fix duplicated file extension in test command...
2025-02-17  Johannes Gäßler  CUDA: use async data loading for FlashAttention (#11894)
2025-02-17  Eve  update release requirements (#11897)
2025-02-17  Antoine Viallon  server : fix divide-by-zero in metrics reporting (...
2025-02-17  Rémy O  vulkan: implement several ops relevant for ggml_opt...
2025-02-16  Xuan-Son Nguyen  server : bump httplib to 0.19.0 (#11908)
2025-02-16  standby24x7  common : Fix a typo in help (#11899)
2025-02-16  Xuan-Son Nguyen  ci : fix (again) arm64 build fails (#11895)
2025-02-16  Jeff Bolz  vulkan: support multi/vision rope, and noncontiguous...
2025-02-16  Hale Chan  metal : fix the crash caused by the lack of residency...
2025-02-15  Johannes Gäßler  scripts: fix compare-llama-bench commit hash logic...
2025-02-15  708-145  examples: fix typo in imatrix/README.md (#11884)
2025-02-15  Adrian Kretz  metal : optimize dequant q6_K kernel (#11892)
2025-02-15  Georgi Gerganov  readme : add notice about new package registry (#11890)
2025-02-15  Georgi Gerganov  repo : update links to new url (#11886)
2025-02-15  Olivier Chafik  server: fix type promotion typo causing crashes w/...
2025-02-15  Rémy O  vulkan: initial support for IQ1_S and IQ1_M quantizatio...
2025-02-14  Michał Moskal  llguidance build fixes for Windows (#11664) upstream/0.0.4719
2025-02-14  lhez  opencl: Fix rope and softmax (#11833)
2025-02-14  Diego Devesa  cuda : add ampere to the list of default architectures...
2025-02-14  Georgi Gerganov  docker : drop to CUDA 12.4 (#11869)
2025-02-14  Daniel Bevenius  llama : add completion for --chat-template-file (#11860)
2025-02-14  Jinyang He  ggml: optimize some vec dot functions for LoongArch...
2025-02-14  Eve  vulkan: linux builds + small subgroup size fixes (...
2025-02-14  theraininsky  llama-bench : fix unexpected global variable initialize...
2025-02-13  Georgi Gerganov  readme : minor
2025-02-13  Jeffrey Morgan  llamafile: use member variable instead of constant...
2025-02-13  Reza Rahemtola  server : (docs) Update wrong tool calling example ...
2025-02-13  Daniel Bevenius  llama : add --completion-bash option (#11846)
2025-02-13  R0CKSTAR  musa: bump MUSA SDK version to rc3.1.1 (#11822)
2025-02-13  Olivier Chafik  `server`: fix tool-call of DeepSeek R1 Qwen, return...
2025-02-13  Vinesh Janarthanan  sampling: add Top-nσ sampler (#11223)
2025-02-13  Oleksandr Kuvshynov  llama.cpp: fix warning message (#11839)
2025-02-13  Daniel Bevenius  llama : update llama_decode_internal ref [no ci] (...
2025-02-13  Diego Devesa  ggml-cpu : add chunking support to mul_mat_id (#11666)
2025-02-12  Xuan-Son Nguyen  ggml : x2 speed for WASM by optimizing SIMD (#11453)
2025-02-12  Woof Dog  server : (webui) Give copy button back to all message...
2025-02-12  uvos  HIP: Remove GCN from list of devices that avoid MMQ...
2025-02-12  JC  Fix: Compile failure due to Microsoft STL breaking...
2025-02-12  Georgi Gerganov  sync : ggml
2025-02-12  uvos  HIP: Switch to std::vector in rocblas version check...
2025-02-12  bandoti  cleanup: fix compile warnings associated with gnu_print...
2025-02-12  Richard  ggml : fix multi-threaded clamp_f32 (#11824)
2025-02-12  Weizhao Ouyang  ggml-cpu: Fix duplicate MATMUL_INT8 (#11817)
2025-02-12  Johannes Gäßler  CUDA: fix CUDART_VERSION checks (#11821)
2025-02-12  Daniel Bevenius  llama : fix typo in llama-grammar.h [no ci] (#11816)
2025-02-11  lhez  docs: add OpenCL (#11697)
2025-02-11  Sheldon Robinson  Fix #11802: Compile bug - RegQueryValueExA changed...
2025-02-11  Daniel Bevenius  server : use common_token_to_piece instead of common_de...
2025-02-10  Johannes Gäßler  CUDA: use arch list for compatibility check (#11775)
2025-02-10  Maxim Evtush  fix: typos in documentation files (#11791)
2025-02-10  jason_w  docs: utilize the forward slash (/) as the path separat...
2025-02-10  Xuan-Son Nguyen  server : (webui) introduce conversation branching ...
2025-02-10  Wilken Gottwalt  llama-mmap: fix missing include (#11796)
2025-02-10  Xuan-Son Nguyen  server : correct signal handler (#11795)
2025-02-10  Olivier Chafik  sync: minja (https://github.com/google/minja/commit...
2025-02-10  pascal-lc  Update README.md [no ci] (#11781)
2025-02-10  Danny Milosavljevic  vulkan: Make Vulkan optional at runtime (#11493). ...
2025-02-10  Wagner Bruna  vulkan: add environment variable GGML_VK_PREFER_HOST_ME...
2025-02-09  Eric Curtin  There's a better way of clearing lines (#11756)
2025-02-09  Jeff Bolz  vulkan: account for lookup tables when checking shared...
2025-02-08  Xuan-Son Nguyen  server : (webui) revamp Settings dialog, add Pyodide...
2025-02-08  Woof Dog  server : (webui) increase edit textarea size (#11763)
2025-02-08  Georgi Gerganov  server : minor log updates (#11760)
2025-02-08  Georgi Gerganov  cont : fix mmap flag print (#11699)
2025-02-08  Karol Kontny  ggml: Fix data race in ggml threadpool (#11736)
2025-02-08  Johannes Gäßler  CUDA: fix min. version for movmatrix (#11751)
2025-02-08  Nikolaos Pothitos  readme : update front-end framework (#11753)
2025-02-08  Xuan-Son Nguyen  server : (webui) fix numeric settings being saved as...
2025-02-07  Eric Curtin  Make logging more verbose (#11714)
2025-02-07  Georgi Gerganov  llama : fix defrag logic (#11707)
2025-02-07  Christian Fillion  vocab : ignore invalid UTF-8 input in the BPE tokenizer...
2025-02-07  magicse  llama : fix progress dots (#11730)
2025-02-07  Jeff Bolz  vulkan: print shared memory size (#11719)
2025-02-07  Christian Fillion  llama : add llama_sampler_init for safe usage of llama_...
2025-02-07  Akarshan Biswas  SYCL: remove XMX info from print devices (#11712)
2025-02-07  Daniel Bevenius  common : add default embeddings presets (#11677)
2025-02-07  Jinyang He  ggml : optimize and build warning fix for LoongArch...
2025-02-06  tv1wnd  llama : fix old glm4 models (#11670)
2025-02-06  Georgi Gerganov  sync : ggml
2025-02-06  Patrick Peng  rpc: fix known RCE in rpc-server (ggml/1103)
2025-02-06  Xuan-Son Nguyen  server : (webui) migrate project to ReactJS with typesc...
2025-02-06  Tei Home  docs: update fedora cuda guide for 12.8 release (#11393)
2025-02-06  Akarshan Biswas  SYCL: Adjust support condition for norm operators ...
2025-02-06  Georgi Gerganov  llama : add log about loading model tensors (#11699)
2025-02-06  Adrien Gallouët  build : fix llama.pc (#11658)
2025-02-06  junchao-zhao  ggml : fix LoongArch compile error with 128-bit SIMD...
2025-02-06  Jeff Bolz  vulkan: optimize coopmat2 iq2/iq3 callbacks (#11521)
2025-02-06  Rémy O  vulkan: initial support for IQ4_XS quantization (#11501)