]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-02-14 Evevulkan: linux builds + small subgroup size fixes (...
2025-02-14 theraininskyllama-bench : fix unexpected global variable initialize...
2025-02-13 Georgi Gerganovreadme : minor
2025-02-13 Jeffrey Morganllamafile: use member variable instead of constant...
2025-02-13 Reza Rahemtolaserver : (docs) Update wrong tool calling example ...
2025-02-13 Daniel Beveniusllama : add --completion-bash option (#11846)
2025-02-13 R0CKSTARmusa: bump MUSA SDK version to rc3.1.1 (#11822)
2025-02-13 Olivier Chafik`server`: fix tool-call of DeepSeek R1 Qwen, return...
2025-02-13 Vinesh Janarthanansampling: add Top-nσ sampler (#11223)
2025-02-13 Oleksandr Kuvshynovllama.cpp: fix warning message (#11839)
2025-02-13 Daniel Beveniusllama : update llama_decode_internal ref [no ci] (...
2025-02-13 Diego Devesaggml-cpu : add chunking support to mul_mat_id (#11666)
2025-02-12 Xuan-Son Nguyenggml : x2 speed for WASM by optimizing SIMD (#11453)
2025-02-12 Woof Dogserver : (webui) Give copy button back to all message...
2025-02-12 uvosHIP: Remove GCN from list of devices that avoid MMQ...
2025-02-12 JCFix: Compile failure due to Microsoft STL breaking...
2025-02-12 Georgi Gerganovsync : ggml
2025-02-12 uvosHIP: Switch to std::vector in rocblas version check...
2025-02-12 bandoticleanup: fix compile warnings associated with gnu_print...
2025-02-12 Richardggml : fix multi-threaded clamp_f32 (#11824)
2025-02-12 Weizhao Ouyangggml-cpu: Fix duplicate MATMUL_INT8 (#11817)
2025-02-12 Johannes GäßlerCUDA: fix CUDART_VERSION checks (#11821)
2025-02-12 Daniel Beveniusllama : fix typo in llama-grammar.h [no ci] (#11816)
2025-02-11 lhezdocs: add OpenCL (#11697)
2025-02-11 Sheldon RobinsonFix #11802: Compile bug - RegQueryValueExA changed...
2025-02-11 Daniel Beveniusserver : use common_token_to_piece instead of common_de...
2025-02-10 Johannes GäßlerCUDA: use arch list for compatibility check (#11775)
2025-02-10 Maxim Evtushfix: typos in documentation files (#11791)
2025-02-10 jason_wdocs: utilize the forward slash (/) as the path separat...
2025-02-10 Xuan-Son Nguyenserver : (webui) introduce conversation branching ...
2025-02-10 Wilken Gottwaltllama-mmap: fix missing include (#11796)
2025-02-10 Xuan-Son Nguyenserver : correct signal handler (#11795)
2025-02-10 Olivier Chafiksync: minja (https://github.com/google/minja/commit...
2025-02-10 pascal-lcUpdate README.md [no ci] (#11781)
2025-02-10 Danny Milosavljevicvulkan: Make Vulkan optional at runtime (#11493). ...
2025-02-10 Wagner Brunavulkan: add environment variable GGML_VK_PREFER_HOST_ME...
2025-02-09 Eric CurtinThere's a better way of clearing lines (#11756)
2025-02-09 Jeff Bolzvulkan: account for lookup tables when checking shared...
2025-02-08 Xuan-Son Nguyenserver : (webui) revamp Settings dialog, add Pyodide...
2025-02-08 Woof Dogserver : (webui) increase edit textarea size (#11763)
2025-02-08 Georgi Gerganovserver : minor log updates (#11760)
2025-02-08 Georgi Gerganovcont : fix mmap flag print (#11699)
2025-02-08 Karol Kontnyggml: Fix data race in ggml threadpool (#11736)
2025-02-08 Johannes GäßlerCUDA: fix min. version for movmatrix (#11751)
2025-02-08 Nikolaos Pothitosreadme : update front-end framework (#11753)
2025-02-08 Xuan-Son Nguyenserver : (webui) fix numeric settings being saved as...
2025-02-07 Eric CurtinMake logging more verbose (#11714)
2025-02-07 Georgi Gerganovllama : fix defrag logic (#11707)
2025-02-07 Christian Fillionvocab : ignore invalid UTF-8 input in the BPE tokenizer...
2025-02-07 magicsellama : fix progress dots (#11730)
2025-02-07 Jeff Bolzvulkan: print shared memory size (#11719)
2025-02-07 Christian Fillionllama : add llama_sampler_init for safe usage of llama_...
2025-02-07 Akarshan BiswasSYCL: remove XMX info from print devices (#11712)
2025-02-07 Daniel Beveniuscommon : add default embeddings presets (#11677)
2025-02-07 Jinyang Heggml : optimize and build warning fix for LoongArch...
2025-02-06 tv1wndllama : fix old glm4 models (#11670)
2025-02-06 Georgi Gerganovsync : ggml
2025-02-06 Patrick Pengrpc: fix known RCE in rpc-server (ggml/1103)
2025-02-06 Xuan-Son Nguyenserver : (webui) migrate project to ReactJS with typesc...
2025-02-06 Tei Homedocs: update fedora cuda guide for 12.8 release (#11393)
2025-02-06 Akarshan BiswasSYCL: Adjust support condition for norm operators ...
2025-02-06 Georgi Gerganovllama : add log about loading model tensors (#11699)
2025-02-06 Adrien Gallouëtbuild : fix llama.pc (#11658)
2025-02-06 junchao-zhaoggml : fix LoongArch compile error with 128-bit SIMD...
2025-02-06 Jeff Bolzvulkan: optimize coopmat2 iq2/iq3 callbacks (#11521)
2025-02-06 Rémy Ovulkan: initial support for IQ4_XS quantization (#11501)
2025-02-06 Jeff Bolzvulkan: use smaller combined allocations to avoid fragm...
2025-02-06 Charles Duffymetal : avoid breaking build when metal API predates...
2025-02-06 Matvey Solovievreadme : add link to Autopen under UIs (#11684)
2025-02-05 Georgi Gerganovmetal : adjust support conditions for norm operators...
2025-02-05 Johannes GäßlerCUDA: support for mat. mul. with ne03 != ne13 (#11656)
2025-02-05 SAMIllava: add quantization for the visual projector LLAVA...
2025-02-05 Olivier Chafik`sync`: minja (#11641)
2025-02-04 Johannes GäßlerCUDA: non-contiguous (RMS) norm support (#11659)
2025-02-04 fxzjshmHIP: force max threads per block to be 1024 (#11621)
2025-02-04 Xuan-Son Nguyenserver : add try..catch to places not covered by set_ex...
2025-02-04 Radoslav Gerganovarg : list RPC devices first when using --list-devices...
2025-02-04 Olivier Chafik`tool-call`: command r7b fix for normal responses ...
2025-02-04 Shelby Jenkinsreadme : add llm_client Rust crate to readme bindings...
2025-02-04 Jhen-Jie Hongswift : fix llama-vocab api usage (#11645)
2025-02-04 Jhen-Jie Hongmetal : use residency set for other platforms (#11648)
2025-02-04 Georgi Gerganovauthors : update
2025-02-04 Georgi Gerganovsync : ggml upstream/0.0.4631
2025-02-04 Christian Kastnercmake: Add ability to pass in GGML_BUILD_NUMBER (ggml...
2025-02-04 Georgi Gerganovci : do not stale-close roadmap issues
2025-02-03 Olivier Chafik`tool-call`: allow `--chat-template chatml` w/ `--jinja...
2025-02-03 Xuan-Son Nguyenserver : (webui) revert hacky solution from #11626...
2025-02-03 Woof Dogserver : (webui) allow typing and submitting during...
2025-02-03 Daniel Beveniusserver : remove CPPHTTPLIB_NO_EXCEPTIONS define (#11622)
2025-02-03 Georgi Gerganovsync : ggml
2025-02-03 Johannes GäßlerCUDA: fix Volta FlashAttention logic (#11615)
2025-02-03 mashdragonserver : (webui) Fix Shift+Enter handling (#11609)
2025-02-02 Johannes GäßlerHIP: fix flash_attn_stream_k_fixup warning (#11604)
2025-02-02 uvosCUDA/HIP: add support for selectable warp size to mmv...
2025-02-02 uvosHIP: add GGML_CUDA_CC_IS_* for amd familys as increasin...
2025-02-02 Olivier Chafiknit: more informative crash when grammar sampler fails...
2025-02-02 Johannes GäßlerCUDA: use mma PTX instructions for FlashAttention ...
2025-02-02 Eric CurtinName colors (#11573)
2025-02-02 Olivier Chafik`tool-call`: support Command R7B (+ return tool_plan...
2025-02-02 Olivier ChafikFix exotic ci env that lacks ostringstream::str (#11581)
next