]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2026-03-29 Gaurav GargOptimize MOE GEMV kernel for BS > 1. (#20905)
2026-03-29 Max Krasnyanskyhexagon: dma optimizations (mostly fixing regressions...
2026-03-29 Davi Henrique... devops: including compute-runtime for intel.Dockerfile...
2026-03-29 Neo Zhang[SYCL] Enhance build script to use half cores to build...
2026-03-28 Sigbjørn Skjæretfix **/x glob matching (#21129)
2026-03-28 Piotr Wilkin... common/parser: fix handling of tool definition with...
2026-03-28 Sigbjørn Skjæretcommon : add character class support to glob_match...
2026-03-28 BlueMöhreWebUI: Replace illegal nested button elements (#21026)
2026-03-28 Adriencommon/json-schema: fix: handle non-capturing groups...
2026-03-28 Aldehir Rojascommon : add reasoning_format = none support to gpt...
2026-03-28 Georgi Gerganovserver : fix processing of multiple back-to-back mtmd...
2026-03-28 Adrien Gallouëtci : gracefully shut down the server (#21110)
2026-03-28 Woof DogDocument custom default webui preferences in server...
2026-03-28 Aleksander... webui: Conversation forking + branching improvements...
2026-03-28 Adrien Gallouëtvendor : update cpp-httplib to 0.40.0 (#21100)
2026-03-28 Ruben Ortlamvulkan: add noncontiguous GLU support (#21081)
2026-03-28 Piotr Wilkin... common/parser: fix reasoning whitespace bugs + extra...
2026-03-28 Sigbjørn Skjæretcli : add /glob command (#21084)
2026-03-28 Ts-sounddocker : fix and enable ARM64 image build (#20929)
2026-03-28 Adrien Gallouëtserver : add custom socket options to disable SO_REUSEP...
2026-03-27 Aldehir Rojascommon : inhibit lazy grammar sampler while reasoning...
2026-03-27 Kusha Gharahiserver: Introduce LLAMA_BUILD_WEBUI build flag to allow...
2026-03-27 Yiwei Shaohexagon: support for IQ4_NL and MXFP4 (#21018)
2026-03-27 Aleksander... webui: Improve Chat Messages initial scroll + auto...
2026-03-27 AN Longserver: remove the verbose_prompt parameter (#21059)
2026-03-27 Xuan-Son Nguyenmtmd: add more sanity checks (#21047)
2026-03-27 Xuan-Son Nguyenserver: add built-in tools backend support (#20898)
2026-03-27 Radoslav Gerganovrpc : proper handling of data pointers to CPU buffers...
2026-03-27 mtmcpcompletion : session_tokens insert range in completion...
2026-03-27 mtmcpcompletion : Fix segfault on model load failure (#21049)
2026-03-27 PascalSend reasoning content back to the model across turns...
2026-03-27 renmetal : Fix dimension constraint violation in matmul2d...
2026-03-27 KokerZhouCANN: update docker images to 8.5.0 and improve CANN...
2026-03-26 Saba Fallahmtmd: fix "v.patch_embd" quant and unsupported im2col...
2026-03-26 uvoship: use fnuz fp8 for conversion on CDNA3 (#21040)
2026-03-26 Xuan-Son Nguyenci: pin external actions to exact commit SHA (#21033)
2026-03-26 Adrien Gallouëtcommon : add getpwuid fallback for HF cache when HOME...
2026-03-26 Xuan-Son Nguyenmtmd: refactor image preprocessing (#21031)
2026-03-26 lhezopencl: allow large buffer for adreno (#20997)
2026-03-26 Michael Wandconvert : support Qwen3.5/Qwen3.5 Moe NVFP4 and add...
2026-03-26 Pavel Zloiconvert : add RuGPT3XL (RuGPT3XLForCausalLM) support...
2026-03-26 Adrien Gallouëtcommon : filter out imatrix when finding models (#21023)
2026-03-26 ihb2032fix(ggml): correct RISC-V ISA string canonical ordering...
2026-03-26 Adrien Gallouëtcommon : make LLAMA_CACHE the one cache for everything...
2026-03-26 Adrien Gallouëtcommon : fix split model migration (#21019)
2026-03-26 Michael Wandggml-cuda: Add NVFP4 dp4a kernel (#20644)
2026-03-26 SamareshSinghimatrix : fix crash when using --show-statistics with...
2026-03-26 Yihao WangCUDA & CPU: support F32 kernel type for `CONV_TRANSPOSE...
2026-03-25 Adrien Gallouëtcommon : do not delete old files from the old cache...
2026-03-25 Saba Fallahmtmd: Add DeepSeekOCR Support (#17400)
2026-03-25 Adrien Gallouëtcommon : fix verbosity setup (#20989)
2026-03-25 Adrien Gallouëtcommon : fix gguf selection in common_list_cached_model...
2026-03-25 uvosci : fix parsing of vgpr counts in hip-quality-check...
2026-03-25 Saba Fallahmodel: codefuse-ai/F2LLM-v2 support
2026-03-25 Dowonmodel : allow causal_attn and pooling_type on all archi...
2026-03-25 Aparna M Psnapdragon: add missing features to WoS scripts to...
2026-03-25 Shreya JainUse docker in build-android.yml (#20928)
2026-03-25 Aman Guptallama-bench: print `-n-cpu-moe` when offloaded layers...
2026-03-25 Masato Nakasakaci: Allow ninja to be used during unit test (#20742)
2026-03-25 Georgi Gerganovci : disable self-hosted mac jobs (#20985)
2026-03-25 Xuan-Son Nguyenjinja: fix macro with kwargs (#20960)
2026-03-25 Francisco Herreragguf-split : clarify operation of gguf-split (#19749)
2026-03-25 Johannes Gäßlerllama: fix llama-model-saver (#20503)
2026-03-25 Aleksander... webui: Fix editing assistant message without branching...
2026-03-25 PascalAdd SLEEPING status to the WebUI model selector (#20949)
2026-03-25 yikechayedanandroid : fix-pointer-dangling (#20974)
2026-03-25 Neo Zhangsycl : fix wrong variable check by assert (#20903)
2026-03-25 Sigbjørn Skjæretci : bump gguf publish python version (#20982)
2026-03-25 Sigbjørn Skjæretci : limit requirements versions (#20980)
2026-03-25 Dowonconvert : register Qwen3Model architecture (#20967)
2026-03-25 Ravi Panchumarthydocs : Update OpenVINO backend docs (#20968)
2026-03-24 Georgi Gerganovmodels : move the token embedding norms to the first...
2026-03-24 Aman Guptaggml-backend: re-enable graph reuse with pipeline paral...
2026-03-24 Alessandro... vendor : update cpp-httplib to 0.39.0 (#20933)
2026-03-24 Adrien Gallouëtcommon : fix get_gguf_split_info (#20946)
2026-03-24 BlueMöhreWebUI: fix edit msg form textarea height (#20830)
2026-03-24 Adrien Gallouëtreadme : clarify MODEL_ENDPOINT usage (#20941)
2026-03-24 Adrien Gallouëtcommon : add a WARNING for HF cache migration (#20935)
2026-03-24 nurimetal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
2026-03-24 Georgi Gerganovmetal : add FA instantiations for HSK=512, HSV=512...
2026-03-24 Aaron Teoissues: add openvino backends (#20932)
2026-03-24 Adrien Gallouëtcommon : add standard Hugging Face cache support (...
2026-03-24 Aman Guptallama-fit: fix regex pattern for gate_up tensors (...
2026-03-24 Aldehir Rojascommon : replace wrap_for_generation with a prefix...
2026-03-23 Max Krasnyanskyhexagon: general DMA and Binary Op fixes for large...
2026-03-23 Max KrasnyanskyAdd codeowners for scripts/snapdragon and docs/snapdrag...
2026-03-23 lhezopencl: add q6_K gemm and gemv kernels for Adreno ...
2026-03-23 las7rpc : RCE patch (#20908)
2026-03-23 Xuan-Son Nguyencontrib: add "Requirements" section to PR template...
2026-03-23 Davi Henrique... devops: upgraded default oneAPI version (#20731)
2026-03-23 Aleksander... webui: Improve chat form positioning (#20901)
2026-03-23 Geo Maciolekdocs: Fix typo in reasoning flag documentation (#20780)
2026-03-23 Georgi Gerganovmemory : fix seq_id bounds in llama_memory_recurrent...
2026-03-23 Eric Zhangdocs : rerun llama-gen-docs to include new CLI args...
2026-03-23 Xuan-Son Nguyenserver: use httplib dynamic threads (#20817)
2026-03-23 Georgi Gerganovai : update gh permissions (#20895)
2026-03-23 Pascalwebui: fix --webui-config-file settings not applied...
2026-03-23 Rashid Ul Islammetal: add CONV_3D (#19927)
2026-03-23 Jhen-Jie Hongcommon/autoparser : detect reasoning markers when enabl...
2026-03-23 Chenguang LiCANN: add RoPE cache preload before ACL graph capture...
next