]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2026-03-25 Shreya JainUse docker in build-android.yml (#20928)
2026-03-25 Aman Guptallama-bench: print `-n-cpu-moe` when offloaded layers...
2026-03-25 Masato Nakasakaci: Allow ninja to be used during unit test (#20742)
2026-03-25 Georgi Gerganovci : disable self-hosted mac jobs (#20985)
2026-03-25 Xuan-Son Nguyenjinja: fix macro with kwargs (#20960)
2026-03-25 Francisco Herreragguf-split : clarify operation of gguf-split (#19749)
2026-03-25 Johannes Gäßlerllama: fix llama-model-saver (#20503)
2026-03-25 Aleksander... webui: Fix editing assistant message without branching...
2026-03-25 PascalAdd SLEEPING status to the WebUI model selector (#20949)
2026-03-25 yikechayedanandroid : fix-pointer-dangling (#20974)
2026-03-25 Neo Zhangsycl : fix wrong variable check by assert (#20903)
2026-03-25 Sigbjørn Skjæretci : bump gguf publish python version (#20982)
2026-03-25 Sigbjørn Skjæretci : limit requirements versions (#20980)
2026-03-25 Dowonconvert : register Qwen3Model architecture (#20967)
2026-03-25 Ravi Panchumarthydocs : Update OpenVINO backend docs (#20968)
2026-03-24 Georgi Gerganovmodels : move the token embedding norms to the first...
2026-03-24 Aman Guptaggml-backend: re-enable graph reuse with pipeline paral...
2026-03-24 Alessandro... vendor : update cpp-httplib to 0.39.0 (#20933)
2026-03-24 Adrien Gallouëtcommon : fix get_gguf_split_info (#20946)
2026-03-24 BlueMöhreWebUI: fix edit msg form textarea height (#20830)
2026-03-24 Adrien Gallouëtreadme : clarify MODEL_ENDPOINT usage (#20941)
2026-03-24 Adrien Gallouëtcommon : add a WARNING for HF cache migration (#20935)
2026-03-24 nurimetal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
2026-03-24 Georgi Gerganovmetal : add FA instantiations for HSK=512, HSV=512...
2026-03-24 Aaron Teoissues: add openvino backends (#20932)
2026-03-24 Adrien Gallouëtcommon : add standard Hugging Face cache support (...
2026-03-24 Aman Guptallama-fit: fix regex pattern for gate_up tensors (...
2026-03-24 Aldehir Rojascommon : replace wrap_for_generation with a prefix...
2026-03-23 Max Krasnyanskyhexagon: general DMA and Binary Op fixes for large...
2026-03-23 Max KrasnyanskyAdd codeowners for scripts/snapdragon and docs/snapdrag...
2026-03-23 lhezopencl: add q6_K gemm and gemv kernels for Adreno ...
2026-03-23 las7rpc : RCE patch (#20908)
2026-03-23 Xuan-Son Nguyencontrib: add "Requirements" section to PR template...
2026-03-23 Davi Henrique... devops: upgraded default oneAPI version (#20731)
2026-03-23 Aleksander... webui: Improve chat form positioning (#20901)
2026-03-23 Geo Maciolekdocs: Fix typo in reasoning flag documentation (#20780)
2026-03-23 Georgi Gerganovmemory : fix seq_id bounds in llama_memory_recurrent...
2026-03-23 Eric Zhangdocs : rerun llama-gen-docs to include new CLI args...
2026-03-23 Xuan-Son Nguyenserver: use httplib dynamic threads (#20817)
2026-03-23 Georgi Gerganovai : update gh permissions (#20895)
2026-03-23 Pascalwebui: fix --webui-config-file settings not applied...
2026-03-23 Rashid Ul Islammetal: add CONV_3D (#19927)
2026-03-23 Jhen-Jie Hongcommon/autoparser : detect reasoning markers when enabl...
2026-03-23 Chenguang LiCANN: add RoPE cache preload before ACL graph capture...
2026-03-23 Dan Hoffmanfix(openvino): explicit memset in buffer_context alloca...
2026-03-23 shaofeiqiopencl: add flattened Q4_K mv and general Q4_K mm ...
2026-03-23 bssrdfmtmd: Add dynamic high-resolution image preprocessing...
2026-03-23 DorianRudolphmtmd : fix LightOnOCR image preprocessing (#20877)
2026-03-22 Xuan-Son Nguyenserver: allow router to report child instances sleep...
2026-03-22 Johannes GäßlerCUDA: fix BF16 FA compilation (#20865)
2026-03-22 Sigbjørn Skjæretjinja : refactor token advancement (#20864)
2026-03-22 Evgeny Kurnevskyserver: fix Host header (#20843)
2026-03-22 Neo Zhangsupport bf16 and quantized type (#20803)
2026-03-22 Patrick Buckleyggml-cuda: native bf16 flash attention for vec kernel...
2026-03-22 Gaurav Garg[CUDA] Increase number of output elements per-thread...
2026-03-21 ddh0misc : prefer ggml-org models in docs and examples...
2026-03-21 Andrea Arcangelicommon/grammar: fix grammar parsing issues to prevent...
2026-03-21 Tom Hillbrunnercontext : use n_embd_out for pooled embedding extractio...
2026-03-21 Xuan-Son Nguyendocs : explicit about banning accounts that violates...
2026-03-21 y198fix(rpc): prevent division by zero in deserialize_tenso...
2026-03-21 Michael WandConvert: Make NVFP4 and MXFP4 HF conversions say NVFP4...
2026-03-21 Sigbjørn Skjæretci : switch from pyright to ty (#20826)
2026-03-21 Matt CoralloAdd shader count for Intel Arc Pro B60 (#20818)
2026-03-20 Piotr Wilkin... common/parser: fix nasty bug causing subtle corruption...
2026-03-20 shalinib-ibmggml-cpu: add always_inline to tinyBLAS_PPC accumulator...
2026-03-20 Georgi Gerganovai : limit runtime of the agent (#20816)
2026-03-20 James O'Learycommon : fix typo in debug log ('extracft' -> 'extract...
2026-03-20 Georgi Gerganovai : do not run bash commands in the prompt (#20810)
2026-03-20 Victor Villarmodel : fix Granite Hybrid type check for 7B.A1B (...
2026-03-20 Xuan-Son Nguyenserver: (doc) clarify in-scope and out-scope features...
2026-03-20 Jeff Bolzvulkan: change gated_delta_net to shard a column across...
2026-03-20 Ruikai Pengcontext: zero output buffer on allocation (#20781)
2026-03-20 Ruikai Pengmodel: assert nextn_predict_layers to prevent underflow...
2026-03-20 Georgi Gerganovserver : improve mtmd ctx checkpoints (#20726)
2026-03-20 hipuddingCANN: add BF16 support for core operators (#20152)
2026-03-20 Seyoung Jeongdocs : fix Metal backend op support status in ops.md...
2026-03-20 Georgi Gerganovai : update find-related action (#20790)
2026-03-20 Ruikai Pengjinja : fix heap OOB read in value equality comparison...
2026-03-20 James O'Learycommon/parser : fix out_of_range crash in throw path...
2026-03-19 Ben Racicotserver: fix router mode deadlock on child crash and...
2026-03-19 Tomeamisdocs: Update server README to reflect PR #20297 (#20560)
2026-03-19 Sundaram krishnanggml: guard KleidiAI DOWNLOAD_EXTRACT_TIMESTAMP for...
2026-03-19 Georgi Gerganovci : improve action for duplicate issue (#20772)
2026-03-19 Rail Chabdarovhip: Avoid compiler bug in RDNA code generation during...
2026-03-19 Ryan Gouldenserver: Add cached_tokens info to oaicompat responses...
2026-03-19 James O'Learychat : handle tool calls with no required args in TAG_W...
2026-03-19 Georgi Gerganovci : clarify gh command for viewing issues (#20766)
2026-03-19 Yiwei Shaohexagon: add Matrix Extensions (HMX) for Hexagon NPU...
2026-03-19 uvosci : add hip quality check (#20430)
2026-03-19 Piotr Wilkin... common/parser: add proper reasoning tag prefill reading...
2026-03-19 Reese Levineggml webgpu: ops support for qwen3.5 (SET, TRI_SOLVE...
2026-03-19 ddh0common : add LLAMA_ARG_SPEC_TYPE (#20744)
2026-03-19 Georgi Gerganovci : add action for finding duplicate issues (#20756)
2026-03-19 PascalServer becomes the source of truth for sampling paramet...
2026-03-19 Xuan-Son Nguyenmtmd: add clip_graph::build_mm() (#20751)
2026-03-19 PascalWebUI: Persist the on/off state of the MCP servers...
2026-03-19 Aleksander... webui: Improve model parsing logic + add unit tests...
2026-03-19 Dowonconvert : support is_causal hyperparameter (#20746)
2026-03-19 Aldehir Rojascommon : fix gpt-oss content removal (#20745)
2026-03-19 Evevulkan: dequantize iq4_xs 4 at a time (#20657)
next