]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2026-03-25 Ravi Panchumarthydocs : Update OpenVINO backend docs (#20968)
2026-03-24 Georgi Gerganovmodels : move the token embedding norms to the first...
2026-03-24 Aman Guptaggml-backend: re-enable graph reuse with pipeline paral...
2026-03-24 Alessandro... vendor : update cpp-httplib to 0.39.0 (#20933)
2026-03-24 Adrien Gallouëtcommon : fix get_gguf_split_info (#20946)
2026-03-24 BlueMöhreWebUI: fix edit msg form textarea height (#20830)
2026-03-24 Adrien Gallouëtreadme : clarify MODEL_ENDPOINT usage (#20941)
2026-03-24 Adrien Gallouëtcommon : add a WARNING for HF cache migration (#20935)
2026-03-24 nurimetal : add FLOOR, CEIL, ROUND, TRUNC unary ops (#20930)
2026-03-24 Georgi Gerganovmetal : add FA instantiations for HSK=512, HSV=512...
2026-03-24 Aaron Teoissues: add openvino backends (#20932)
2026-03-24 Adrien Gallouëtcommon : add standard Hugging Face cache support (...
2026-03-24 Aman Guptallama-fit: fix regex pattern for gate_up tensors (...
2026-03-24 Aldehir Rojascommon : replace wrap_for_generation with a prefix...
2026-03-23 Max Krasnyanskyhexagon: general DMA and Binary Op fixes for large...
2026-03-23 Max KrasnyanskyAdd codeowners for scripts/snapdragon and docs/snapdrag...
2026-03-23 lhezopencl: add q6_K gemm and gemv kernels for Adreno ...
2026-03-23 las7rpc : RCE patch (#20908)
2026-03-23 Xuan-Son Nguyencontrib: add "Requirements" section to PR template...
2026-03-23 Davi Henrique... devops: upgraded default oneAPI version (#20731)
2026-03-23 Aleksander... webui: Improve chat form positioning (#20901)
2026-03-23 Geo Maciolekdocs: Fix typo in reasoning flag documentation (#20780)
2026-03-23 Georgi Gerganovmemory : fix seq_id bounds in llama_memory_recurrent...
2026-03-23 Eric Zhangdocs : rerun llama-gen-docs to include new CLI args...
2026-03-23 Xuan-Son Nguyenserver: use httplib dynamic threads (#20817)
2026-03-23 Georgi Gerganovai : update gh permissions (#20895)
2026-03-23 Pascalwebui: fix --webui-config-file settings not applied...
2026-03-23 Rashid Ul Islammetal: add CONV_3D (#19927)
2026-03-23 Jhen-Jie Hongcommon/autoparser : detect reasoning markers when enabl...
2026-03-23 Chenguang LiCANN: add RoPE cache preload before ACL graph capture...
2026-03-23 Dan Hoffmanfix(openvino): explicit memset in buffer_context alloca...
2026-03-23 shaofeiqiopencl: add flattened Q4_K mv and general Q4_K mm ...
2026-03-23 bssrdfmtmd: Add dynamic high-resolution image preprocessing...
2026-03-23 DorianRudolphmtmd : fix LightOnOCR image preprocessing (#20877)
2026-03-22 Xuan-Son Nguyenserver: allow router to report child instances sleep...
2026-03-22 Johannes GäßlerCUDA: fix BF16 FA compilation (#20865)
2026-03-22 Sigbjørn Skjæretjinja : refactor token advancement (#20864)
2026-03-22 Evgeny Kurnevskyserver: fix Host header (#20843)
2026-03-22 Neo Zhangsupport bf16 and quantized type (#20803)
2026-03-22 Patrick Buckleyggml-cuda: native bf16 flash attention for vec kernel...
2026-03-22 Gaurav Garg[CUDA] Increase number of output elements per-thread...
2026-03-21 ddh0misc : prefer ggml-org models in docs and examples...
2026-03-21 Andrea Arcangelicommon/grammar: fix grammar parsing issues to prevent...
2026-03-21 Tom Hillbrunnercontext : use n_embd_out for pooled embedding extractio...
2026-03-21 Xuan-Son Nguyendocs : explicit about banning accounts that violates...
2026-03-21 y198fix(rpc): prevent division by zero in deserialize_tenso...
2026-03-21 Michael WandConvert: Make NVFP4 and MXFP4 HF conversions say NVFP4...
2026-03-21 Sigbjørn Skjæretci : switch from pyright to ty (#20826)
2026-03-21 Matt CoralloAdd shader count for Intel Arc Pro B60 (#20818)
2026-03-20 Piotr Wilkin... common/parser: fix nasty bug causing subtle corruption...
2026-03-20 shalinib-ibmggml-cpu: add always_inline to tinyBLAS_PPC accumulator...
2026-03-20 Georgi Gerganovai : limit runtime of the agent (#20816)
2026-03-20 James O'Learycommon : fix typo in debug log ('extracft' -> 'extract...
2026-03-20 Georgi Gerganovai : do not run bash commands in the prompt (#20810)
2026-03-20 Victor Villarmodel : fix Granite Hybrid type check for 7B.A1B (...
2026-03-20 Xuan-Son Nguyenserver: (doc) clarify in-scope and out-scope features...
2026-03-20 Jeff Bolzvulkan: change gated_delta_net to shard a column across...
2026-03-20 Ruikai Pengcontext: zero output buffer on allocation (#20781)
2026-03-20 Ruikai Pengmodel: assert nextn_predict_layers to prevent underflow...
2026-03-20 Georgi Gerganovserver : improve mtmd ctx checkpoints (#20726)
2026-03-20 hipuddingCANN: add BF16 support for core operators (#20152)
2026-03-20 Seyoung Jeongdocs : fix Metal backend op support status in ops.md...
2026-03-20 Georgi Gerganovai : update find-related action (#20790)
2026-03-20 Ruikai Pengjinja : fix heap OOB read in value equality comparison...
2026-03-20 James O'Learycommon/parser : fix out_of_range crash in throw path...
2026-03-19 Ben Racicotserver: fix router mode deadlock on child crash and...
2026-03-19 Tomeamisdocs: Update server README to reflect PR #20297 (#20560)
2026-03-19 Sundaram krishnanggml: guard KleidiAI DOWNLOAD_EXTRACT_TIMESTAMP for...
2026-03-19 Georgi Gerganovci : improve action for duplicate issue (#20772)
2026-03-19 Rail Chabdarovhip: Avoid compiler bug in RDNA code generation during...
2026-03-19 Ryan Gouldenserver: Add cached_tokens info to oaicompat responses...
2026-03-19 James O'Learychat : handle tool calls with no required args in TAG_W...
2026-03-19 Georgi Gerganovci : clarify gh command for viewing issues (#20766)
2026-03-19 Yiwei Shaohexagon: add Matrix Extensions (HMX) for Hexagon NPU...
2026-03-19 uvosci : add hip quality check (#20430)
2026-03-19 Piotr Wilkin... common/parser: add proper reasoning tag prefill reading...
2026-03-19 Reese Levineggml webgpu: ops support for qwen3.5 (SET, TRI_SOLVE...
2026-03-19 ddh0common : add LLAMA_ARG_SPEC_TYPE (#20744)
2026-03-19 Georgi Gerganovci : add action for finding duplicate issues (#20756)
2026-03-19 PascalServer becomes the source of truth for sampling paramet...
2026-03-19 Xuan-Son Nguyenmtmd: add clip_graph::build_mm() (#20751)
2026-03-19 PascalWebUI: Persist the on/off state of the MCP servers...
2026-03-19 Aleksander... webui: Improve model parsing logic + add unit tests...
2026-03-19 Dowonconvert : support is_causal hyperparameter (#20746)
2026-03-19 Aldehir Rojascommon : fix gpt-oss content removal (#20745)
2026-03-19 Evevulkan: dequantize iq4_xs 4 at a time (#20657)
2026-03-19 Charles Xucmake : fix build warning when kleidiai is enabled...
2026-03-19 Sigbjørn Skjæretvocab : assert array size of scores and toktypes (...
2026-03-19 Kevin Hannondocs: add information about openvino in the docker...
2026-03-19 Chenguang LiCANN: handle in-place ROPE on non-contiguous f32 tensor...
2026-03-19 Masashi Yoshimuraggml-webgpu: Update the `RMS_NORM` preprocessor and...
2026-03-19 Masashi Yoshimuraggml-webgpu: Add supports for `DIAG` and `TRI` (#20664)
2026-03-19 Chenguang LiCANN: support flash attention for head dim not multiple...
2026-03-18 Michael Graumodel : add control vector support where missing (...
2026-03-18 Sigbjørn Skjæretgguf-py : cleaner way to get the first key (#20727)
2026-03-18 crsawyerRebuild index.html.gz (#20724)
2026-03-18 Reese LevineMove to no timeout for WaitAny in graph submission...
2026-03-18 Shaw Nguyenggml-cpu/x86: fix unused changemask warning in repack...
2026-03-18 Georgi Gerganovsync : ggml
2026-03-18 Georgi Gerganovggml : bump version to 0.9.8 (ggml/1442)
next