git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2026-03-11 DAN™ model : add support for Phi4ForCausalLMV (#20168)
2026-03-11 Richard Davison graph : add optional scale parameter to build_lora_mm...
2026-03-11 ddh0 common : fix --n-cpu-moe, --cpu-moe for models with...
2026-03-11 Masashi Yoshimura ggml-webgpu: Add supports for `GGML_OP_REPEAT` (#20230)
2026-03-11 Georgi Gerganov llama : enable chunked fused GDN path (#20340)
2026-03-11 Sigbjørn Skjæret llama : whitespace cleanup (#20422)
2026-03-11 Richard Davison ggml : add NVFP4 quantization type support (#19769)
2026-03-11 Georgi Gerganov benches : add nemotron super (#20420)
2026-03-11 Daniel Bevenius llama : add support for Nemotron 3 Super (#20411)
2026-03-11 Georgi Gerganov metal : fix capture_compute counter logic (#20410)
2026-03-11 Aman Gupta compare-llama-bench: check remotes as well (#20406)
2026-03-11 Georgi Gerganov metal : fix q5_k mul_mv register spill (#20399)
2026-03-11 Georgi Gerganov metal : add env var to trigger graph capture (#20398)
2026-03-11 Neo Zhang [SYCL] Update SYCL.md for binary package for Windows...
2026-03-11 Ruben Ortlam ci: disable coopmat on ubuntu-24-cmake-vulkan job ...
2026-03-11 Aldehir Rojas common/parser: use nlohmann::ordered_json to preserve...
2026-03-11 Piotr Wilkin... common/parser: handle reasoning budget (#20297)
2026-03-11 uvos ggml-cuda: gdn use shared mem for HIP (#20366)
2026-03-11 uvos cuda/hip: fix loop unrolling in ssm-conv (#20369)
2026-03-11 Pascal Fix agentic mcp image single model (#20339)
2026-03-11 Alessandro... vendor : update cpp-httplib to 0.37.0 (#20207)
2026-03-11 Alessandro... vendor : update miniaudio to 0.11.25 (#20209)
2026-03-11 Neo Zhang fix op rope, add rope_back (#20293)
2026-03-11 Neo Zhang fix for failed UT case: ACC, L2_NORM, UPSCALE, fused_gl...
2026-03-10 Vinicios Lugli model : qwen3vl reranker text support (#20332)
2026-03-10 ddh0 llama-quant : correct `n_attention_wv` usage (#20357)
2026-03-10 Georgi Gerganov ggml : bump RPC version (#20330)
2026-03-10 Reese Levine ggml webgpu: faster normal quant and some k-quant matri...
2026-03-10 Piotr Wilkin... Reduce level of content parser warning message to avoid...
2026-03-10 Ray Xu examples : fix empty items in json_schema_to_grammar...
2026-03-10 a3894281 docs: update CPU backend ops to mark POOL_1D as support...
2026-03-10 Georgi Gerganov models : fix assert in mamba2 (cont) (#20335)
2026-03-10 Georgi Gerganov server : make 2 checkpoints near the end of the prompt...
2026-03-10 Sigbjørn Skjæret common : fix incorrect uses of stoul (#20313)
2026-03-10 Charles Xu kleidiai : support for concurrent sme and neon kernel...
2026-03-10 Taimur Ahmad ggml-cpu: add RVV repack GEMM and GEMV for quantization...
2026-03-10 Julian Pscheid metal: handle command buffer failures gracefully in...
2026-03-10 ddh0 llama-quant : fail early on missing imatrix, refactor...
2026-03-09 Aldehir Rojas common: consolidate PEG string parsers (#20263)
2026-03-09 Xuan-Son Nguyen model: fix step3.5 n_rot (#20318)
2026-03-09 Xuan-Son Nguyen llama: dynamic head_dim and n_rot for SWA (#20301)
2026-03-09 Evan Huus server: Parse port numbers from MCP server URLs in...
2026-03-09 Paul Flynn metal : extend mul_mv_ext to BF16, Q2_K, Q3_K (#20250)
2026-03-09 Georgi Gerganov server : fix checkpoints n_tokens calculation (#20287)
2026-03-09 Georgi Gerganov metal : add upscale (#20284)
2026-03-09 Georgi Gerganov server : warn swa-full is not supported for non-SWA...
2026-03-09 Georgi Gerganov server : fix off-by-1 in server_tokens::size_up_to_pos...
2026-03-09 Piotr Wilkin... common: map developer role to system (#20215)
2026-03-09 Georgi Gerganov models : fix assert in mamba2 graph (#20270)
2026-03-09 Georgi Gerganov server : add kill switch when server is stuck (#20277)
2026-03-09 Aman Gupta ggml-cuda: disable gdn for musa (#20278)
2026-03-09 ddh0 llama-quant : left-align tensor names in output (#20117)
2026-03-09 Aman Gupta contributing: limit open PRs for new contributors to...
2026-03-09 Bertay Eren ggml-vulkan: add SGN operator, auto-generate Vulkan...
2026-03-09 Ruben Ortlam vulkan: skip zero size tensors in backend copies (...
2026-03-09 Michael Huang cuda : display total and free VRAM capacity during...
2026-03-09 Aaron Teo llama-bench: introduce `-hf` and `-hff` flags & use...
2026-03-09 Piotr Wilkin... PEG parser for LFM2 (#20251)
2026-03-08 Georgi Gerganov server : do not create checkpoints right after mtmd...
2026-03-08 Sigbjørn Skjæret graph : remove redundant scale_w parameter (#20235)
2026-03-08 Aldehir Rojas common : gracefully handle incomplete output (#20191)
2026-03-08 Piotr Wilkin... Fix compile bug (#20203)
2026-03-08 Piotr Wilkin... Fix structured outputs (#20223)
2026-03-08 GiantPrince ggml-vulkan: Add ELU op support (#20183)
2026-03-08 Jeff Bolz vulkan: Fix data races in coopmat1 mul_mat(_id) (#20084)
2026-03-08 Johannes Gäßler llama: end-to-end tests (#19802)
2026-03-08 Christopher... readme : update infra list (#20212)
2026-03-08 Piotr Wilkin... Revert to OAI-compatible args (#20213)
2026-03-08 decahedron1 server : correct index on finish in OAI completion...
2026-03-08 Neo Zhang [SYCL] supprt Flash Attention for fp32/fp16/Q4/Q5/Q8...
2026-03-07 Aman Gupta ggml: add GATED_DELTA_NET op (#19504)
2026-03-07 lhez opencl: add l2_norm (#20160)
2026-03-07 Piotr Wilkin... Autoparser: True streaming (#20177)
2026-03-06 Piotr Wilkin... Autoparser: add optional argument reshuffle capability...
2026-03-06 Bartowski quants : Add memsets and other fixes for IQ quants...
2026-03-06 Piotr Wilkin... Add @pwilkin to CODEOWNERS for autoparser code (#20174)
2026-03-06 Piotr Wilkin... Autoparser - complete refactoring of parser architectur...
2026-03-06 Todor Boinovski hexagon: add f32 ssm_conv op (#20122)
2026-03-06 Tom Vaucourt server : preserve anthropic thinking blocks in conversi...
2026-03-06 Max Krasnyansky cpu: skip redudant ROPE cache updates (#20149)
2026-03-06 Aman Gupta ggml-cuda: add mem check for fusion (#19916)
2026-03-06 Aaron Teo ggml: update comments for backends which have no memory...
2026-03-06 shalinib-ibm ggml-cpu: Fix gcc 15 ICE on ppc64le (#20083) (#20130)
2026-03-06 Aman Gupta CUDA: use shared mem for ssm_conv (#20128)
2026-03-06 Tim Neumann context: ignore zero scale LoRAs when checking sameness...
2026-03-06 Piotr Wilkin... Checkpoint every n tokens: squash (#20087)
2026-03-06 Aleksander... webui: Agentic Loop + MCP Client with support for Tools...
2026-03-06 Johannes Gäßler ggml-cpu: fix data race for debug asserts (#20148)
2026-03-06 Georgi Gerganov kv-cache : fix M-RoPE checkpoints (#20132)
2026-03-06 Roj234 cli : Don't clear system prompt when using '/clear...
2026-03-06 lhez opencl: add neg, exp and diag (#20127)
2026-03-06 YardenTal44 hexagon: add fp16 support for binary ops: add,sub,mul...
2026-03-05 ymcki models : kda chunk size = 16 (#19827)
2026-03-05 Andreas Kieslinger CUDA: Improve performance via less synchronizations...
2026-03-05 Eric Zhang model : update Qwen3.5 model type detection (#20126)
2026-03-05 Sigbjørn Skjæret cli : add command and file auto-completion (#19985)
2026-03-05 Sigbjørn Skjæret convert : register Qwen 3.5 ForCausalLM for text only...
2026-03-05 Aleksander... webui: Improvements for Models Selector UI (#20066)
2026-03-05 Marcel Petrick chore : correct typos [no ci] (#20041)
2026-03-05 Max Krasnyansky hexagon: Flash Attention optimizations (dma, mpyacc...