git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2026-03-15 sprayandwipe kv-cache : fix reading llama_kv_cell_ext during state...
2026-03-14 Michael Wand model : wire up Qwen3.5/Qwen3.5MoE tensors for NVFP4...
2026-03-14 Georgi Gerganov metal : add FA specialization for HSK = 320, HSV =...
2026-03-14 Georgi Gerganov ci : move self-hosted workflows to separate files ...
2026-03-14 Gerard Guillemas... docker : force Python 3.13 in Vulkan container (#20530)
2026-03-14 Eve ci : try to optimize some jobs (#20521)
2026-03-14 Max Krasnyansky hexagon: Q4_0 and MXFP4 repack fixes (#20527)
2026-03-14 Georgi Gerganov ci : reduce webgpu tests timeout to 900s (#20538)
2026-03-14 Xuan-Son Nguyen mtmd: add llama-mtmd-debug binary (#20508)
2026-03-14 Neo Zhang add op gated_delta_net (#20455)
2026-03-14 Chedrian07 webui: restore code preview iframe origin isolation...
2026-03-14 Adrien Gallouët scripts : remove get-wikitext-103.sh (#20543)
2026-03-14 Adrien Gallouët scripts : update get-hellaswag.sh and get-winogrande...
2026-03-14 Adrien Gallouët ggml : add native AVX512-FP16 support for F16 operation...
2026-03-14 Wallentri Use fp32 in cuBLAS V100 to avoid overflows, env variabl...
2026-03-14 Zijun Yu ggml : add OpenVINO backend (#15307)
2026-03-14 Adrien Gallouët vendor : update cpp-httplib to 0.37.2 (#20484)
2026-03-14 Rail Chabdarov Fix data race in CUDA's "cpy" kernel (influences GGML...
2026-03-14 lhez opencl: fix l2_norm (#20480)
2026-03-13 Adrien Gallouët tools : enable kvu in perplexity for hellaswag, winogra...
2026-03-13 Georgi Gerganov graph : remove redundant GDN state transposes (#20443)
2026-03-13 Piotr Wilkin... common/parser: gracefully handle undetected tool parser...
2026-03-13 ZeroV0LT llama : fix pooling assertion crash in chunked GDN...
2026-03-13 SoftwareRenderer server: reset counter related to kill-switch on client...
2026-03-13 rehan-10xengineer ggml-cpu: add RVV vec dot kernels for quantization...
2026-03-13 Adrien Gallouët ggml : fix typo gmml (#20512)
2026-03-13 Daniel Bevenius mtmd : rename mtmd_get_audio_bitrate to mtmd_get_audio_...
2026-03-13 Piotr Wilkin... general: CONTRIBUTING.md - guidelines for quantization...
2026-03-13 Georgi Gerganov metal : fix l2 norm scale (#20493)
2026-03-13 Daniel Bevenius convert : fix/suppress pyright errors (#20442)
2026-03-12 Georgi Gerganov llama : disable graph reuse with pipeline parallelism...
2026-03-12 Alessandro... vendor : update cpp-httplib to 0.37.1 (#20390)
2026-03-12 Piotr Wilkin... tests : use `reasoning` instead of `reasoning_budget...
2026-03-12 Ruben Ortlam test-backend-ops: allow loading tests from file and...
2026-03-12 Daniel Bevenius common : update completion executables list [no ci...
2026-03-12 Asbjørn Olling grammar: Fix grammar root symbol check (#19761)
2026-03-12 ProgenyAlpha vulkan: add GATED_DELTA_NET op support (#20334)
2026-03-12 Sigbjørn Skjæret convert : better mtp check and fix return [no ci] ...
2026-03-12 ProgenyAlpha vulkan: fix SSM_CONV PP scaling with large ubatch sizes...
2026-03-12 Pascal New conversations now auto-select the first loaded...
2026-03-12 Masashi Yoshimura ggml-virtgpu: Fix some build commands (#20341)
2026-03-12 Georgi Gerganov metal : avoid divisions in bin kernel (#20426)
2026-03-12 Masato Nakasaka ci: Setup self-hosted CI for Intel Linux Vulkan backend...
2026-03-12 Jeff Bolz vulkan: fix l2_norm epsilon handling (#20350)
2026-03-12 Jeff Bolz vulkan: fix OOB check in flash_attn_mask_opt (#20296)
2026-03-12 Masato Nakasaka vulkan: Fix ErrorOutOfHostMemory on Intel GPU when...
2026-03-12 lhez opencl: use larger workgroup size for get_rows (#20316)
2026-03-12 shaofeiqi opencl: add cumsum op (#18981)
2026-03-12 uvos hip: compile debug builds with -O2 on hip to avoid...
2026-03-12 Mishusha common/parser: add GigaChatV3/3.1 models support (...
2026-03-11 DAN™ model : add support for Phi4ForCausalLMV (#20168)
2026-03-11 Richard Davison graph : add optional scale parameter to build_lora_mm...
2026-03-11 ddh0 common : fix --n-cpu-moe, --cpu-moe for models with...
2026-03-11 Masashi Yoshimura ggml-webgpu: Add supports for `GGML_OP_REPEAT` (#20230)
2026-03-11 Georgi Gerganov llama : enable chunked fused GDN path (#20340)
2026-03-11 Sigbjørn Skjæret llama : whitespace cleanup (#20422)
2026-03-11 Richard Davison ggml : add NVFP4 quantization type support (#19769)
2026-03-11 Georgi Gerganov benches : add nemotron super (#20420)
2026-03-11 Daniel Bevenius llama : add support for Nemotron 3 Super (#20411)
2026-03-11 Georgi Gerganov metal : fix capture_compute counter logic (#20410)
2026-03-11 Aman Gupta compare-llama-bench: check remotes as well (#20406)
2026-03-11 Georgi Gerganov metal : fix q5_k mul_mv register spill (#20399)
2026-03-11 Georgi Gerganov metal : add env var to trigger graph capture (#20398)
2026-03-11 Neo Zhang [SYCL] Update SYCL.md for binary package for Windows...
2026-03-11 Ruben Ortlam ci: disable coopmat on ubuntu-24-cmake-vulkan job ...
2026-03-11 Aldehir Rojas common/parser: use nlohmann::ordered_json to preserve...
2026-03-11 Piotr Wilkin... common/parser: handle reasoning budget (#20297)
2026-03-11 uvos ggml-cuda: gdn use shared mem for HIP (#20366)
2026-03-11 uvos cuda/hip: fix loop unrolling in ssm-conv (#20369)
2026-03-11 Pascal Fix agentic mcp image single model (#20339)
2026-03-11 Alessandro... vendor : update cpp-httplib to 0.37.0 (#20207)
2026-03-11 Alessandro... vendor : update miniaudio to 0.11.25 (#20209)
2026-03-11 Neo Zhang fix op rope, add rope_back (#20293)
2026-03-11 Neo Zhang fix for failed UT case: ACC, L2_NORM, UPSCALE, fused_gl...
2026-03-10 Vinicios Lugli model : qwen3vl reranker text support (#20332)
2026-03-10 ddh0 llama-quant : correct `n_attention_wv` usage (#20357)
2026-03-10 Georgi Gerganov ggml : bump RPC version (#20330)
2026-03-10 Reese Levine ggml webgpu: faster normal quant and some k-quant matri...
2026-03-10 Piotr Wilkin... Reduce level of content parser warning message to avoid...
2026-03-10 Ray Xu examples : fix empty items in json_schema_to_grammar...
2026-03-10 a3894281 docs: update CPU backend ops to mark POOL_1D as support...
2026-03-10 Georgi Gerganov models : fix assert in mamba2 (cont) (#20335)
2026-03-10 Georgi Gerganov server : make 2 checkpoints near the end of the prompt...
2026-03-10 Sigbjørn Skjæret common : fix incorrect uses of stoul (#20313)
2026-03-10 Charles Xu kleidiai : support for concurrent sme and neon kernel...
2026-03-10 Taimur Ahmad ggml-cpu: add RVV repack GEMM and GEMV for quantization...
2026-03-10 Julian Pscheid metal: handle command buffer failures gracefully in...
2026-03-10 ddh0 llama-quant : fail early on missing imatrix, refactor...
2026-03-09 Aldehir Rojas common: consolidate PEG string parsers (#20263)
2026-03-09 Xuan-Son Nguyen model: fix step3.5 n_rot (#20318)
2026-03-09 Xuan-Son Nguyen llama: dynamic head_dim and n_rot for SWA (#20301)
2026-03-09 Evan Huus server: Parse port numbers from MCP server URLs in...
2026-03-09 Paul Flynn metal : extend mul_mv_ext to BF16, Q2_K, Q3_K (#20250)
2026-03-09 Georgi Gerganov server : fix checkpoints n_tokens calculation (#20287)
2026-03-09 Georgi Gerganov metal : add upscale (#20284)
2026-03-09 Georgi Gerganov server : warn swa-full is not supported for non-SWA...
2026-03-09 Georgi Gerganov server : fix off-by-1 in server_tokens::size_up_to_pos...
2026-03-09 Piotr Wilkin... common: map developer role to system (#20215)
2026-03-09 Georgi Gerganov models : fix assert in mamba2 graph (#20270)
2026-03-09 Georgi Gerganov server : add kill switch when server is stuck (#20277)