git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-12-22 Xuan-Son Nguyen server: (docs) remove mention about extra_args (#18262)
2025-12-22 Johannes Gäßler tool/ex/tests: consistently free ctx, then model (...
2025-12-21 Jeff Bolz vulkan: Implement set_tensor_async and the event interf...
2025-12-21 Johannes Gäßler llama: fix RPC for -fit on (#18233)
2025-12-21 Xuan-Son Nguyen move copilot instructions to AGENTS.md (#18259)
2025-12-21 Jeff Bolz vulkan: fix im2col overflowing maxworkgroupcount (...
2025-12-21 Jeff Bolz vulkan/cuda: fix topk_moe with exp_probs_b (#18071)
2025-12-21 Jeff Bolz vulkan: support GGML_UNARY_OP_XIELU (#18062)
2025-12-21 Jeff Bolz vulkan: in graph_optimize, try to group ADD operations...
2025-12-21 lovedheart Vulkan: some improvement on mul_mat_iq2_xs (#18031)
2025-12-21 Daniel Bevenius docs : fix links in parsing.md (#18245)
2025-12-21 Aldehir Rojas common : reorganize includes to prioritize vendored...
2025-12-21 Xuan-Son Nguyen server: add auto-sleep after N seconds of idle (#18228)
2025-12-20 Jeff Bolz tests: Avoid floating point precision false positives...
2025-12-20 Jeff Bolz test-backend-ops: improve msvc build time (#18209)
2025-12-20 Aadeshveer... Added comments explaining thread block size selection...
2025-12-20 Oleksandr Kuvshynov server : [easy] fix per round speculative decode loggin...
2025-12-20 Xuan-Son Nguyen server: support load model on startup, support preset...
2025-12-19 Sigbjørn Skjæret ci : remove non-windows zip artifacts (#18201)
2025-12-19 Sigbjørn Skjæret ci : only save ccache on master (#18207)
2025-12-19 Alfred ggml-hexagon: Implement true Q8_0 quantization on Hexag...
2025-12-19 Pascal arg: fix order to use short form before long form ...
2025-12-19 Julius Tischbein llama : Changing off_t to size_t for Windows (#18204)
2025-12-19 Aman Gupta server: friendlier error msg when ctx < input (#18174)
2025-12-19 Xuan-Son Nguyen presets: refactor, allow cascade presets from different...
2025-12-19 Aleksander... webui: Add editing attachments in user messages (#18147)
2025-12-19 Daniel Bevenius model-conversion : add verbose flag in run-org-model...
2025-12-19 Naco Siren android: fix missing screenshots for Android.md (#18156)
2025-12-19 Jeff Bolz vulkan: Add perf logger mode with concurrency (#17944)
2025-12-18 Xuan-Son Nguyen model : add ASR support for LFM2-Audio-1.5B (conformer...
2025-12-18 Pascal webui: display prompt processing stats (#18146)
2025-12-18 Taimur Ahmad ggml-cpu: extend support for RVV floating-point kernels...
2025-12-18 Xuan-Son Nguyen arg: fix ASAN error on sampler_type_names empty (#18167)
2025-12-18 Sigbjørn Skjæret gguf-py : use copy-on-write mode for localtensor (...
2025-12-18 yulo remove i_major_dual (#18157)
2025-12-18 Aleksander... webui: Fix selecting generated output issues during...
2025-12-18 Kim S. webui: fix chat screen shadow width (#18010)
2025-12-18 Johannes Gäßler llama: offload output layer to GPU first (#18148)
2025-12-18 Sigbjørn Skjæret convert : sort and use file parts from model index...
2025-12-18 Julius Tischbein llama : Async DirectIO model loading on Linux (#18012)
2025-12-17 Shouyu ggml-hexagon: swiglu_oai operation (#18114)
2025-12-17 Sigbjørn Skjæret convert : force patch_merger tensors to f16/f32 (#18124)
2025-12-17 Pascal server: (webui) add --webui-config (#18028)
2025-12-17 Xuan-Son Nguyen server: (router) disable SSL on child process (#18141)
2025-12-17 Johannes Gäßler llama-fit-params: fix memory print (#18136)
2025-12-17 Kim S. webui: fix chat header width when sidebar is closed...
2025-12-17 Shouyu ggml-hexagon: gelu operation (#17921)
2025-12-17 Georgi Gerganov common : restore grammar-based rejection sampling ...
2025-12-17 Johannes Gäßler common: clarify instructions for bug reports (#18134)
2025-12-17 HonestQiao model: fix GLM-ASR-Nano-2512 load error (#18130) (...
2025-12-17 Xuan-Son Nguyen server: (router) allow child process to report status...
2025-12-17 Piotr Wilkin... Extend run-org-model.py, add (a) batching (b) loading...
2025-12-17 Johannes Gäßler Github: ask for -v logs for params_fit [no ci] (#18128)
2025-12-17 Alberto Cabrera... ggml-cpu: ARM64: repack version of q8_0 (dotprod and...
2025-12-17 Tarek Dakhran model: fix LFM2_MOE missing tensors (#18132)
2025-12-17 Sigbjørn Skjæret ci : clean up webui jobs (#18116)
2025-12-17 Pascal common: fix --override-kv to support comma-separated...
2025-12-17 yulo HIP: Refactor mma for RDNA and CDNA (#17990)
2025-12-17 Naco Siren llama.android : Rewrite Android binding (w/o cpu_featur... upstream/0.0.7446
2025-12-17 TrevorS arg: allow -kvu flag for llama-perplexity (#18117)
2025-12-17 Aadeshveer... ggml : use WARP_SIZE/2 for argmax reduction offset...
2025-12-17 Yuri Khrustalev gguf-py : allow converting multi-tensor models from...
2025-12-16 Johannes Gäßler llama-fit-params: force disable mlock (#18103)
2025-12-16 Johannes Gäßler llama-fit-params: lower ctx size for multi GPU (#18101)
2025-12-16 Johannes Gäßler llama-fit-params: fix underflow for dense models (...
2025-12-16 Johannes Gäßler llama-fit-params: QoL impr. for prints/errors (#18089)
2025-12-16 Xuan-Son Nguyen model: fix LFM2 missing tensors (#18105)
2025-12-16 Johannes Gäßler llama: fix early stop in params_fit if ctx is set ...
2025-12-16 yifant-code server: fix crash when batch > ubatch with embeddings...
2025-12-16 Daniel Bevenius model-conversion : remove -fa option in model card...
2025-12-16 Xuan-Son Nguyen arch: refactor LLM_TENSOR_NAMES (#18051)
2025-12-16 Xuan-Son Nguyen arg: clarify auto kvu/np being set on server (#17997)
2025-12-16 Piotr Wilkin... Optimization: Qwen3 next autoregressive pass (#17996)
2025-12-16 Andrew Aladjev CLI: fixed adding cli and completion into docker contai...
2025-12-16 2114L3 server: Update README.md incorrect argument (#18073)
2025-12-16 Xuan-Son Nguyen model: support GLM4V vision encoder (#18042)
2025-12-16 Daniel Bevenius model-conversion : add note about verifying previous...
2025-12-16 Daniel Bevenius model-conversion : use CONVERTED_EMBEDDING_MODEL for...
2025-12-16 Aldehir Rojas common : add nemotron 3 parsing (#18077)
2025-12-16 Francisco Herrera added note for old Intel hardware pre sycl (#18017)
2025-12-16 Georgi Gerganov security : add collaborator guidance (#18081)
2025-12-16 Chris Peterson llama: Include algorithm header needed for C++23 (...
2025-12-16 Georgi Gerganov graph : reuse SSM graphs (#16490)
2025-12-16 Sigbjørn Skjæret ci : separate webui from server (#18072)
2025-12-16 Aleksander... webui: Improve copy to clipboard with text attachments...
2025-12-16 Aleksander... webui: Add setting to always show sidebar on Desktop...
2025-12-16 Daniel Bevenius llama : add support for NVIDIA Nemotron 3 Nano (#18058)
2025-12-16 Darius Lukas Webui: Disable attachment button and model selector...
2025-12-15 Sigbjørn Skjæret convert : move rope_parameters to TextModel class ...
2025-12-15 Shouyu ggml-hexagon: mm for mtmd (#17894)
2025-12-15 HelloKS model : add KORMo model (#18032)
2025-12-15 ssweens kv-cache: Fix state restore fragmented cache (#17982)
2025-12-15 Pascal Fix unreadable user markdown colors and truncate long...
2025-12-15 Jeremy Demeule metal: use shared buffers on eGPU (#17866)
2025-12-15 Xuan-Son Nguyen mtmd: refactor audio preprocessing (#17978)
2025-12-15 Andrew Aladjev cli: fixed dead links to tools/main for cli and complet...
2025-12-15 Thomas Jarosch webui: add "delete all conversations" button to import...
2025-12-15 Johannes Gäßler llama: automatically set parameters not set by the...
2025-12-15 Neo Zhang Jianyu [SYCL] Support gpt-oss by OPs add-id, mul_mat for mxfp4...
2025-12-15 piDack model : add glm-asr support (#17901)