]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2026-02-16 Mathieu BaudierSimplify packaging debian/latest
2026-02-16 Mathieu BaudierIntroduce ggml RPC server
2026-02-16 Mathieu BaudierUpdate upstream
2026-02-16 Mathieu BaudierMerge tag 'upstream/0.0.8067' into debian/latest
2026-02-15 Georgi Gerganovsync : ggml upstream/0.0.8067
2026-02-15 Georgi Gerganovggml : bump version to 0.9.7 (ggml/1425)
2026-02-15 Georgi Gerganovggml : bump version to 0.9.6 (ggml/1423)
2026-02-15 David Friehscuda: optimize iq2xxs/iq2xs/iq3xxs dequantization ...
2026-02-15 Aaron Teodocs: update s390x build docs (#19643)
2026-02-15 Adrien Gallouëtbuild : remove LLAMA_HTTPLIB option (#19623)
2026-02-15 Daniel Beveniuscmake : check if KleidiAI API has been fetched (#19640)
2026-02-15 Georgi Gerganovcontext : fix output reorder with backend sampling...
2026-02-15 Georgi Gerganovggml : avoid UB in gemm ukernel (#19642)
2026-02-15 Aaron Teoggml-cpu: optimize ggml_vec_dot_bf16 for s390x (#19399)
2026-02-15 Aman Guptaggml-cpu: FA add GEMM microkernel (#19422)
2026-02-15 SamareshSinghcmake : fix KleidiAI install target failure with EXCLUD...
2026-02-14 Sigbjørn Skjæretconvert : ensure all models handle new experts count...
2026-02-14 Anav Prasadmtmd : Add Nemotron Nano 12B v2 VL support (#19547)
2026-02-14 Georgi Gerganovmodels : optimize qwen3next graph (#19375)
2026-02-14 Adrien Gallouëtggml : fix GGML_DEBUG with OpenMP (#19599)
2026-02-14 iMilNetBSD build support (#19589)
2026-02-14 Aleksander... webui: Architecture and UI improvements (#19596)
2026-02-14 agent-enemy-2llama : update LoRA API. + fix excessive graph reserves...
2026-02-14 Georgemmap: Fix Windows handle lifetime (#19598)
2026-02-14 Georgi Gerganovmetal : fix ACC op (#19427)
2026-02-14 Adrien Gallouëtscripts : use official split.py for cpp-httplib (#19588)
2026-02-14 Sigbjørn Skjæretconvert : store ffn_gate_inp_shexp as F32 (#19606)
2026-02-14 Adrien Gallouëtbuild : fix libtool call in build-xcframework.sh (...
2026-02-14 Jeff Bolzvulkan: support L2_NORM with contiguous rows (#19604)
2026-02-14 Jeff Bolzvulkan: support GGML_OP_SET (#19584)
2026-02-14 Sophonvulkan: Add vendor id for Qualcomm drivers (#19569)
2026-02-14 Max Krasnyanskyhexagon: further optimizations and refactoring for...
2026-02-13 Mengsheng Wugithub : add missing backends to issue templates (...
2026-02-13 Jeff Bolzvulkan: restore -inf check in FA shaders (#19582)
2026-02-13 Adrien Gallouëtcommon : update download code (#19573)
2026-02-13 Xuan-Son Nguyenmodel: support GLM MoE DSA arch (NOTE: indexer is not...
2026-02-13 Alberto Cabrera... Fix wrong memcpy length for block_interleave == 4 ...
2026-02-13 ymckifix vulkan ggml_acc only works in 3d but not 4d (#19426)
2026-02-13 Sigbjørn Skjæretsupport --verbose-prompt (#19576)
2026-02-13 Aman GuptaCUDA: loop over ne2*ne3 in case it overflows (#19538)
2026-02-13 Aleksander... webui: UI and routing fixes (#19586)
2026-02-13 Oliver SimonsCUDA: Do not mutate cgraph for fused ADDs (#19566)
2026-02-13 Pavan Shindedocs : fix broken link and typo (#19560)
2026-02-13 ymckimodel : Kimi Linear fix conv state update (#19531)
2026-02-13 Adrien Gallouëtllama : remove deprecated codecvt (#19565)
2026-02-13 Adrien Gallouëtvendor : update BoringSSL to 0.20260211.0 (#19562)
2026-02-13 Georgi Gerganovmemory : fix kv cache size for hybrid models (#19559)
2026-02-13 Georgi Gerganovmetal : improve concurrency (#19555)
2026-02-13 Georgi Gerganovmetal : support GGML_OP_SET (#19548)
2026-02-12 Shupei Fanhexagon: fix typo in vtcm_needs_release (#19545)
2026-02-12 lhezopencl: add basic support for q4_1 (#19534)
2026-02-12 Georgi Gerganovargs : add -kvu to llama-parallel (#19577)
2026-02-12 Aleksander... webui: Add switcher to Chat Message UI to show raw...
2026-02-12 Adrien Gallouëtvendor : update cpp-httplib (#19537)
2026-02-12 Christian Schmitzllama : update outdated comment in llama.h (#19428)
2026-02-12 Aleksander... (webui) FEATURE: Enable adding or injecting System...
2026-02-12 Daniel Beveniusscripts : add support for forks in pr2wt.sh (#19540)
2026-02-12 Aleksander... (webui) REFACTOR: UI primitives and polish (#19551)
2026-02-12 Aleksander... WebUI Architecture Cleanup (#19541)
2026-02-12 Georgi Gerganovmetal : update sum_rows kernel to support float4 (...
2026-02-12 Mario LimoncielloAdd a workaround for compilation with ROCWMMA_FATTN...
2026-02-12 RichardScottOZserver : fix typo in README.md for features list (...
2026-02-12 TriDefenderdocs : update path in snapdragon README.md (#19533)
2026-02-12 Max Krasnyanskyhexagon: further optimization and tuning of matmul...
2026-02-12 Adrien Gallouëtcommon : replace deprecated codecvt using parse_utf8_co...
2026-02-11 lhezopencl: add general Q6_K mm and Q4_K mv (#19347)
2026-02-11 Georgi Gerganovggml : unary ops support non-cont src0 + metal F16...
2026-02-11 Daniel Beveniuscommon : remove unused token util functions (#19506)
2026-02-11 AesSedaimodel: Add Kimi-K2.5 support (#19170)
2026-02-11 Daniel Beveniusbuild : fix case in dSYMs path for build-macos [no...
2026-02-11 Georgi Gerganovmetal : extend l2_norm support for non-cont src0 (...
2026-02-11 Johannes Gäßlerdocs: ban AI for issues and discussions [no CI] (#19512)
2026-02-11 Adrien Gallouëtcommon : improve download error reporting (#19491)
2026-02-11 Max Krasnyanskyhexagon: Add ARGSORT, DIV, SQR, SQRT, SUM_ROWS, GEGLU...
2026-02-11 thecaptain789llama : correct typos 'occured' and 'occurences' (...
2026-02-11 Georgi Gerganovmodel : fix wavtokenizer embedding notions (#19479)
2026-02-11 Georgi Gerganovggml : extend bin bcast for permuted src1 (#19484)
2026-02-11 Georgi Gerganovmetal : consolidate unary ops (#19490)
2026-02-11 Daniel Beveniusllama : refactor sampling_info to use buffer_view templ...
2026-02-10 Oliver SimonsCUDA : Update CCCL-tag for 3.2 to final release from...
2026-02-10 Nikhil Jain[WebGPU] Plug memory leaks and free resources on shutdo...
2026-02-10 JJJYmmmmodels : support qwen3.5 series (#19468)
2026-02-10 Xuan-Son Nguyentest: fix IMROPE perf test case (#19465)
2026-02-10 Alberto Cabrera... ggml-cpu: arm64: q6_K repack gemm and gemv (and generic...
2026-02-10 k4ss4nggml : use noexcept overload for is_regular_file in...
2026-02-10 Piotr Wilkin... convert : move experts permutation from Qwen2MoeModel...
2026-02-10 Daniel Beveniustts : fix typos in README.md [no ci] (#19463)
2026-02-10 Raul TorresCANN: Remove unnecessary wrapper for `gml_backend_buft_...
2026-02-10 hipuddingCANN: implement quantized MUL_MAT_ID for MoE models...
2026-02-10 Georgi Gerganovcuda : extend GGML_OP_PAD to work with non-cont src0...
2026-02-09 Xuan-Son Nguyenchat: fix case where template accepts type content...
2026-02-09 Tarek Dakhranmtmd: Implement tiling for LFM2-VL (#19454)
2026-02-09 손희준Server: log when converting requests to chat completion...
2026-02-09 Sascha Rogmannspec : remove check rate (#19377)
2026-02-09 Georgi Gerganovci : add metal server workflows (#19293)
2026-02-09 Georgi Gerganovrevert : "[Model] Qwen3.5 dense and MoE support (no...
2026-02-09 Kevin Pougetggml-virtgpu: add backend documentation (#19354)
2026-02-09 Hugocmake : add variable to skip installing tests (#19370)
2026-02-08 Piotr Wilkin... [Model] Qwen3.5 dense and MoE support (no vision) ...
2026-02-08 Oliver SimonsCUDA: Fix non-contig rope (#19338)
next