]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-09-28 Jeff Bolzvulkan: handle mat_mul with A matrix > 4GB (#16176)
2025-09-27 Jeff Bolzvulkan: support arbitrary KV dimension in flash attenti...
2025-09-27 Aclyvulkan : make the vulkan.hpp dynamic dispatcher instanc...
2025-09-27 Aleksander... Show message actions by default (#16289)
2025-09-27 Aman GuptaCUDA: mul_mat_id for mmf for bs <= 64 for f16 and bs...
2025-09-27 Johannes GäßlerCUDA: refactor and deduplicate vector FA kernels (...
2025-09-27 Dmytro Minochkinvulkan: throw system error instead of SIGABRT during...
2025-09-27 Adrien Gallouëtserver : remove old LLAMA_SERVER_SSL (#16290)
2025-09-27 Jeff Bolzvulkan: support GET_ROWS for k-quants (#16235)
2025-09-27 Adrien Gallouëtbuild : add LLAMA_OPENSSL option (#16287)
2025-09-26 Vinkalmodel : make minicpm embedding_scale, residual_scale...
2025-09-26 Aaron Teodevops: add s390x & ppc64le CI (#15925)
2025-09-26 Aleksander... Enhance text file detection logic for file attachments...
2025-09-26 Aleksander... Allow viewing conversations even when llama server...
2025-09-26 Isaac McFadyenwebui: switch to hash-based routing (alternative of...
2025-09-26 Aleksander... Always show message actions for mobile UI + improvement...
2025-09-26 Radoslav Gerganovcodeowners : add rgerganov as owner of RPC [no ci]...
2025-09-26 Aleksei Nikiforovmtmd : fix uninitialized variable in bicubic_resize...
2025-09-26 Georgi Gerganovmetal : report OOM errors (#16274)
2025-09-26 Adrien Gallouëtcommon : use cpp-httplib as a cURL alternative for...
2025-09-26 Adrien Gallouëtbuild : fix build-ios-device (#16257)
2025-09-26 Aaron Teoggml-cpu: implement MXFP4 SIMD for s390x (#16193)
2025-09-26 Radoslav Gerganovci : create git tags for released docker images (#16008)
2025-09-26 Daniel Beveniuscodeowners : add danbev as owner of build-xcframework...
2025-09-26 R0CKSTARmusa: upgrade musa sdk to 4.3.0 (#16240)
2025-09-26 R0CKSTARmusa: fix build warnings (#15611)
2025-09-25 Sigbjørn Skjæretmodel : add GroveMoE support (#15510)
2025-09-25 Aaron Teovendors: update miniaudio version (#16212)
2025-09-25 rtaluyevreadme : update bindings (#16144)
2025-09-25 Aman GuptaCUDA: add a fused top-K MoE kernel (#16130)
2025-09-25 Daniel Beveniusmodel-conversion : add embedding prompt file support...
2025-09-25 Daniel Beveniusserver : add support for external server for tests...
2025-09-25 junchao-zhaoggml : fix loongarch lsx compilation error (#15864)
2025-09-25 Johannes Gäßlerdocs: fix typo [no ci] (#16244)
2025-09-25 Douglas Hanleyllama : add support for qwen3 reranker (#15824)
2025-09-25 Georgi Gerganovmetal : fuse NORM + MUL + ADD, support non-multiples...
2025-09-25 Georgi Gerganovmetal : relax reorder conditions (#16216)
2025-09-25 Georgi Gerganovmetal : restore im2col perf (#16219)
2025-09-25 Radoslav Gerganovrpc : use ggml logging facilities
2025-09-25 Aaron Teocodeowners: add ownership of zdnn backend [no ci] ...
2025-09-25 Eveci: run the x64 and arm ci on the github machines inste...
2025-09-25 Aaron Teodevops: fix s390x docker release failure (#16231)
2025-09-24 Aaron Teocodeowners: add ownership of zdnn backend [no ci] ...
2025-09-24 Johannes Gäßlerllama: print memory breakdown on exit (#15860)
2025-09-24 Aclyggml : split graph allocations according to backend...
2025-09-24 Tarek Dakhranmodel : add label for LiquidAI LFM2-2.6B model (#16204)
2025-09-24 Jie Fu (傅杰)model-conversion : make causal-verify-logits fails...
2025-09-24 Uilian Riescommon : add missing chrono header for common.cpp ...
2025-09-24 Sigbjørn Skjæretcodeowners : match all requirements files (#16214)
2025-09-24 Jie Fu (傅杰)model-conversion : run-org-model.py fails to run on...
2025-09-24 Daniel Beveniuscodeowners : use slash prefix for root files [no ci...
2025-09-24 Jie Fu (傅杰)model-conversion : fix the make targets in the README...
2025-09-23 Georgi Gerganovci : disable AMD workflows + update NVIDIA workflows...
2025-09-23 Georgi Gerganovci : enable Vulkan workflow on Mac (#16194)
2025-09-23 Xiangyan Sunggml-cpu: Respect cpumask settings (#16164)
2025-09-23 Sigbjørn Skjæretggml : fix uninitialized is_on_grid in quantize_row_iq3...
2025-09-23 Aaron Teozdnn: refactor codebase + add docs (#16178)
2025-09-23 Daniel Beveniuscodeowners : add @danbev to model-conversion example...
2025-09-23 Aaron Teodevops: add s390x containers (#15915)
2025-09-23 Daniel Beveniusggml-cpu : fix typo in gemm comments [no ci] (#16189)
2025-09-22 Gabe Goodhartfeat: Add conversion support in GraniteHybrid for non...
2025-09-22 Haiyue Wangclang-tidy : disable warning about performance enum...
2025-09-22 Sigbjørn Skjæretggml : implement set_rows with i32 index (#16159)
2025-09-22 Georgi Gerganovcodeowners : update + cleanup (#16174)
2025-09-22 Adrien Gallouëtcommon : enable `--offline` mode without curl support...
2025-09-22 Quentin Bramaswebui : fix handling incomplete chunks (#16107)
2025-09-22 GideonSerfembedding : fix typos in README (#16171)
2025-09-22 Haiyue Wangcommon : remove unused local variables (#16140)
2025-09-22 Georgi Gerganovggml : extend ggml_can_fuse to work with non-sequential...
2025-09-22 Georgi Gerganovggml : add ggml_op_is_empty (#16122)
2025-09-22 Xuan-Son Nguyencodeowners : update ownership for @ngxson and @allozuar...
2025-09-22 Shin-myoung... Vulkan: add conv_transpose_2d operation (#16022)
2025-09-22 Sigbjørn Skjæretcodeowners : claim responsibility for ci, models, gguf...
2025-09-22 Georgi Gerganovcontrib : update roles (#16113)
2025-09-22 Georgi Gerganovci : remove vulkaninfo calls (#16169)
2025-09-22 Georgi Gerganovci : use smaller model (#16168)
2025-09-22 Jeff Bolzvulkan: add RTE variants of exp shader (#16165)
2025-09-22 Georgi Gerganovci : adjust params for less runtime (#16167)
2025-09-22 Ruben Ortlamvulkan: vec dot matrix multiplication fix (#16151)
2025-09-21 lhezopencl: fix concat crash on win arm64 with Adreno ...
2025-09-21 lhezopencl: initial `q8_0` mv support (#15732)
2025-09-21 Georgi Gerganovci : add label for the RISC-V runner (#16150)
2025-09-21 Georgi Gerganovci : migrate ggml ci to self-hosted runners (#16116)
2025-09-21 Giuseppe Scrivanovulkan: optimize UMA buffer operations and fix driver...
2025-09-21 Jeff Bolzvulkan: fix validation error about VK_PIPELINE_CREATE_C...
2025-09-20 Georgi Gerganovsync : ggml upstream/0.0.6527
2025-09-20 Daniel Beveniusggml : introduce semantic versioning (ggml/1336)
2025-09-20 Gregor JasnyCUDA : conditionally add cuda architectures (ggml/1341)
2025-09-20 Ruben Ortlamvulkan: use vec dot for matrix matrix multiplications...
2025-09-20 Benniserver: fix SSE and OpenAI compatibility for error...
2025-09-19 ssweensllama-bench: add --devices and --list-devices support...
2025-09-19 shun095chat: Fix streaming parser for granite models (#15682)
2025-09-19 Aleksander... feat: Improve mobile UI for Settings Dialog (#16084)
2025-09-19 Xuan-Son Nguyenchat : fix build on arm64 (#16101)
2025-09-19 Xuan-Son Nguyenggml : refactor forward_dup for cpu backend (#16062)
2025-09-18 Adrien Gallouëtggml-amx : fix ggml_amx_init() on generic Linux (#16049)
2025-09-18 Adrien Gallouëtcmake : fix static linking for OpenMP on Unix-like...
2025-09-18 Shawn Guopencl: optimize mxfp4 kernels (#16037)
2025-09-18 Jeff Bolzrename optimize_graph to graph_optimize (#16082)
2025-09-18 Bowen HanCUDA: Optimize PAD_REFLECT_1D (#15957)
next