git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-07-28 Radoslav Gerganov  llama-bench : use local GPUs along with RPC servers...
2025-07-28 xctan  ggml-cpu : deduplicate scalar implementations (#14897)
2025-07-28 Akarshan Biswas  SYCL: Add set_rows support for quantized types (#14883)
2025-07-28 Xuan-Son Nguyen  mtmd : add support for Voxtral (#14862)
2025-07-28 Johannes Gäßler  CUDA: fix pointer incrementation in FA (#14916)
2025-07-28 Dongliang Wei  model : add support for SmallThinker series (#14898)
2025-07-28 Alberto Cabrera...  sycl: refactor quantization to q8_1 (#14815)
2025-07-28 Georgi Gerganov  ops : update BLAS (#14914)
2025-07-28 Georgi Gerganov  ops : update Metal (#14912)
2025-07-28 Georgi Gerganov  sync : ggml
2025-07-28 Kai Pastor  cmake : Indent ggml-config.cmake (ggml/1310)
2025-07-27 Ed Addario  quantize : update README.md (#14905)
2025-07-27 Ruben Ortlam  vulkan: add ops docs (#14900)
2025-07-27 Akarshan Biswas  SYCL: add ops doc (#14901)
2025-07-27 Daniel Bevenius  llama : clarify comment about pp and tg graphs [no...
2025-07-27 Erik Scholz  vulkan : add fp16 support for the conv_2d kernel (...
2025-07-27 Jeff Bolz  vulkan: skip empty set_rows to avoid invalid API usage...
2025-07-27 Gabriel Larson  model : make rope_yarn_log_mul optional for deepseek2...
2025-07-27 Shunta Saito  llama : fix kq_scale for the attention layers of PLaMo2...
2025-07-27 Aman Gupta  Docs: add instructions for adding backends (#14889)
2025-07-26 deepsek  HIP: Enable Matrix cores for MMQ Kernels, Enable stream...
2025-07-26 hipudding  CANN: Implement GLU ops (#14884)
2025-07-26 R0CKSTAR  musa: fix build warnings (unused variable) (#14869)
2025-07-25 Aaron Teo  ggml-cpu : disable GGML_NNPA by default due to instabil...
2025-07-25 Gabe Goodhart  metal: SSM_SCAN performance (#14743)
2025-07-25 lhez  opencl: add fused `rms_norm_mul` (#14841)
2025-07-25 wooksong  docs : update HOWTO‑add‑model.md for ModelBase and...
2025-07-25 Oliver Simons  ggml : remove invalid portPos specifiers from dot files...
2025-07-25 Georgi Gerganov  context : restore preemptive sched reset when LLAMA_SET...
2025-07-25 kiwi  mtmd : fix 32-bit narrowing issue in export-lora and...
2025-07-25 Chris Rohlf  rpc : check for null buffers in get/set/copy tensor...
2025-07-25 Diego Devesa  sched : fix multiple evaluations of the same graph...
2025-07-24 R0CKSTAR  musa: upgrade musa sdk to rc4.2.0 (#14498)
2025-07-24 Georgi Gerganov  sync : ggml
2025-07-24 Kai Pastor  cmake : fix usage issues (ggml/1257)
2025-07-24 Daniel Bevenius  ggml-cpu : remove stdlib include from repack.cpp (ggml...
2025-07-24 Georgi Gerganov  context : perform output reorder lazily upon access...
2025-07-24 Xuan-Son Nguyen  chat : fix kimi-k2 chat template (#14852)
2025-07-24 Alberto Cabrera...  sycl: fixed semantics of block offset calculation ...
2025-07-24 yummy  llama : fix MiniCPM inference after Granite Four change...
2025-07-24 Pouya  docs: add libcurl-dev install hint for Linux distros...
2025-07-24 Georgi Gerganov  metal : fix fusion across different encoders (#14849)
2025-07-24 Donghyeon Jeong  sycl: fix undefined variable in work group size check...
2025-07-23 jacekpoplawski  convert : text-only support for GLM-4.1V-9B-Thinking...
2025-07-23 Johannes Gäßler  CUDA: fix overflow in FA, tune performance (#14840)
2025-07-23 Johannes Gäßler  CUDA: fix compilation with GGML_CUDA_F16 (#14837)
2025-07-23 Sigbjørn Skjæret  ci : correct label refactor->refactoring (#14832)
2025-07-23 Johannes Gäßler  CUDA: fix quantized KV cache + multiple sequences ...
2025-07-23 Georgi Gerganov  tests : add non-cont K,V FA tests
2025-07-23 l3utterfly  memory : handle saving/loading null layers in recurrent...
2025-07-23 lixing-star  ggml: fix loongarch quantize_row_q8_1 error (#14827)
2025-07-23 chen fan  CANN: weight format to NZ for Ascend310P3 (#14407)
2025-07-23 Aman Gupta  CUDA: add fused rms norm (#14800)
2025-07-22 Csaba Kecskemeti  ggml : model card yaml tab->2xspace (#14819)
2025-07-22 Jeff Bolz  vulkan: fix rms_norm_mul to handle broadcasting dim0...
2025-07-22 Molly Sophia  llama : add model type detection for rwkv7 7B&14B ...
2025-07-22 Ed Addario  imatrix: add option to display importance score statist...
2025-07-22 stduhpf  Mtmd: add a way to select device for vision encoder...
2025-07-22 Sigbjørn Skjæret  cuda : implement bf16 cpy ops and enable bf16 cont...
2025-07-22 lhez  opencl: remove unreachable `return` (#14806)
2025-07-22 Molly Sophia  server : allow setting `--reverse-prompt` arg (#14799)
2025-07-21 R0CKSTAR  cuda: remove linking to cublasLt (#14790)
2025-07-21 Sigbjørn Skjæret  opencl: fix `im2col` when `KW!=KH` (#14803)
2025-07-21 rmatif  opencl: add conv2d kernel (#14403)
2025-07-21 Romain Biessy  sycl: Fix im2col (#14797)
2025-07-21 Charles Xu  kleidiai: add support for get_rows (#14676)
2025-07-21 Radoslav Gerganov  docs : fix backends table in README.md (#14796)
2025-07-21 Jeff Bolz  vulkan/cuda: Fix im2col when KW!=KH (#14789)
2025-07-21 Molly Sophia  llama : fix `--reverse-prompt` crashing issue (#14794)
2025-07-21 IsaacDynamo  server : add parse_special option to /tokenize endpoint...
2025-07-20 Aman Gupta  docs : fix link for tools/perplexity in README.md ...
2025-07-20 rspOverflow  Documentation: Further revisions to the Vulkan section...
2025-07-20 Aman Gupta  Clang-format: local files first + fix BinPacking (...
2025-07-19 0cc4m  Contrib: add 0cc4m as codeowner for Vulkan backend...
2025-07-19 Ervin Áron...  ggml: adds CONV_2D op and direct GEMM Vulkan implementa...
2025-07-19 compilade  imatrix : use GGUF to store importance matrices (#9400)
2025-07-19 Peter0x44  vulkan: Add logging for bf16 features to ggml_vk_print_...
2025-07-19 0cc4m  Vulkan: Fix fprintf format-security warning (#14770)
2025-07-19 rspOverflow  Documentation: Update build.md's Vulkan section (#14736)
2025-07-19 Georgi Gerganov  sync : ggml
2025-07-18 Georgi Gerganov  metal : fuse add, mul + add tests (#14596)
2025-07-18 Georgi Gerganov  graph : fix graph reuse reset of params (#14760)
2025-07-18 Georgi Gerganov  parallel : add option for different RNG seeds (#14757)
2025-07-18 Oliver Simons  cuda : Fix Gemma3n not executed as CUDA_GRAPH on NVGPUs...
2025-07-18 Georgi Gerganov  graph : avoid huge warm-up graphs for MoE models (...
2025-07-18 Georgi Gerganov  model : fix build after merge conflict (#14754)
2025-07-18 lgai-exaone  model : add EXAONE 4.0 support (#14630)
2025-07-18 Aman Gupta  CUDA: set_rows + cpy.cu refactor (#14712)
2025-07-18 Georgi Gerganov  graph : refactor context to not pass gf explicitly...
2025-07-18 Nexes the Elder  graph : Pass the graph placeholder message in debug...
2025-07-18 Neo Zhang Jianyu  use max work group size for device to replace the magic...
2025-07-17 Piotr Wilkin...  convert : fix Ernie4.5 MoE without shared experts ...
2025-07-17 Wroclaw  nix : use optionalAttrs for env mkDerivation attrset...
2025-07-17 Piotr Wilkin...  model: add Ernie 4.5 MoE support (#14658)
2025-07-17 Georgi Gerganov  kv-cache : fix k-shift for multiple streams (#14742)
2025-07-17 Georgi Gerganov  llama : reuse compute graphs (#14482)
2025-07-17 Tarek Dakhran  llama : fix parallel processing for lfm2 (#14705)
2025-07-17 Georgi Gerganov  kv-cache : opt mask set input (#14600)
2025-07-17 Georgi Gerganov  batch : fix uninitialized has_cpl flag (#14733)
2025-07-16 Sigbjørn Skjæret  ci : disable failing vulkan crossbuilds (#14723)