]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-05-20 Georgi Gerganovkv-cache : add SWA support (#13194)
2025-05-20 Xinpeng DouCANN: Update CANN model support (#13162)
2025-05-20 Nicolò Scipionesycl : Overcoming workaround for mmap() allocation...
2025-05-19 psocolovskycommon : add load_progress_callback (#13617)
2025-05-19 0cc4mVulkan: Add f32 accumulator support to quantized mul...
2025-05-19 Alberto Cabrera... sycl : backend documentation review (#13544)
2025-05-19 Xuan-Son Nguyenmtmd : add vision support for llama 4 (#13282)
2025-05-19 Alberto Cabrera... ci : upgraded oneAPI version in SYCL workflows and...
2025-05-19 Georgi Gerganovsync : ggml
2025-05-19 Johannes Gäßlermnist: fix segmentation fault (ggml/1227)
2025-05-19 Diego Devesaggml : fix apple OS check in ggml_print_backtrace ...
2025-05-19 Daniel Tangggml : Fix missing backtrace on Linux (ggml/1228)
2025-05-19 Nickfix: check model pointer validity before use (#13631)
2025-05-19 Chenguang LiCANN: Support MOE Model MUL_MAT_ID (#13042)
2025-05-17 Isaac McFadyenserver : added --no-prefill-assistant flag (#13608)
2025-05-17 Gilad S.cmake: use the current build config for vulkan-shaders...
2025-05-17 Georgi Gerganovparallel : add option for non-shared and larger prompts...
2025-05-17 Jeff Bolzvulkan: move common FA code to flash_attn_base.comp...
2025-05-17 Jeff Bolzvulkan: use scalar FA rather than coopmat2 when N=...
2025-05-16 Zllguidance : official v0.7.20 release (no actual change...
2025-05-16 Xuan-Son Nguyenserver : do not return error out of context (with ctx...
2025-05-16 Xuan-Son Nguyenwebui : improve accessibility for visually impaired...
2025-05-16 Xuan-Son Nguyenreadme : add list of dependencies and their license...
2025-05-16 Diego Devesareleases : use arm version of curl for arm releases...
2025-05-16 Georgi Gerganovmetal : add FA-vec kernel for head size 64 (#13583)
2025-05-16 Diego Devesallama : print hint when loading a model when no backend...
2025-05-16 Sigbjørn Skjæretci : add ppc64el to build-linux-cross (#13575)
2025-05-16 Łukasz Ślusarczyksycl : fixed compilation warnings (#13582)
2025-05-15 Olivier Chafikminja: sync (qwen3) (#13573)
2025-05-15 Diego Devesagguf : use ggml log system (#13571)
2025-05-15 Daniel Tanggguf-py : fix disconnect-before-connect in editor-gui...
2025-05-15 Xuan-Son Nguyenconvert : fix conversion for llama 4 (#13567)
2025-05-15 Atharva Dubeysycl: simplify bin_bcast_kernel (#13383)
2025-05-15 Svetlozar Georgievsycl: reordered Q4_K MMVQ (#13109)
2025-05-15 Łukasz Ślusarczyksycl: use oneDNN for matrices multiplication (#12972)
2025-05-15 Diego Devesallama-bench : fix -ot with dl backends (#13563)
2025-05-15 Xuan-Son Nguyenwebui : handle PDF input (as text or image) + convert...
2025-05-15 Piotr Wilkin... server : proper error handling for missing elements...
2025-05-15 Georgi Gerganovbench : handle decode errors (#13548)
2025-05-15 Olivier Chafik`server`: inject date_string in llama 3.x template...
2025-05-14 Georgi Gerganovkv-cache : fix out-of-bounds view during reserve graph...
2025-05-14 Yibo Caiarm64: optimize q6_k_q8_k kernel with i8mm (#13519)
2025-05-14 Olivier Chafik`common`: add partial regex support (#12808)
2025-05-14 Sigbjørn Skjæreteditorconfig : fix trailing whitespace from #13542...
2025-05-14 Gilad S.fix: crash when calling `llama_state_get_size` on a...
2025-05-14 Johannes GäßlerCUDA: fix crash on large batch size for quant. MoE...
2025-05-14 Diego Devesallama : fix quantize with dl backends (#13539)
2025-05-14 Johannes GäßlerCUDA: faster Deepseek FA, add Turing support (#13435)
2025-05-14 Gabe Goodhartfix: Move build_inp_pos to the top of the graph section...
2025-05-14 Georgi Gerganovserver : passthrough the /models endpoint during loadin...
2025-05-14 Xuan-Son Nguyenserver : fix cache_tokens bug with no cache_prompt...
2025-05-14 bandoticmake: simplify vulkan shader test logic (#13263)
2025-05-14 Jeff Bolzvulkan: KHR_coopmat flash attention (#13506)
2025-05-14 Xuan-Son Nguyenwebui : use fflate for more deterministic gzip compress...
2025-05-14 Luca Stefaniwebui: Allow pasting file from clipboard (#13526)
2025-05-14 ddpasadocs: Update link to ggml-org in multimodal.md (#13513)
2025-05-14 Sigbjørn Skjæretscripts : fix compare-llama-bench.py show parameter...
2025-05-14 Jeff Bolzvulkan: workaround FA compile failures on macos (#13517)
2025-05-13 Ed Addarioquantize : improve tensor-type pattern matching (#13033)
2025-05-13 Xuan-Son Nguyenclip : clip.h become private API (⚠️ breaking change...
2025-05-13 Georgi Gerganovmetal : use FA-vec kernel up to batch size 20 (#13496)
2025-05-13 Georgi Gerganovmetal : optimize multi-sequence FA vec kernel (#13493)
2025-05-13 Dan Johanssonggml-cpu: Update KleidiAI to v1.6 and fix include direc...
2025-05-13 Georgi Gerganovbatched-bench : fix pp batch contents (#13492)
2025-05-13 Xuan-Son Nguyenmtmd : remove libllava, remove clip-quantize-cli (...
2025-05-13 Sigbjørn Skjæretscripts : support arbitrary input file formats in compa...
2025-05-13 Gabe Goodhartmodel : Granite MoE shared (#13269)
2025-05-13 Georgi Gerganovsync : ggml
2025-05-12 Diego Devesallama-bench : add defrag-thold, check for invalid range...
2025-05-12 lhezopencl: remove unnecessary assert for `add` (#13257)
2025-05-12 Xuan-Son Nguyenclip : cap max image size 1024 for qwen vl model (...
2025-05-12 Johannes Gäßlerllama/ggml: add LLM training support (#10544)
2025-05-12 Georgi Gerganovcontext : fix state io for memory-less contexts (#13470)
2025-05-12 Anudit Nagarserver : allow content to be null in oaicompat_completi...
2025-05-12 Diego Devesallama-bench : accept ranges for integer parameters...
2025-05-12 Dan Johanssonggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel...
2025-05-12 Johannes GäßlerCUDA: fix misaligned synchronization in FA (#13469)
2025-05-12 Xuan-Son Nguyenggml : add mrope kernel for metal (#13457)
2025-05-12 Atharva Dubeyenable dpcpp nightly builds with libraries (#13406)
2025-05-11 Citymtmd : Use RMS norm for InternVL 3 38B and 78B mmproj...
2025-05-11 Anthony Umfertools : fix uninitialized llama_batch in server (#13436)
2025-05-11 Sigbjørn Skjæretscripts : exit compare-llama-bench.py gracefully when...
2025-05-11 Johannes GäßlerCUDA: fix crash with partial offloading of MoE (#13439)
2025-05-11 David HuangAdd `--no-op-offload` to improve `-ot` pp perf in MoE...
2025-05-11 Citymtmd : support InternVL 3 38B and 78B mmproj (#13443)
2025-05-11 Xuan-Son Nguyenmtmd : move helpers to dedicated file (#13442)
2025-05-10 Thomas Germerdocs : Fix typo in InternVL3 model name (#13440)
2025-05-10 Johannes GäßlerCUDA: fix race conditions FlashAttention kernels (...
2025-05-10 Sigbjørn Skjæretvocab : add ByteDance-Seed/Seed-Coder (#13423)
2025-05-10 Xuan-Son Nguyenmtmd : add hard limit on image resolution for qwen2vl...
2025-05-10 Xuan-Son Nguyenserver : update docs (#13432)
2025-05-10 Sigbjørn Skjæretllguidance : set tokenizer slices to default (#13424)
2025-05-10 Thammachart... ci: free_disk_space flag enabled for intel variant...
2025-05-10 Xuan-Son Nguyenmtmd : support InternVL 2.5 and 3 (#13422)
2025-05-10 Johannes GäßlerCUDA: fix FlashAttention on Turing (#13415)
2025-05-10 Xuan-Son Nguyenarg : add env var to control mmproj (#13416)
2025-05-10 Jeff Bolzvulkan: scalar flash attention implementation (#13324)
2025-05-09 Helton Reischore(llguidance): use tagged version that does not...
2025-05-09 Xuan-Son Nguyen server : vision support via libmtmd (#12898)
2025-05-09 Alberto Cabrera... sycl : implementation of reordered Q4_0 MMVQ for Intel...
next