]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-08-14 Georgi Gerganoveval-callback : stop on first NaN (#15320)
2025-08-14 Diego Devesachat : include kwargs in template example (#15309)
2025-08-14 Daniel Beveniusllama : add 18-layer model type for Gemma 3-270m (...
2025-08-14 simevodevops : fix compile bug when the BASE_CUDA_DEV_CONTAIN...
2025-08-14 uvosHIP: Cleanup hipification header (#15285)
2025-08-14 Aldehir Rojasgpt-oss: implement harmony parsing (#15181) upstream/0.0.6164
2025-08-14 Christian Kastnerdocker : Enable GGML_CPU_ALL_VARIANTS for ARM (#15267)
2025-08-14 Georgi Gerganovreadme : update hot topics (#15315)
2025-08-14 Jeff Bolzvulkan: perf_logger improvements (#15246)
2025-08-14 Georgi Gerganovserver : add SWA checkpoints (#15293)
2025-08-14 Georgi Gerganovsync : ggml
2025-08-14 Jason Niggml: fix ggml_conv_1d_dw bug (ggml/1323)
2025-08-14 Georgi Gerganovtests : remove unused includes (ggml/0)
2025-08-14 kallewoofperplexity : provide a helpful hint for has_cpl case...
2025-08-14 Sigbjørn Skjæretcuda : fix GGML_CUDA_GRAPHS=OFF (#15300)
2025-08-14 Jonathan Graehlfinetune: SGD optimizer, more CLI args (#13873)
2025-08-14 kallewoofperplexity: give more information about constraints...
2025-08-13 uvosHIP: bump requirement to rocm 6.1 (#15296)
2025-08-13 Bas Nijholtfix(nix): remove non-functional llama-cpp cachix cache...
2025-08-13 Sigbjørn Skjæretserver : enable -td and -tbd parameters (#15172)
2025-08-13 Juddggml : update `ggml_rope_multi` (#12665)
2025-08-13 Copilot common : add --override-tensor-draft, --cpu-moe-draft...
2025-08-13 Aldehir Rojasserver : filter out harmony thought messages (#15278)
2025-08-13 Ali Tariqci : Added CI with RISC-V RVV1.0 Hardware (#14439)
2025-08-13 Sigbjørn Skjæretci : add more python requirements to copilot-setup...
2025-08-13 Georgi Gerganovggml : repack block_iq4_nlx8 (#14904)
2025-08-13 Oliver SimonsCUDA: Optimize `reduce_rows_f32` kernel, leading up...
2025-08-13 Sigbjørn Skjæretci : add copilot-setup-steps.yml (#15214)
2025-08-13 Tak-RSggml-rpc: chunk send()/recv() to avoid EINVAL for very...
2025-08-12 uvosHIP: disable sync warp shuffel operators from clr amd_w...
2025-08-12 Romain Biessysycl: Fix and disable more configurations of mul_mat...
2025-08-12 rmatifopencl: allow mixed f16/f32 `add` (#15140)
2025-08-12 Aman GuptaCUDA cmake: add `-lineinfo` for easier debug (#15260)
2025-08-12 Chenguang LiCANN: GGML_OP_CPY optimization (#15070)
2025-08-12 R0CKSTARmusa: fix failures in test-backend-ops for mul_mat_id...
2025-08-11 hipuddingCANN: Add broadcast for softmax and FA (#15208)
2025-08-11 rainredmtmd : Fix MinicpmV model converter and clip to avoid...
2025-08-11 Xuan-Son Nguyenchat : hotfix gpt-oss jinja raising an exception (...
2025-08-11 Xuan-Son Nguyenserver : allow specifying reasoning_format in HTTP...
2025-08-11 Zagajreadme : update infra list (#15234)
2025-08-11 Georgi Gerganovkv-cache : fix seq_rm with seq_id == -1 (#15226)
2025-08-11 Daniel Beveniuskv-cache : log (debug) all streams in find_slot (#15176)
2025-08-11 Sigbjørn Skjæretconvert : fix merge conflicts (#15229)
2025-08-11 Daniel Beveniusperplexity : update comments/error msg to use decode...
2025-08-11 Julien Denizeconvert : improve Mistral models integration (#14737)
2025-08-11 Charles Xukleidiai: fix unsigned overflow bug (#15150)
2025-08-09 David Zhaocuda: refactored ssm_scan and use CUB (#13291)
2025-08-09 Aman GuptaCUDA: add attention sinks for tile and wmma (#15178)
2025-08-08 compiladegguf-py : add Numpy MXFP4 de/quantization support ...
2025-08-08 Johannes Gäßlerserver-bench: external OAI servers, sqlite (#15179)
2025-08-08 AN Longggml : fix field name when new ggml_backend (#14944)
2025-08-08 Olivier Chafikvendor: sync minja (#15161)
2025-08-08 Johannes GäßlerCUDA: attention sinks for mma FlashAttention (#15157)
2025-08-08 lhezopencl: support sink in `soft_max` (attn sinks) (#15152)
2025-08-07 Xuan-Son Nguyenconvert : support non-mxfp4 HF model (#15153)
2025-08-07 Jeff Bolzvulkan: support fattn sinks (#15126)
2025-08-07 Jeff Bolzvulkan: Add env var to disable host visible vidmem...
2025-08-07 RunningLeonllama : Support intern-s1 (#14875)
2025-08-07 uvosHIP: add cmake option to enable compiler output of...
2025-08-07 Christian Kastnerggml: Skip backend library linking code when GGML_BACKE...
2025-08-07 Johannes GäßlerCUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)
2025-08-07 Johannes Gäßlerscripts: fix crash when --tool is not set (#15133)
2025-08-07 Daniel Beveniusrequirements : fix PyTorch uint64 compatibility (#15134)
2025-08-06 Reese Levineggml: Add basic SET_ROWS support in WebGPU (#15137)
2025-08-06 rmatiffix profiling crash (#15072)
2025-08-06 lhezopencl: add `swiglu_oai` and `add_id` (#15121)
2025-08-06 Sachin Desaichat : support Granite model reasoning and tool call...
2025-08-06 Juk ArmstrongFixed name `-override-tensors` to `-override-tensor...
2025-08-06 Diego Devesaggml : fix fallback to CPU for ununsupported ops (...
2025-08-06 Sigbjørn Skjæretchat : fix yandex chat template (#15116)
2025-08-06 stevenkuangchat : fix hunyuan auto-detection (#15114)
2025-08-06 Chenguang LiCANN: add support for ACL Graph (#15065)
2025-08-05 Reese Levineggml: WebGPU disable SET_ROWS for now (#15078)
2025-08-05 Georgi Gerganovllama : add gpt-oss (#15091)
2025-08-05 Sigbjørn Skjæretchat : only remove double bos/eos if added (#15086)
2025-08-05 Georgi Gerganovreadme : update hot topics (#15097)
2025-08-05 Romain Biessysycl: fix mul_mat selection (#15092)
2025-08-05 Juk ArmstrongFix `glm4moe` bug (#15088)
2025-08-05 Alex Wuwebui: fix markdown table (#15081)
2025-08-05 compiladecontext : fix index overflow on huge outputs (#15080)
2025-08-04 Diego Devesallama : add --n-cpu-moe option (#15077)
2025-08-04 compiladeimatrix : warn when GGUF imatrix is saved without ...
2025-08-04 Christian Kastnercmake: Add GGML_BACKEND_DIR option (#15074)
2025-08-04 Sigbjørn Skjæretgguf-py : add --chat-template-file to gguf_new_metadata...
2025-08-04 Sammodel: support GLM 4.5 family of models (#14939)
2025-08-04 Sigbjørn Skjæretquantize : fix confusing error message if ftype is...
2025-08-04 Reese Levineggml: WebGPU backend host improvements and style fixing...
2025-08-04 Jeff Bolzvulkan: fix build when using glslang that does not...
2025-08-03 compiladeimatrix : use GGUF by default (#14842)
2025-08-03 compiladeimatrix : fix 3d activation handling for hybrid and...
2025-08-03 compiladememory : handle kv_unified for hybrid models (#15050)
2025-08-03 Csaba Kecskemetivocab : JetBrains Mellum pre-tokenizer (#15045)
2025-08-03 Gabriel Larsonmodel : add text-only support for Kimi-VL (and find...
2025-08-03 Jeff Bolzvulkan: Use coopmat2 for conv2d (#14982)
2025-08-02 lhezopencl: fix adreno compiler detection logic (#15029)
2025-08-02 Johannes GäßlerCUDA: use mma FA kernel for gqa > 4 on RTX 4000 (#15035)
2025-08-02 leejetcuda: make im2col a little faster (#15025) upstream/0.0.6073
2025-08-02 Daniel Beveniuskv-cache : skip alignment of n_stream in kv-cache log...
2025-08-02 Georgi Gerganovllama : enable LLAMA_SET_ROWS=1 by default (#14959)
2025-08-02 Georgi Gerganovcuda, sycl : fix batched gemm when ne02 == 1 && ne03...
next