| 2026-02-16 |
Mathieu Baudier | Update upstream debian/latest |
commit | commitdiff | tree |
| 2026-02-16 |
Mathieu Baudier | Merge tag 'upstream/1.8.3+155' into debian/latest |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | talk-llama : sync llama.cpp upstream/1.8.3+155 |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | sync : ggml |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | models : optimize qwen3next graph (llama/19375) |
commit | commitdiff | tree |
| 2026-02-15 |
Adrien Gallouët | ggml : fix GGML_DEBUG with OpenMP (llama/19599) |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | metal : fix ACC op (llama/19427) |
commit | commitdiff | tree |
| 2026-02-15 |
Jeff Bolz | vulkan: support L2_NORM with contiguous rows (llama... |
commit | commitdiff | tree |
| 2026-02-15 |
Jeff Bolz | vulkan: support GGML_OP_SET (llama/19584) |
commit | commitdiff | tree |
| 2026-02-15 |
Sophon | vulkan: Add vendor id for Qualcomm drivers (llama/19569) |
commit | commitdiff | tree |
| 2026-02-15 |
Max Krasnyansky | hexagon: further optimizations and refactoring for... |
commit | commitdiff | tree |
| 2026-02-15 |
Jeff Bolz | vulkan: restore -inf check in FA shaders (llama/19582) |
commit | commitdiff | tree |
| 2026-02-15 |
Alberto Cabrera... | Fix wrong memcpy length for block_interleave == 4 ... |
commit | commitdiff | tree |
| 2026-02-15 |
ymcki | fix vulkan ggml_acc only works in 3d but not 4d (llama... |
commit | commitdiff | tree |
| 2026-02-15 |
Aman Gupta | CUDA: loop over ne2*ne3 in case it overflows (llama... |
commit | commitdiff | tree |
| 2026-02-15 |
Oliver Simons | CUDA: Do not mutate cgraph for fused ADDs (llama/19566) |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | metal : improve concurrency (llama/19555) |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | metal : support GGML_OP_SET (llama/19548) |
commit | commitdiff | tree |
| 2026-02-15 |
Shupei Fan | hexagon: fix typo in vtcm_needs_release (llama/19545) |
commit | commitdiff | tree |
| 2026-02-15 |
lhez | opencl: add basic support for q4_1 (llama/19534) |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | metal : update sum_rows kernel to support float4 (llama... |
commit | commitdiff | tree |
| 2026-02-15 |
Mario Limonciello | Add a workaround for compilation with ROCWMMA_FATTN... |
commit | commitdiff | tree |
| 2026-02-15 |
Max Krasnyansky | hexagon: further optimization and tuning of matmul... |
commit | commitdiff | tree |
| 2026-02-15 |
lhez | opencl: add general Q6_K mm and Q4_K mv (llama/19347) |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | ggml : unary ops support non-cont src0 + metal F16... |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | metal : extend l2_norm support for non-cont src0 (llama... |
commit | commitdiff | tree |
| 2026-02-15 |
Max Krasnyansky | hexagon: Add ARGSORT, DIV, SQR, SQRT, SUM_ROWS, GEGLU... |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | ggml : extend bin bcast for permuted src1 (llama/19484) |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | metal : consolidate unary ops (llama/19490) |
commit | commitdiff | tree |
| 2026-02-15 |
Oliver Simons | CUDA : Update CCCL-tag for 3.2 to final release from... |
commit | commitdiff | tree |
| 2026-02-15 |
Nikhil Jain | Plug memory leaks and free resources on shutdown (llama... |
commit | commitdiff | tree |
| 2026-02-15 |
Alberto Cabrera... | ggml-cpu: arm64: q6_K repack gemm and gemv (and generic... |
commit | commitdiff | tree |
| 2026-02-15 |
k4ss4n | ggml : use noexcept overload for is_regular_file in... |
commit | commitdiff | tree |
| 2026-02-15 |
Raul Torres | CANN: Remove unnecessary wrapper for `gml_backend_buft_... |
commit | commitdiff | tree |
| 2026-02-15 |
hipudding | CANN: implement quantized MUL_MAT_ID for MoE models... |
commit | commitdiff | tree |
| 2026-02-15 |
Georgi Gerganov | cuda : extend GGML_OP_PAD to work with non-cont src0... |
commit | commitdiff | tree |
| 2026-02-15 |
Oliver Simons | CUDA: Fix non-contig rope (llama/19338) |
commit | commitdiff | tree |
| 2026-02-09 |
Nuno | ci: add vulkan docker image (#3644) |
commit | commitdiff | tree |
| 2026-02-09 |
Pádraic Slattery | chore: Update outdated GitHub Actions versions (#3646) |
commit | commitdiff | tree |
| 2026-02-09 |
Christian Kastner | cmake: Drop obsolete build-time configuration of backen... |
commit | commitdiff | tree |
| 2026-02-09 |
Sid Mohan | server : fix hardcoded /inference path in default HTML... |
commit | commitdiff | tree |
| 2026-02-09 |
Georgi Gerganov | ci : try fix mirrors (#3655) |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | talk-llama : sync llama.cpp |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | sync : ggml |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | metal : consolidate bin kernels (llama/19390) |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | metal : fix event synchronization in cpy_tensor_async... |
commit | commitdiff | tree |
| 2026-02-08 |
Abhijit Ramesh | ggml-webgpu: JIT compile binary operators and handle... |
commit | commitdiff | tree |
| 2026-02-08 |
Nechama Krashinski | sycl: add F16 support for GGML_OP_CEIL (llama/19306) |
commit | commitdiff | tree |
| 2026-02-08 |
Jeff Bolz | vulkan: For coopmat2 FA, use fp16 accumulators for... |
commit | commitdiff | tree |
| 2026-02-08 |
Jeff Bolz | vulkan: make FA mask/softcap enables spec constants... |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | metal : skip loading all-zero mask (llama/19337) |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | cuda : cuda graphs now compare all node params (llama... |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | metal : adaptive CPU/GPU interleave based on number... |
commit | commitdiff | tree |
| 2026-02-08 |
Jeff Bolz | vulkan: Preprocess FA mask to detect all-neg-inf and... |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | metal : add diag (llama/19330) |
commit | commitdiff | tree |
| 2026-02-08 |
Oleksandr Kuvshynov | vulkan: fix GPU deduplication logic. (llama/19222) |
commit | commitdiff | tree |
| 2026-02-08 |
Jeff Bolz | vulkan: Set k_load_shmem to false when K is too large... |
commit | commitdiff | tree |
| 2026-02-08 |
Jeff Bolz | vulkan: fix non-contig rope (llama/19299) |
commit | commitdiff | tree |
| 2026-02-08 |
will-lms | metal : add missing includes (llama/19348) |
commit | commitdiff | tree |
| 2026-02-08 |
Kevin Pouget | ggml-virtgpu: make the code thread safe (llama/19204) |
commit | commitdiff | tree |
| 2026-02-08 |
Aman Gupta | ggml-cpu: use LUT for converting e8->f32 scales on... |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | metal : add solve_tri (llama/19302) |
commit | commitdiff | tree |
| 2026-02-08 |
Ruben Ortlam | vulkan: disable coopmat1 fa on Nvidia Turing (llama... |
commit | commitdiff | tree |
| 2026-02-08 |
Aman Gupta | CUDA: use mmvq for mul-mat-id for small batch sizes... |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | metal : minor cleanup (llama/19251) |
commit | commitdiff | tree |
| 2026-02-08 |
Oliver Simons | CUDA: Fix loop unrolling for BW in mul_mat_q_stream_k_f... |
commit | commitdiff | tree |
| 2026-02-08 |
George | ggml: added cleanups in ggml_quantize_free (llama/19278) |
commit | commitdiff | tree |
| 2026-02-08 |
Gaurav Garg | cuda : revert CUDA_SCALE_LAUNCH_QUEUES override until... |
commit | commitdiff | tree |
| 2026-02-08 |
lhez | opencl: refactor some ops, concat, repeat, tanh and... |
commit | commitdiff | tree |
| 2026-02-08 |
Aman Gupta | ggml-cpu: FA split across kv for faster TG (llama/19209) |
commit | commitdiff | tree |
| 2026-02-08 |
Neo Zhang | Remove support for Nvidia & AMD GPU, because the oneAPI... |
commit | commitdiff | tree |
| 2026-02-08 |
Tamar | sycl: implement GGML_OP_TOP_K (llama/19242) |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | metal : support virtual devices (llama/18919) |
commit | commitdiff | tree |
| 2026-02-08 |
Johannes Gäßler | ggml-backend: fix async set/get fallback sync (llama... |
commit | commitdiff | tree |
| 2026-02-08 |
Christian Kastner | docs : Minor cleanups (llama/19252) |
commit | commitdiff | tree |
| 2026-02-08 |
Nikhil Jain | Remove pipeline cache mutexes (llama/19195) |
commit | commitdiff | tree |
| 2026-02-08 |
Max Krasnyansky | Bump cmake max version (needed for Windows on Snapdrago... |
commit | commitdiff | tree |
| 2026-02-08 |
nullname | ggml-hexagon: flash-attention and reduce-sum optimizati... |
commit | commitdiff | tree |
| 2026-02-08 |
shaofeiqi | opencl: add optimized q8_0 mm kernel for adreno (llama... |
commit | commitdiff | tree |
| 2026-02-08 |
Simon Redman | Correctly fetch q8_1 quantize pipeline in test as neede... |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | ggml : bump version to 0.9.6 (ggml/1423) |
commit | commitdiff | tree |
| 2026-02-08 |
Georgi Gerganov | cmake : remove unused file (ggml/1419) |
commit | commitdiff | tree |
| 2026-02-04 |
KITAITI Makoto | ruby : add `Whisper::Context::Params`, fix token memory... |
commit | commitdiff | tree |
| 2026-02-03 |
Mathieu Baudier | Add patch removing obsolete build-time configuration... |
commit | commitdiff | tree |
| 2026-01-30 |
KITAITI Makoto | ruby : add `VAD::Context#segments_from_samples`, allow... |
commit | commitdiff | tree |
| 2026-01-30 |
Frieder Bluemle | scripts : Fix dSYMs path case for macOS xcframework... |
commit | commitdiff | tree |
| 2026-01-30 |
Georgi Gerganov | cuda : fix compile warnings (#0) |
commit | commitdiff | tree |
| 2026-01-30 |
Georgi Gerganov | talk-llama : sync llama.cpp |
commit | commitdiff | tree |
| 2026-01-30 |
Georgi Gerganov | sync : ggml |
commit | commitdiff | tree |
| 2026-01-30 |
bssrdf | add tensor type checking as part of cuda graph properti... |
commit | commitdiff | tree |
| 2026-01-30 |
s8322 | sycl: implement GGML_UNARY_OP_SOFTPLUS (llama/19114) |
commit | commitdiff | tree |
| 2026-01-30 |
RachelMantel | sycl: implement GGML_OP_TRI (llama/19089) |
commit | commitdiff | tree |
| 2026-01-30 |
Zheyuan Chen | ggml-webgpu: improve flastAttention performance by... |
commit | commitdiff | tree |
| 2026-01-30 |
Todor Boinovski | hexagon: enable offloading to Hexagon on Windows on... |
commit | commitdiff | tree |
| 2026-01-30 |
Georgi Gerganov | cuda : fix nkvo, offload and cuda graph node properties... |
commit | commitdiff | tree |
| 2026-01-30 |
yulo | HIP: add mmf for CDNA (llama/18896) |
commit | commitdiff | tree |
| 2026-01-30 |
Vishal Singh | ggml-zendnn : resolve ZenDNN backend cross-module symbo... |
commit | commitdiff | tree |
| 2026-01-30 |
Aman Gupta | CUDA: refactor topk-moe to enable more models (GLM... |
commit | commitdiff | tree |
| 2026-01-30 |
Neo Zhang | sycl: fix norm kernels: l2_norm, group_norm, rms_norm... |
commit | commitdiff | tree |
| 2026-01-30 |
Ruben Ortlam | Vulkan Flash Attention Coopmat1 Refactor (llama/19075) |
commit | commitdiff | tree |
| next |