| 2026-03-30 |
Georgi Gerganov | ggml : bump version to 0.9.9 (#1449) v0.9.9 |
commit | commitdiff | tree |
| 2026-03-30 |
Georgi Gerganov | sync : whisper.cpp |
commit | commitdiff | tree |
| 2026-03-28 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2026-03-28 |
Ruben Ortlam | vulkan: add noncontiguous GLU support (llama/21081) |
commit | commitdiff | tree |
| 2026-03-28 |
Yiwei Shao | hexagon: support for IQ4_NL and MXFP4 (llama/21018) |
commit | commitdiff | tree |
| 2026-03-28 |
Radoslav Gerganov | rpc : proper handling of data pointers to CPU buffers... |
commit | commitdiff | tree |
| 2026-03-28 |
ren | metal : Fix dimension constraint violation in matmul2d... |
commit | commitdiff | tree |
| 2026-03-28 |
uvos | hip: use fnuz fp8 for conversion on CDNA3 (llama/21040) |
commit | commitdiff | tree |
| 2026-03-28 |
lhez | opencl: allow large buffer for adreno (llama/20997) |
commit | commitdiff | tree |
| 2026-03-28 |
ihb2032 | fix(ggml): correct RISC-V ISA string canonical ordering... |
commit | commitdiff | tree |
| 2026-03-28 |
Michael Wand | ggml-cuda: Add NVFP4 dp4a kernel (llama/20644) |
commit | commitdiff | tree |
| 2026-03-28 |
Yihao Wang | CUDA & CPU: support F32 kernel type for `CONV_TRANSPOSE... |
commit | commitdiff | tree |
| 2026-03-28 |
Saba Fallah | mtmd: Add DeepSeekOCR Support (llama/17400) |
commit | commitdiff | tree |
| 2026-03-28 |
Johannes Gäßler | llama: fix llama-model-saver (llama/20503) |
commit | commitdiff | tree |
| 2026-03-28 |
Neo Zhang | sycl : fix wrong variable check by assert (llama/20903) |
commit | commitdiff | tree |
| 2026-03-28 |
nuri | metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (llama... |
commit | commitdiff | tree |
| 2026-03-28 |
Georgi Gerganov | metal : add FA instantiations for HSK=512, HSV=512... |
commit | commitdiff | tree |
| 2026-03-28 |
Max Krasnyansky | hexagon: general DMA and Binary Op fixes for large... |
commit | commitdiff | tree |
| 2026-03-28 |
lhez | opencl: add q6_K gemm and gemv kernels for Adreno ... |
commit | commitdiff | tree |
| 2026-03-28 |
las7 | rpc : RCE patch (llama/20908) |
commit | commitdiff | tree |
| 2026-03-28 |
Rashid Ul Islam | metal: add CONV_3D (llama/19927) |
commit | commitdiff | tree |
| 2026-03-28 |
Chenguang Li | CANN: add RoPE cache preload before ACL graph capture... |
commit | commitdiff | tree |
| 2026-03-28 |
Dan Hoffman | fix(openvino): explicit memset in buffer_context alloca... |
commit | commitdiff | tree |
| 2026-03-28 |
shaofeiqi | opencl: add flattened Q4_K mv and general Q4_K mm ... |
commit | commitdiff | tree |
| 2026-03-28 |
Johannes Gäßler | CUDA: fix BF16 FA compilation (llama/20865) |
commit | commitdiff | tree |
| 2026-03-28 |
Neo Zhang | support bf16 and quantized type (llama/20803) |
commit | commitdiff | tree |
| 2026-03-28 |
Patrick Buckley | ggml-cuda: native bf16 flash attention for vec kernel... |
commit | commitdiff | tree |
| 2026-03-28 |
Gaurav Garg | Increase number of output elements per-thread block... |
commit | commitdiff | tree |
| 2026-03-28 |
y198 | fix(rpc): prevent division by zero in deserialize_tenso... |
commit | commitdiff | tree |
| 2026-03-28 |
Matt Corallo | Add shader count for Intel Arc Pro B60 (llama/20818) |
commit | commitdiff | tree |
| 2026-03-28 |
shalinib-ibm | ggml-cpu: add always_inline to tinyBLAS_PPC accumulator... |
commit | commitdiff | tree |
| 2026-03-28 |
Jeff Bolz | vulkan: change gated_delta_net to shard a column across... |
commit | commitdiff | tree |
| 2026-03-28 |
hipudding | CANN: add BF16 support for core operators (llama/20152) |
commit | commitdiff | tree |
| 2026-03-28 |
Sundaram krishnan | ggml: guard KleidiAI DOWNLOAD_EXTRACT_TIMESTAMP for... |
commit | commitdiff | tree |
| 2026-03-28 |
Rail Chabdarov | hip: Avoid compiler bug in RDNA code generation during... |
commit | commitdiff | tree |
| 2026-03-28 |
Yiwei Shao | hexagon: add Matrix Extensions (HMX) for Hexagon NPU... |
commit | commitdiff | tree |
| 2026-03-28 |
uvos | ci : add hip quality check (llama/20430) |
commit | commitdiff | tree |
| 2026-03-28 |
Reese Levine | ggml webgpu: ops support for qwen3.5 (SET, TRI_SOLVE... |
commit | commitdiff | tree |
| 2026-03-28 |
Eve | vulkan: dequantize iq4_xs 4 at a time (llama/20657) |
commit | commitdiff | tree |
| 2026-03-28 |
Charles Xu | cmake : fix build warning when kleidiai is enabled... |
commit | commitdiff | tree |
| 2026-03-28 |
Chenguang Li | CANN: handle in-place ROPE on non-contiguous f32 tensor... |
commit | commitdiff | tree |
| 2026-03-28 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2026-03-28 |
Masashi Yoshimura | ggml-webgpu: Update the `RMS_NORM` preprocessor and... |
commit | commitdiff | tree |
| 2026-03-28 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2026-03-28 |
Masashi Yoshimura | ggml-webgpu: Add supports for `DIAG` and `TRI` (llama... |
commit | commitdiff | tree |
| 2026-03-28 |
Chenguang Li | CANN: support flash attention for head dim not multiple... |
commit | commitdiff | tree |
| 2026-03-28 |
Reese Levine | Move to no timeout for WaitAny in graph submission... |
commit | commitdiff | tree |
| 2026-03-28 |
Shaw Nguyen | ggml-cpu/x86: fix unused changemask warning in repack... |
commit | commitdiff | tree |
| 2026-03-28 |
uvos | HIP : ignore return of hipMemAdvise [no ci] (llama... |
commit | commitdiff | tree |
| 2026-03-28 |
Krishna Sridhar | hexagon: add neg, exp, sigmoid, softplus ops, cont... |
commit | commitdiff | tree |
| 2026-03-28 |
Ruben Ortlam | vulkan: disable mmvq on Intel Windows driver (llama... |
commit | commitdiff | tree |
| 2026-03-28 |
Kevin Hannon | ggml-blas: set mkl threads from thread context (llama... |
commit | commitdiff | tree |
| 2026-03-28 |
Taimur Ahmad | ggml-cpu: fix RVV checks in quants and repacking (llama... |
commit | commitdiff | tree |
| 2026-03-28 |
Ruben Ortlam | vulkan: async and event fixes (llama/20518) |
commit | commitdiff | tree |
| 2026-03-28 |
Justin Bradford | kleidiai : fix MUL_MAT support for batched (3D) inputs... |
commit | commitdiff | tree |
| 2026-03-28 |
Ruben Ortlam | vulkan: allow graphics queue only through env var ... |
commit | commitdiff | tree |
| 2026-03-28 |
Neo Zhang | ehance UPSCALE to support all UT cases (llama/20637) |
commit | commitdiff | tree |
| 2026-03-28 |
Martin Klacer | kleidiai: add data type check to get_tensor_traits... |
commit | commitdiff | tree |
| 2026-03-28 |
Ruben Ortlam | vulkan: fix flash attention dot product precision ... |
commit | commitdiff | tree |
| 2026-03-28 |
Aman Gupta | CUDA: GDN hide memory latency (llama/20537) |
commit | commitdiff | tree |
| 2026-03-28 |
Sigbjørn Skjæret | sycl : fix for untransposed GDA recurrent state (llama... |
commit | commitdiff | tree |
| 2026-03-16 |
Georgi Gerganov | ci : disable AMX jobs |
commit | commitdiff | tree |
| 2026-03-16 |
Georgi Gerganov | ggml : bump version to 0.9.8 (#1442) v0.9.8 |
commit | commitdiff | tree |
| 2026-03-16 |
Georgi Gerganov | ggml : restore ggml_type_sizef() to aboid major version... |
commit | commitdiff | tree |
| 2026-03-16 |
Georgi Gerganov | readme : simplify |
commit | commitdiff | tree |
| 2026-03-16 |
Georgi Gerganov | sync : whisper.cpp |
commit | commitdiff | tree |
| 2026-03-16 |
Georgi Gerganov | ggml : try fix arm build (whisper/0) |
commit | commitdiff | tree |
| 2026-03-15 |
David366AI | ggml : extend im2col f16 (#1434) |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | common : add nvfp4 (#0) |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2026-03-15 |
Johannes Gäßler | CUDA: limit number of FA stream-k CUDA blocks (llama... |
commit | commitdiff | tree |
| 2026-03-15 |
Pascal | ggml: avoid creating CUDA context during device init... |
commit | commitdiff | tree |
| 2026-03-15 |
MoonShadow | ggml/hip: fix APU compatibility - soft error handling... |
commit | commitdiff | tree |
| 2026-03-15 |
Bartowski | ggml : guard against sumq2 being 0 in IQ4_NL (llama... |
commit | commitdiff | tree |
| 2026-03-15 |
PikaPikachu | cuda : add RDNA4-specific MMVQ parameter table for... |
commit | commitdiff | tree |
| 2026-03-15 |
Ruben Ortlam | vulkan: use graphics queue on AMD (llama/20551) |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | metal : add FA specialization for HSK = 320, HSV =... |
commit | commitdiff | tree |
| 2026-03-15 |
Max Krasnyansky | hexagon: Q4_0 and MXFP4 repack fixes (llama/20527) |
commit | commitdiff | tree |
| 2026-03-15 |
Neo Zhang | add op gated_delta_net (llama/20455) |
commit | commitdiff | tree |
| 2026-03-15 |
Adrien Gallouët | ggml : add native AVX512-FP16 support for F16 operation... |
commit | commitdiff | tree |
| 2026-03-15 |
Wallentri | Use fp32 in cuBLAS V100 to avoid overflows, env variabl... |
commit | commitdiff | tree |
| 2026-03-15 |
Zijun Yu | ggml : add OpenVINO backend (llama/15307) |
commit | commitdiff | tree |
| 2026-03-15 |
Rail Chabdarov | Fix data race in CUDA's "cpy" kernel (influences GGML... |
commit | commitdiff | tree |
| 2026-03-15 |
lhez | opencl: fix l2_norm (llama/20480) |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | graph : remove redundant GDN state transposes (llama... |
commit | commitdiff | tree |
| 2026-03-15 |
rehan-10xengineer | ggml-cpu: add RVV vec dot kernels for quantization... |
commit | commitdiff | tree |
| 2026-03-15 |
Adrien Gallouët | ggml : fix typo gmml (llama/20512) |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | metal : fix l2 norm scale (llama/20493) |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | llama : disable graph reuse with pipeline parallelism... |
commit | commitdiff | tree |
| 2026-03-15 |
Ruben Ortlam | test-backend-ops: allow loading tests from file and... |
commit | commitdiff | tree |
| 2026-03-15 |
ProgenyAlpha | vulkan: add GATED_DELTA_NET op support (llama/20334) |
commit | commitdiff | tree |
| 2026-03-15 |
ProgenyAlpha | vulkan: fix SSM_CONV PP scaling with large ubatch sizes... |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | metal : avoid divisions in bin kernel (llama/20426) |
commit | commitdiff | tree |
| 2026-03-15 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2026-03-15 |
Jeff Bolz | vulkan: fix l2_norm epsilon handling (llama/20350) |
commit | commitdiff | tree |
| 2026-03-15 |
Jeff Bolz | vulkan: fix OOB check in flash_attn_mask_opt (llama... |
commit | commitdiff | tree |
| 2026-03-15 |
Masato Nakasaka | vulkan: Fix ErrorOutOfHostMemory on Intel GPU when... |
commit | commitdiff | tree |
| 2026-03-15 |
lhez | opencl: use larger workgroup size for get_rows (llama... |
commit | commitdiff | tree |
| 2026-03-15 |
shaofeiqi | opencl: add cumsum op (llama/18981) |
commit | commitdiff | tree |
| next |