| 2025-12-31 |
Shouyu | ggml-hexagon: gelu operation (llama/17921) |
commit | commitdiff | tree |
| 2025-12-31 |
Alberto Cabrera... | ggml-cpu: ARM64: repack version of q8_0 (dotprod and... |
commit | commitdiff | tree |
| 2025-12-31 |
yulo | HIP: Refactor mma for RDNA and CDNA (llama/17990) |
commit | commitdiff | tree |
| 2025-12-17 |
Georgi Gerganov | sync : llama.cpp upstream/0.9.4.395 |
commit | commitdiff | tree |
| 2025-12-17 |
Naco Siren | llama.android : Rewrite Android binding (w/o cpu_featur... |
commit | commitdiff | tree |
| 2025-12-17 |
Aadeshveer... | ggml : use WARP_SIZE/2 for argmax reduction offset... |
commit | commitdiff | tree |
| 2025-12-17 |
Shouyu | ggml-hexagon: mm for mtmd (llama/17894) |
commit | commitdiff | tree |
| 2025-12-17 |
Jeremy Demeule | metal: use shared buffers on eGPU (llama/17866) |
commit | commitdiff | tree |
| 2025-12-17 |
Johannes Gäßler | llama: automatically set parameters not set by the... |
commit | commitdiff | tree |
| 2025-12-17 |
Neo Zhang Jianyu | Support gpt-oss by OPs add-id, mul_mat for mxfp4, swigl... |
commit | commitdiff | tree |
| 2025-12-17 |
Ruben Ortlam | vulkan: fix mul_mat_vec_iq1_s formatting (llama/18026) |
commit | commitdiff | tree |
| 2025-12-14 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: Fix data race/hang in scalar/cm1 flash attentio... |
commit | commitdiff | tree |
| 2025-12-14 |
lovedheart | vulkan: improve mul_mat_vec_iq1_s speed (llama/17874) |
commit | commitdiff | tree |
| 2025-12-14 |
Eve | vulkan: faster q6_k matmul (llama/17813) |
commit | commitdiff | tree |
| 2025-12-14 |
Georgi Gerganov | ggml : arm repack fix build (llama/0) |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: support get_rows for i32 (llama/17941) |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: support GGML_OP_DIAG (llama/17893) |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: Multi-pass softmax for large number of cols... |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: Allow non-pow2 n_experts in topk_moe (llama... |
commit | commitdiff | tree |
| 2025-12-14 |
Johannes Gäßler | CUDA: fix overflow in MMA kernel without stream-k ... |
commit | commitdiff | tree |
| 2025-12-14 |
Sigbjørn Skjæret | cann : fix ops broken by circular padding guard (llama... |
commit | commitdiff | tree |
| 2025-12-14 |
ixgbe | ggml-cpu : fix RISC-V Q4_0 repack select and RVV featur... |
commit | commitdiff | tree |
| 2025-12-14 |
yulo | HIP: enable mmf for RDNA3 (llama/17879) |
commit | commitdiff | tree |
| 2025-12-14 |
Piotr Wilkin... | SOLVE_TRI extension to more dimensions (llama/17793) |
commit | commitdiff | tree |
| 2025-12-13 |
Georgi Gerganov | sync : whisper.cpp |
commit | commitdiff | tree |
| 2025-12-13 |
Georgi Gerganov | ggml : arm repack fix build (whisper/0) |
commit | commitdiff | tree |
| 2025-12-12 |
Congcong Cai | cmake : set `CMAKE_RUNTIME_OUTPUT_DIRECTORY` for non... |
commit | commitdiff | tree |
| 2025-12-12 |
Copilot | ci : remove retired macOS-13 runners from CI (#1397) |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | ggml-alloc : fix reuse-parent logic for misaligned... |
commit | commitdiff | tree |
| 2025-12-11 |
nullname | ggml-hexagon: fix `rope` failure at `test-backend-ops... |
commit | commitdiff | tree |
| 2025-12-11 |
Max Krasnyansky | Fix race conditions in threadpool when dealing with... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | ggml : remove GGML_KQ_MASK_PAD constant (llama/17910) |
commit | commitdiff | tree |
| 2025-12-11 |
Sigbjørn Skjæret | cuda : add missing support check for xielu (llama/17895) |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | CUDA: fix unpadded strides in MMA FA kernel (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Neo Zhang Jianyu | fix softmax for iGPU (llama/17838) |
commit | commitdiff | tree |
| 2025-12-11 |
Gabe Goodhart | metal: SSM kernel improvements (llama/17876) |
commit | commitdiff | tree |
| 2025-12-11 |
Piotr Wilkin... | Add DIAG for CUDA (llama/17873) |
commit | commitdiff | tree |
| 2025-12-11 |
Gabe Goodhart | ggml : Provide macos-specific backtrace printing to... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : print node names for debugging (llama/17882) |
commit | commitdiff | tree |
| 2025-12-11 |
Sigbjørn Skjæret | ggml : allow fill node alloc inplace (llama/17870) |
commit | commitdiff | tree |
| 2025-12-11 |
Chenguang Li | CANN: add support for partial RoPE and Vision mode... |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | CUDA: fix FP16 overflow in tile FA kernel (llama/17875) |
commit | commitdiff | tree |
| 2025-12-11 |
Jay Zenith | cuda : add FILL op support (llama/17851) |
commit | commitdiff | tree |
| 2025-12-11 |
wsbagnsv1 | cuda: optimize SOLVE_TRI using registers and FMAF ... |
commit | commitdiff | tree |
| 2025-12-11 |
ixgbe | ggml-cpu: add ggml_thread_cpu_relax with Zihintpause... |
commit | commitdiff | tree |
| 2025-12-11 |
lovedheart | Vulkan: improve mul_mat_vec_iq1_m (llama/16907) |
commit | commitdiff | tree |
| 2025-12-11 |
Law Po Ying | sycl: add missing BF16 conversion support for Intel... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: perf_logger improvements (llama/17672) |
commit | commitdiff | tree |
| 2025-12-11 |
Vishal Singh | ggml-zendnn : add ZenDNN backend for AMD CPUs (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Phylliida Dev | ggml : add circular tiling support to pad, for Vulkan... |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | HIP: fix RDNA3 FP16/BF16 matrix multiplication (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Sky | ggml : improve error handling for search path existence... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: Use one row per workgroup for f32 mmv (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: support solve_tri with larger N/K values (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : fix build(#17799) |
commit | commitdiff | tree |
| 2025-12-11 |
Masato Nakasaka | vulkan: Replace deprecated VK_EXT_validation_features... |
commit | commitdiff | tree |
| 2025-12-11 |
Masato Nakasaka | vulkan: Fix mismatch in TOPK_MOE unit test (llama/17541) |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: add more num_blocks instantiations in rms_norm... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: fix top_k bug when there are ties in the input... |
commit | commitdiff | tree |
| 2025-12-11 |
Acly | vulkan : support conv-2d with large output size (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Reese Levine | ggml webgpu: unary op suppport, code refactoring, ops... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: enable mmvq for q2_k on NVIDIA (llama/17675) |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: set all memory allocations to high priority... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | rpc : fix alloc size logic (llama/17116) |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : add residency sets keep-alive heartbeat (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | HIP : fix RDNA4 build (llama/17792) |
commit | commitdiff | tree |
| 2025-12-11 |
shalinib-ibm | Q4/Q8 Tiled Gemm Optimization. (llama/16999) |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | CUDA: fix FA VKQ accumulator overflow (llama/17746) |
commit | commitdiff | tree |
| 2025-12-11 |
Jiacheng (Jason... | HIP: enable WMMA-MMQ INT kernels for RDNA 3 (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Piotr Wilkin... | Add support for CUMSUM and TRI for CUDA. (llama/17584) |
commit | commitdiff | tree |
| 2025-12-11 |
Gabe Goodhart | metal: TRI, FILL, EXPM1, SOFTPLUS (llama/16623) |
commit | commitdiff | tree |
| 2025-12-11 |
Alberto Cabrera... | ggml-cpu : remove asserts always evaluating to false... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : use params per pipeline instance (llama/17739) |
commit | commitdiff | tree |
| 2025-12-11 |
Adrien Gallouët | build : move _WIN32_WINNT definition to headers (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Herman Semenoff | ggml-cpu: remove duplicate conditional check 'iid'... |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | CUDA: generalized (mma) FA, add Volta support (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : fix data race in pipeline library (llama/17731) |
commit | commitdiff | tree |
| 2025-12-11 |
Reese Levine | ggml webgpu: add support for emscripten builds (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: Reduce temporary memory usage for TOP_K (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
xiaobing318 | cmake : add utf8 compilation options for msvc (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Adrien Gallouët | ggml : use svcntb() for SVE vector length detection... |
commit | commitdiff | tree |
| 2025-12-11 |
TianHao324 | CANN: Disable Ger operator of OUT_PROD on 310p device... |
commit | commitdiff | tree |
| 2025-12-11 |
Daniel Bevenius | ggml : remove redundant n_copies check when setting... |
commit | commitdiff | tree |
| 2025-12-11 |
Adrien Gallouët | ggml : add fallback definition for HWCAP2_SVE2 (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Aman Gupta | ggml-cuda: reorder only relevant nodes (llama/17639) |
commit | commitdiff | tree |
| 2025-12-11 |
Neo Zhang Jianyu | enhance argsort for UT (llama/17573) |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : add FA head size 48 (llama/17619) |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | ggml : extend the GGML_SCHED_NO_REALLOC debug logic... |
commit | commitdiff | tree |
| 2025-12-11 |
Aman Gupta | llama-graph: avoid expand_forward for fusion (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Tarek Dakhran | model: LFM2-VL fixes (llama/17577) |
commit | commitdiff | tree |
| 2025-12-11 |
Gilad S. | ggml: fix: macOS build with `-DGGML_BACKEND_DL=ON`... |
commit | commitdiff | tree |
| 2025-12-11 |
Aman Gupta | CUDA: add stream-based concurrency (llama/16991) |
commit | commitdiff | tree |
| 2025-12-11 |
Mahekk Shaikh | cuda : add error checking for cudaMemcpyAsync in argsor... |
commit | commitdiff | tree |
| 2025-12-11 |
Acly | vulkan : fix FA mask load with bounds check (coopmat2... |
commit | commitdiff | tree |
| 2025-12-11 |
Neo Zhang | sycl : support to malloc memory on device more than... |
commit | commitdiff | tree |
| 2025-12-11 |
ixgbe | ggml: replace hwcap with riscv_hwprobe for RVV detectio... |
commit | commitdiff | tree |
| 2025-12-11 |
Ruben Ortlam | Vulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: improve topk perf for large k, fix overflow... |
commit | commitdiff | tree |
| next |