| 2025-12-31 |
lhez | opencl: unpack q4_0 for adreno in get_tensor (llama... |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | vulkan: Extend rope fusions to allow mrope (llama/18264) |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | vulkan: Implement set_tensor_async and the event interf... |
commit | commitdiff | tree |
| 2025-12-31 |
Johannes Gäßler | llama: fix RPC for -fit on (llama/18233) |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | vulkan: fix im2col overflowing maxworkgroupcount (llama... |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | vulkan/cuda: fix topk_moe with exp_probs_b (llama/18071) |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | vulkan: support GGML_UNARY_OP_XIELU (llama/18062) |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | vulkan: in graph_optimize, try to group ADD operations... |
commit | commitdiff | tree |
| 2025-12-31 |
lovedheart | Vulkan: some improvement on mul_mat_iq2_xs (llama/18031) |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | tests: Avoid floating point precision false positives... |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | test-backend-ops: improve msvc build time (llama/18209) |
commit | commitdiff | tree |
| 2025-12-31 |
Aadeshveer... | Added comments explaining thread block size selection... |
commit | commitdiff | tree |
| 2025-12-31 |
Alfred | ggml-hexagon: Implement true Q8_0 quantization on Hexag... |
commit | commitdiff | tree |
| 2025-12-31 |
Jeff Bolz | vulkan: Add perf logger mode with concurrency (llama... |
commit | commitdiff | tree |
| 2025-12-31 |
Xuan-Son Nguyen | model : add ASR support for LFM2-Audio-1.5B (conformer... |
commit | commitdiff | tree |
| 2025-12-31 |
Taimur Ahmad | ggml-cpu: extend support for RVV floating-point kernels... |
commit | commitdiff | tree |
| 2025-12-31 |
yulo | remove i_major_dual (llama/18157) |
commit | commitdiff | tree |
| 2025-12-31 |
Shouyu | ggml-hexagon: swiglu_oai operation (llama/18114) |
commit | commitdiff | tree |
| 2025-12-31 |
Shouyu | ggml-hexagon: gelu operation (llama/17921) |
commit | commitdiff | tree |
| 2025-12-31 |
Alberto Cabrera... | ggml-cpu: ARM64: repack version of q8_0 (dotprod and... |
commit | commitdiff | tree |
| 2025-12-31 |
yulo | HIP: Refactor mma for RDNA and CDNA (llama/17990) |
commit | commitdiff | tree |
| 2025-12-17 |
Georgi Gerganov | sync : llama.cpp upstream/0.9.4.395 |
commit | commitdiff | tree |
| 2025-12-17 |
Naco Siren | llama.android : Rewrite Android binding (w/o cpu_featur... |
commit | commitdiff | tree |
| 2025-12-17 |
Aadeshveer... | ggml : use WARP_SIZE/2 for argmax reduction offset... |
commit | commitdiff | tree |
| 2025-12-17 |
Shouyu | ggml-hexagon: mm for mtmd (llama/17894) |
commit | commitdiff | tree |
| 2025-12-17 |
Jeremy Demeule | metal: use shared buffers on eGPU (llama/17866) |
commit | commitdiff | tree |
| 2025-12-17 |
Johannes Gäßler | llama: automatically set parameters not set by the... |
commit | commitdiff | tree |
| 2025-12-17 |
Neo Zhang Jianyu | Support gpt-oss by OPs add-id, mul_mat for mxfp4, swigl... |
commit | commitdiff | tree |
| 2025-12-17 |
Ruben Ortlam | vulkan: fix mul_mat_vec_iq1_s formatting (llama/18026) |
commit | commitdiff | tree |
| 2025-12-14 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: Fix data race/hang in scalar/cm1 flash attentio... |
commit | commitdiff | tree |
| 2025-12-14 |
lovedheart | vulkan: improve mul_mat_vec_iq1_s speed (llama/17874) |
commit | commitdiff | tree |
| 2025-12-14 |
Eve | vulkan: faster q6_k matmul (llama/17813) |
commit | commitdiff | tree |
| 2025-12-14 |
Georgi Gerganov | ggml : arm repack fix build (llama/0) |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: support get_rows for i32 (llama/17941) |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: support GGML_OP_DIAG (llama/17893) |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: Multi-pass softmax for large number of cols... |
commit | commitdiff | tree |
| 2025-12-14 |
Jeff Bolz | vulkan: Allow non-pow2 n_experts in topk_moe (llama... |
commit | commitdiff | tree |
| 2025-12-14 |
Johannes Gäßler | CUDA: fix overflow in MMA kernel without stream-k ... |
commit | commitdiff | tree |
| 2025-12-14 |
Sigbjørn Skjæret | cann : fix ops broken by circular padding guard (llama... |
commit | commitdiff | tree |
| 2025-12-14 |
ixgbe | ggml-cpu : fix RISC-V Q4_0 repack select and RVV featur... |
commit | commitdiff | tree |
| 2025-12-14 |
yulo | HIP: enable mmf for RDNA3 (llama/17879) |
commit | commitdiff | tree |
| 2025-12-14 |
Piotr Wilkin... | SOLVE_TRI extension to more dimensions (llama/17793) |
commit | commitdiff | tree |
| 2025-12-13 |
Georgi Gerganov | sync : whisper.cpp |
commit | commitdiff | tree |
| 2025-12-13 |
Georgi Gerganov | ggml : arm repack fix build (whisper/0) |
commit | commitdiff | tree |
| 2025-12-12 |
Congcong Cai | cmake : set `CMAKE_RUNTIME_OUTPUT_DIRECTORY` for non... |
commit | commitdiff | tree |
| 2025-12-12 |
Copilot | ci : remove retired macOS-13 runners from CI (#1397) |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | ggml-alloc : fix reuse-parent logic for misaligned... |
commit | commitdiff | tree |
| 2025-12-11 |
nullname | ggml-hexagon: fix `rope` failure at `test-backend-ops... |
commit | commitdiff | tree |
| 2025-12-11 |
Max Krasnyansky | Fix race conditions in threadpool when dealing with... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | ggml : remove GGML_KQ_MASK_PAD constant (llama/17910) |
commit | commitdiff | tree |
| 2025-12-11 |
Sigbjørn Skjæret | cuda : add missing support check for xielu (llama/17895) |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | CUDA: fix unpadded strides in MMA FA kernel (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Neo Zhang Jianyu | fix softmax for iGPU (llama/17838) |
commit | commitdiff | tree |
| 2025-12-11 |
Gabe Goodhart | metal: SSM kernel improvements (llama/17876) |
commit | commitdiff | tree |
| 2025-12-11 |
Piotr Wilkin... | Add DIAG for CUDA (llama/17873) |
commit | commitdiff | tree |
| 2025-12-11 |
Gabe Goodhart | ggml : Provide macos-specific backtrace printing to... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : print node names for debugging (llama/17882) |
commit | commitdiff | tree |
| 2025-12-11 |
Sigbjørn Skjæret | ggml : allow fill node alloc inplace (llama/17870) |
commit | commitdiff | tree |
| 2025-12-11 |
Chenguang Li | CANN: add support for partial RoPE and Vision mode... |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | CUDA: fix FP16 overflow in tile FA kernel (llama/17875) |
commit | commitdiff | tree |
| 2025-12-11 |
Jay Zenith | cuda : add FILL op support (llama/17851) |
commit | commitdiff | tree |
| 2025-12-11 |
wsbagnsv1 | cuda: optimize SOLVE_TRI using registers and FMAF ... |
commit | commitdiff | tree |
| 2025-12-11 |
ixgbe | ggml-cpu: add ggml_thread_cpu_relax with Zihintpause... |
commit | commitdiff | tree |
| 2025-12-11 |
lovedheart | Vulkan: improve mul_mat_vec_iq1_m (llama/16907) |
commit | commitdiff | tree |
| 2025-12-11 |
Law Po Ying | sycl: add missing BF16 conversion support for Intel... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: perf_logger improvements (llama/17672) |
commit | commitdiff | tree |
| 2025-12-11 |
Vishal Singh | ggml-zendnn : add ZenDNN backend for AMD CPUs (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Phylliida Dev | ggml : add circular tiling support to pad, for Vulkan... |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | HIP: fix RDNA3 FP16/BF16 matrix multiplication (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Sky | ggml : improve error handling for search path existence... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: Use one row per workgroup for f32 mmv (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: support solve_tri with larger N/K values (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : fix build(#17799) |
commit | commitdiff | tree |
| 2025-12-11 |
Masato Nakasaka | vulkan: Replace deprecated VK_EXT_validation_features... |
commit | commitdiff | tree |
| 2025-12-11 |
Masato Nakasaka | vulkan: Fix mismatch in TOPK_MOE unit test (llama/17541) |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: add more num_blocks instantiations in rms_norm... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: fix top_k bug when there are ties in the input... |
commit | commitdiff | tree |
| 2025-12-11 |
Acly | vulkan : support conv-2d with large output size (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Reese Levine | ggml webgpu: unary op suppport, code refactoring, ops... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: enable mmvq for q2_k on NVIDIA (llama/17675) |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: set all memory allocations to high priority... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | rpc : fix alloc size logic (llama/17116) |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : add residency sets keep-alive heartbeat (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | HIP : fix RDNA4 build (llama/17792) |
commit | commitdiff | tree |
| 2025-12-11 |
shalinib-ibm | Q4/Q8 Tiled Gemm Optimization. (llama/16999) |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | CUDA: fix FA VKQ accumulator overflow (llama/17746) |
commit | commitdiff | tree |
| 2025-12-11 |
Jiacheng (Jason... | HIP: enable WMMA-MMQ INT kernels for RDNA 3 (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Piotr Wilkin... | Add support for CUMSUM and TRI for CUDA. (llama/17584) |
commit | commitdiff | tree |
| 2025-12-11 |
Gabe Goodhart | metal: TRI, FILL, EXPM1, SOFTPLUS (llama/16623) |
commit | commitdiff | tree |
| 2025-12-11 |
Alberto Cabrera... | ggml-cpu : remove asserts always evaluating to false... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : use params per pipeline instance (llama/17739) |
commit | commitdiff | tree |
| 2025-12-11 |
Adrien Gallouët | build : move _WIN32_WINNT definition to headers (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Herman Semenoff | ggml-cpu: remove duplicate conditional check 'iid'... |
commit | commitdiff | tree |
| 2025-12-11 |
Johannes Gäßler | CUDA: generalized (mma) FA, add Volta support (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Georgi Gerganov | metal : fix data race in pipeline library (llama/17731) |
commit | commitdiff | tree |
| 2025-12-11 |
Reese Levine | ggml webgpu: add support for emscripten builds (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
Jeff Bolz | vulkan: Reduce temporary memory usage for TOP_K (llama... |
commit | commitdiff | tree |
| 2025-12-11 |
xiaobing318 | cmake : add utf8 compilation options for msvc (llama... |
commit | commitdiff | tree |
| next |