| 2025-10-12 |
lhez | opencl: support pad_ext (llama/15888) |
commit | commitdiff | tree |
| 2025-10-12 |
Reese Levine | ggml webgpu: support for rope,div,sub,glu,scale,cont... |
commit | commitdiff | tree |
| 2025-10-12 |
lhez | opencl: support ne3 in get_rows (llama/15866) |
commit | commitdiff | tree |
| 2025-09-30 |
Georgi Gerganov | ggml : bump version to 0.9.4 (#1363) upstream/0.9.4 v0.9.4 |
commit | commitdiff | tree |
| 2025-09-30 |
Georgi Gerganov | sync : whisper.cpp [no ci] |
commit | commitdiff | tree |
| 2025-09-30 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-09-30 |
anavp-nvidia | cuda : Enable CUDA Graph usage for Nemotron Nano v2... |
commit | commitdiff | tree |
| 2025-09-30 |
Georgi Gerganov | metal : dynamic simdgroups for MV kernels (llama/16340) |
commit | commitdiff | tree |
| 2025-09-30 |
Charles Xu | kleidiai : fix work size and threads sync for fp16... |
commit | commitdiff | tree |
| 2025-09-30 |
Jeff Bolz | tests: override test_set_rows::max_nmse_err to allow... |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-09-29 |
alex-spacemit | ggml: riscv: add riscv spacemit backend (llama/15288) |
commit | commitdiff | tree |
| 2025-09-29 |
Rafal Lewczuk | ggml-backend : add root cause in error message if loadi... |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | sync : whisper.cpp (#1359) |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | ci : print results [no ci] (#1358) |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | ci : add self-hosted workflows (#1357) |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | cmake : remove metal flag (llama/0) |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-09-29 |
Sigbjørn Skjæret | ggml : check cuda and metal argsort limits and add... |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | ggml : fix dependencies for ggml_set_rows (llama/16318) |
commit | commitdiff | tree |
| 2025-09-29 |
Jeff Bolz | vulkan: Fix validation failure in quantized flash atten... |
commit | commitdiff | tree |
| 2025-09-29 |
Sigbjørn Skjæret | ggml : fix GGML_F32_VEC_FMA argument order in ggml_vec_... |
commit | commitdiff | tree |
| 2025-09-29 |
Jeff Bolz | vulkan: 64-bit im2col (llama/16135) |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | metal : extend mat-mat multiplication support (llama... |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | metal : fuse non-sequential nodes (llama/16102) |
commit | commitdiff | tree |
| 2025-09-29 |
Jeff Bolz | vulkan: handle mat_mul with A matrix > 4GB (llama/16176) |
commit | commitdiff | tree |
| 2025-09-29 |
Jeff Bolz | vulkan: support arbitrary KV dimension in flash attenti... |
commit | commitdiff | tree |
| 2025-09-29 |
Acly | vulkan : make the vulkan.hpp dynamic dispatcher instanc... |
commit | commitdiff | tree |
| 2025-09-29 |
Aman Gupta | CUDA: mul_mat_id for mmf for bs <= 64 for f16 and bs... |
commit | commitdiff | tree |
| 2025-09-29 |
Johannes Gäßler | CUDA: refactor and deduplicate vector FA kernels (llama... |
commit | commitdiff | tree |
| 2025-09-29 |
Dmytro Minochkin | vulkan: throw system error instead of SIGABRT during... |
commit | commitdiff | tree |
| 2025-09-29 |
Jeff Bolz | vulkan: support GET_ROWS for k-quants (llama/16235) |
commit | commitdiff | tree |
| 2025-09-29 |
Aaron Teo | devops: add s390x & ppc64le CI (llama/15925) |
commit | commitdiff | tree |
| 2025-09-29 |
Georgi Gerganov | metal : report OOM errors (llama/16274) |
commit | commitdiff | tree |
| 2025-09-29 |
Adrien Gallouët | common : use cpp-httplib as a cURL alternative for... |
commit | commitdiff | tree |
| 2025-09-29 |
Aaron Teo | ggml-cpu: implement MXFP4 SIMD for s390x (llama/16193) |
commit | commitdiff | tree |
| 2025-09-29 |
R0CKSTAR | musa: fix build warnings (llama/15611) |
commit | commitdiff | tree |
| 2025-09-29 |
Aman Gupta | CUDA: add a fused top-K MoE kernel (llama/16130) |
commit | commitdiff | tree |
| 2025-09-29 |
junchao-zhao | ggml : fix loongarch lsx compilation error (llama/15864) |
commit | commitdiff | tree |
| 2025-09-26 |
Daniel Bevenius | ggml : remove -dev suffix from release version (#1355) |
commit | commitdiff | tree |
| 2025-09-25 |
Christoph Reiter | pkg-config: include the new GGML_VERSION as a version... |
commit | commitdiff | tree |
| 2025-09-25 |
hebangwen | examples : fix typo mismatch in gpt (#1349) |
commit | commitdiff | tree |
| 2025-09-25 |
Daniel Bevenius | ggml : bump version to 0.9.3 (#1353) v0.9.3 |
commit | commitdiff | tree |
| 2025-09-25 |
Daniel Bevenius | scripts : refactor release script into prepare and... |
commit | commitdiff | tree |
| 2025-09-25 |
Daniel Bevenius | scripts : fix next dev version calculation [no ci]... |
commit | commitdiff | tree |
| 2025-09-25 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-09-25 |
Georgi Gerganov | metal : fuse NORM + MUL + ADD, support non-multiples... |
commit | commitdiff | tree |
| 2025-09-25 |
Georgi Gerganov | metal : relax reorder conditions (llama/16216) |
commit | commitdiff | tree |
| 2025-09-25 |
Georgi Gerganov | metal : restore im2col perf (llama/16219) |
commit | commitdiff | tree |
| 2025-09-25 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-09-25 |
Radoslav Gerganov | rpc : use ggml logging facilities |
commit | commitdiff | tree |
| 2025-09-25 |
Eve | ci: run the x64 and arm ci on the github machines inste... |
commit | commitdiff | tree |
| 2025-09-25 |
Johannes Gäßler | llama: print memory breakdown on exit (llama/15860) |
commit | commitdiff | tree |
| 2025-09-25 |
Acly | ggml : split graph allocations according to backend... |
commit | commitdiff | tree |
| 2025-09-25 |
Xiangyan Sun | ggml-cpu: Respect cpumask settings (llama/16164) |
commit | commitdiff | tree |
| 2025-09-25 |
Sigbjørn Skjæret | ggml : fix uninitialized is_on_grid in quantize_row_iq3... |
commit | commitdiff | tree |
| 2025-09-25 |
Aaron Teo | zdnn: refactor codebase + add docs (llama/16178) |
commit | commitdiff | tree |
| 2025-09-25 |
Daniel Bevenius | ggml-cpu : fix typo in gemm comments [no ci] (llama... |
commit | commitdiff | tree |
| 2025-09-25 |
Sigbjørn Skjæret | ggml : implement set_rows with i32 index (llama/16159) |
commit | commitdiff | tree |
| 2025-09-25 |
Georgi Gerganov | ggml : extend ggml_can_fuse to work with non-sequential... |
commit | commitdiff | tree |
| 2025-09-25 |
Georgi Gerganov | ggml : add ggml_op_is_empty (llama/16122) |
commit | commitdiff | tree |
| 2025-09-25 |
Shin-myoung... | Vulkan: add conv_transpose_2d operation (llama/16022) |
commit | commitdiff | tree |
| 2025-09-25 |
Jeff Bolz | vulkan: add RTE variants of exp shader (llama/16165) |
commit | commitdiff | tree |
| 2025-09-25 |
Ruben Ortlam | vulkan: vec dot matrix multiplication fix (llama/16151) |
commit | commitdiff | tree |
| 2025-09-25 |
lhez | opencl: fix concat crash on win arm64 with Adreno ... |
commit | commitdiff | tree |
| 2025-09-25 |
lhez | opencl: initial `q8_0` mv support (llama/15732) |
commit | commitdiff | tree |
| 2025-09-25 |
Giuseppe Scrivano | vulkan: optimize UMA buffer operations and fix driver... |
commit | commitdiff | tree |
| 2025-09-25 |
Jeff Bolz | vulkan: fix validation error about VK_PIPELINE_CREATE_C... |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | ggml : prepare for development of 0.9.2-dev |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | ggml : bump version to 0.9.1 |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | scripts : fix sed usage to work on Mac (#1345) |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | tests : adjust to new timestep_embedding operator |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-09-20 |
Ruben Ortlam | vulkan: use vec dot for matrix matrix multiplications... |
commit | commitdiff | tree |
| 2025-09-20 |
Xuan-Son Nguyen | ggml : refactor forward_dup for cpu backend (llama... |
commit | commitdiff | tree |
| 2025-09-20 |
Adrien Gallouët | ggml-amx : fix ggml_amx_init() on generic Linux (llama... |
commit | commitdiff | tree |
| 2025-09-20 |
Adrien Gallouët | cmake : fix static linking for OpenMP on Unix-like... |
commit | commitdiff | tree |
| 2025-09-20 |
Shawn Gu | opencl: optimize mxfp4 kernels (llama/16037) |
commit | commitdiff | tree |
| 2025-09-20 |
Jeff Bolz | rename optimize_graph to graph_optimize (llama/16082) |
commit | commitdiff | tree |
| 2025-09-20 |
Bowen Han | CUDA: Optimize PAD_REFLECT_1D (llama/15957) |
commit | commitdiff | tree |
| 2025-09-20 |
Johannes Gäßler | CUDA: fix compilation on CC 6.0 (llama/16091) |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | metal : use function constants for mul_mv_ext kernels... |
commit | commitdiff | tree |
| 2025-09-20 |
Sigbjørn Skjæret | cuda : add missing F32<->I32 entries in ggml_cuda_cpy_f... |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | metal : improve F32, F16 and BF16 mat-vec multiplicatio... |
commit | commitdiff | tree |
| 2025-09-20 |
Jhen-Jie Hong | metal : avoid call free for non-owned buffer (llama... |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | metal : handle nil cv during pipeline creation (llama... |
commit | commitdiff | tree |
| 2025-09-20 |
Chenguang Li | CANN: Remove print (llama/16044) |
commit | commitdiff | tree |
| 2025-09-20 |
Reese Levine | GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS... |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | metal : refactor + optimize v2 (llama/15995) |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | sync : llama.cpp |
commit | commitdiff | tree |
| 2025-09-20 |
Johannes Gäßler | CUDA: fix FA occupancy, optimize tile kernel (llama... |
commit | commitdiff | tree |
| 2025-09-20 |
Eve | vulkan: automatically remove unsupported devices (llama... |
commit | commitdiff | tree |
| 2025-09-20 |
Chenguang Li | CANN: Optimize ggml_cann_set_device (llama/15935) |
commit | commitdiff | tree |
| 2025-09-20 |
Daniel Bevenius | ggml : fix padding in timestep embedding kernels (llama... |
commit | commitdiff | tree |
| 2025-09-20 |
Jake Karnes | CUDA: fix im2col_3d to respect non-contiguous inputs... |
commit | commitdiff | tree |
| 2025-09-20 |
yael-works | SYCL: Add COUNT_EQUAL operator support (llama/15991) |
commit | commitdiff | tree |
| 2025-09-20 |
Aman Gupta | CUDA: some micro-optimizations in mmf.cuh for mul_mat_i... |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | metal : remove memory pools (llama/15966) |
commit | commitdiff | tree |
| 2025-09-20 |
Ruben Ortlam | Vulkan: Clean up mul_mm shader (llama/15987) |
commit | commitdiff | tree |
| 2025-09-20 |
Georgi Gerganov | metal : fix kernel requirements (llama/15983) |
commit | commitdiff | tree |
| next |