| 2025-09-24 |
Jie Fu (傅杰) | model-conversion : run-org-model.py fails to run on... |
| 2025-09-24 |
Daniel Bevenius | codeowners : use slash prefix for root files [no ci... |
| 2025-09-24 |
Jie Fu (傅杰) | model-conversion : fix the make targets in the README... |
| 2025-09-23 |
Georgi Gerganov | ci : disable AMD workflows + update NVIDIA workflows... |
| 2025-09-23 |
Georgi Gerganov | ci : enable Vulkan workflow on Mac (#16194) |
| 2025-09-23 |
Xiangyan Sun | ggml-cpu: Respect cpumask settings (#16164) |
| 2025-09-23 |
Sigbjørn Skjæret | ggml : fix uninitialized is_on_grid in quantize_row_iq3... |
| 2025-09-23 |
Aaron Teo | zdnn: refactor codebase + add docs (#16178) |
| 2025-09-23 |
Daniel Bevenius | codeowners : add @danbev to model-conversion example... |
| 2025-09-23 |
Aaron Teo | devops: add s390x containers (#15915) |
| 2025-09-23 |
Daniel Bevenius | ggml-cpu : fix typo in gemm comments [no ci] (#16189) |
| 2025-09-22 |
Gabe Goodhart | feat: Add conversion support in GraniteHybrid for non... |
| 2025-09-22 |
Haiyue Wang | clang-tidy : disable warning about performance enum... |
| 2025-09-22 |
Sigbjørn Skjæret | ggml : implement set_rows with i32 index (#16159) |
| 2025-09-22 |
Georgi Gerganov | codeowners : update + cleanup (#16174) |
| 2025-09-22 |
Adrien Gallouët | common : enable `--offline` mode without curl support... |
| 2025-09-22 |
Quentin Bramas | webui : fix handling incomplete chunks (#16107) |
| 2025-09-22 |
GideonSerf | embedding : fix typos in README (#16171) |
| 2025-09-22 |
Haiyue Wang | common : remove unused local variables (#16140) |
| 2025-09-22 |
Georgi Gerganov | ggml : extend ggml_can_fuse to work with non-sequential... |
| 2025-09-22 |
Georgi Gerganov | ggml : add ggml_op_is_empty (#16122) |
| 2025-09-22 |
Xuan-Son Nguyen | codeowners : update ownership for @ngxson and @allozuar... |
| 2025-09-22 |
Shin-myoung... | Vulkan: add conv_transpose_2d operation (#16022) |
| 2025-09-22 |
Sigbjørn Skjæret | codeowners : claim responsibility for ci, models, gguf... |
| 2025-09-22 |
Georgi Gerganov | contrib : update roles (#16113) |
| 2025-09-22 |
Georgi Gerganov | ci : remove vulkaninfo calls (#16169) |
| 2025-09-22 |
Georgi Gerganov | ci : use smaller model (#16168) |
| 2025-09-22 |
Jeff Bolz | vulkan: add RTE variants of exp shader (#16165) |
| 2025-09-22 |
Georgi Gerganov | ci : adjust params for less runtime (#16167) |
| 2025-09-22 |
Ruben Ortlam | vulkan: vec dot matrix multiplication fix (#16151) |
| 2025-09-21 |
lhez | opencl: fix concat crash on win arm64 with Adreno ... |
| 2025-09-21 |
lhez | opencl: initial `q8_0` mv support (#15732) |
| 2025-09-21 |
Georgi Gerganov | ci : add label for the RISC-V runner (#16150) |
| 2025-09-21 |
Georgi Gerganov | ci : migrate ggml ci to self-hosted runners (#16116) |
| 2025-09-21 |
Giuseppe Scrivano | vulkan: optimize UMA buffer operations and fix driver... |
| 2025-09-21 |
Jeff Bolz | vulkan: fix validation error about VK_PIPELINE_CREATE_C... |
| 2025-09-20 |
Georgi Gerganov | sync : ggml upstream/0.0.6527 |
| 2025-09-20 |
Daniel Bevenius | ggml : introduce semantic versioning (ggml/1336) |
| 2025-09-20 |
Gregor Jasny | CUDA : conditionally add cuda architectures (ggml/1341) |
| 2025-09-20 |
Ruben Ortlam | vulkan: use vec dot for matrix matrix multiplications... |
| 2025-09-20 |
Benni | server: fix SSE and OpenAI compatibility for error... |
| 2025-09-19 |
ssweens | llama-bench: add --devices and --list-devices support... |
| 2025-09-19 |
shun095 | chat: Fix streaming parser for granite models (#15682) |
| 2025-09-19 |
Aleksander... | feat: Improve mobile UI for Settings Dialog (#16084) |
| 2025-09-19 |
Xuan-Son Nguyen | chat : fix build on arm64 (#16101) |
| 2025-09-19 |
Xuan-Son Nguyen | ggml : refactor forward_dup for cpu backend (#16062) |
| 2025-09-18 |
Adrien Gallouët | ggml-amx : fix ggml_amx_init() on generic Linux (#16049) |
| 2025-09-18 |
Adrien Gallouët | cmake : fix static linking for OpenMP on Unix-like... |
| 2025-09-18 |
Shawn Gu | opencl: optimize mxfp4 kernels (#16037) |
| 2025-09-18 |
Jeff Bolz | rename optimize_graph to graph_optimize (#16082) |
| 2025-09-18 |
Bowen Han | CUDA: Optimize PAD_REFLECT_1D (#15957) |
| 2025-09-18 |
Johannes Gäßler | CUDA: fix compilation on CC 6.0 (#16091) |
| 2025-09-18 |
Eric Curtin | Add resumable downloads for llama-server model loading... |
| 2025-09-18 |
Georgi Gerganov | metal : use function constants for mul_mv_ext kernels... |
| 2025-09-18 |
Sigbjørn Skjæret | cuda : add missing F32<->I32 entries in ggml_cuda_cpy_f... |
| 2025-09-18 |
Radoslav Gerganov | server : include usage statistics only when user reques... |
| 2025-09-18 |
Georgi Gerganov | llama : bump max seq limit from 64 to 256 (#15916) |
| 2025-09-18 |
Georgi Gerganov | metal : improve F32, F16 and BF16 mat-vec multiplicatio... |
| 2025-09-18 |
Jhen-Jie Hong | metal : avoid call free for non-owned buffer (#16067) |
| 2025-09-18 |
Georgi Gerganov | metal : handle nil cv during pipeline creation (#16065) |
| 2025-09-18 |
Chenguang Li | CANN: Remove print (#16044) |
| 2025-09-17 |
Reese Levine | GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS... |
| 2025-09-17 |
Georgi Gerganov | metal : refactor + optimize v2 (#15995) |
| 2025-09-17 |
Aleksander... | SvelteKit-based WebUI (#14839) |
| 2025-09-17 |
Xuan-Son Nguyen | convert : add Llama4ForCausalLM (#16042) |
| 2025-09-17 |
Johannes Gäßler | CUDA: fix FA occupancy, optimize tile kernel (#15982) |
| 2025-09-17 |
David Ribeiro... | common : Fix corrupted memory error on json grammar... |
| 2025-09-17 |
Eve | vulkan: automatically remove unsupported devices (... |
| 2025-09-17 |
Daniel Bevenius | ci : revert back to macos-13 for macOS-latest-cmake... |
| 2025-09-17 |
Jie Fu (傅杰) | llama-quant : fix the verification of attention layers... |
| 2025-09-17 |
Jie Fu (傅杰) | examples : support encoder-decoder models in the simple... |
| 2025-09-17 |
Shane A | model : add OLMo3 support (#16015) |
| 2025-09-17 |
Chenguang Li | CANN: Optimize ggml_cann_set_device (#15935) |
| 2025-09-16 |
jacekpoplawski | llama-bench: add --n-cpu-moe support (#15952) |
| 2025-09-16 |
Daniel Bevenius | ci : use macos-latest for arm64 webgpu build (#16029) |
| 2025-09-16 |
Daniel Bevenius | ggml : fix padding in timestep embedding kernels (... |
| 2025-09-16 |
Daniel Bevenius | ci : upload xcframework artifact from ios-xcode-build... |
| 2025-09-16 |
Bowen Han | fix: apply clang-format to CUDA macros (#16017) |
| 2025-09-16 |
Daniel Bevenius | ci : update macos-latest* jobs to use macos-latest... |
| 2025-09-16 |
Yuri Khrustalev | cmake : Do not install tools on iOS targets (#15903) |
| 2025-09-16 |
Aman Gupta | Add LLaDA-7b-MoE diffusion model (#16003) |
| 2025-09-15 |
Jake Karnes | CUDA: fix im2col_3d to respect non-contiguous inputs... |
| 2025-09-15 |
Diego Devesa | docker : enable rocWMMA in ROCm images, add gfx1151... |
| 2025-09-15 |
Diego Devesa | releases : switch to rocWMMA develop branch, add gfx115... |
| 2025-09-15 |
yael-works | SYCL: Add COUNT_EQUAL operator support (#15991) |
| 2025-09-15 |
Nikolay Popov | llama-run: Fix model download on Windows (#15988) |
| 2025-09-15 |
Aman Gupta | CUDA: some micro-optimizations in mmf.cuh for mul_mat_i... |
| 2025-09-15 |
ddh0 | fix KLD percentile output (#15999) |
| 2025-09-14 |
Sigbjørn Skjæret | model : add grok-2 support (#15539) |
| 2025-09-14 |
Sigbjørn Skjæret | server : only attempt to enable thinking if using jinja... |
| 2025-09-14 |
Georgi Gerganov | metal : remove memory pools (#15966) |
| 2025-09-14 |
Adam | rocm.Dockerfile: added gfx1200,gfx1201 architectures... |
| 2025-09-14 |
Ruben Ortlam | Vulkan: Clean up mul_mm shader (#15987) |
| 2025-09-14 |
lcy | build: fix the build failures of Windows HIP release... |
| 2025-09-14 |
Georgi Gerganov | metal : fix kernel requirements (#15983) |
| 2025-09-14 |
Radoslav Gerganov | rpc : fix regression when --device is used (#15981) |
| 2025-09-14 |
Diego Devesa | releases : update ROCM, add gfx1200, gfx1201, gfx1151... |
| 2025-09-14 |
Radoslav Gerganov | doc : update documentation for --tensor-split (#15980) |
| 2025-09-14 |
Aaron Teo | ggml-zdnn: rm user mapped buffers (#15965) |
| 2025-09-13 |
Jeff Bolz | vulkan: fix failing dequant shaders (#15862) |