| 2025-10-21 |
Aman Gupta | ggml: add ggml_can_fuse_subgraph (#16662) |
commit | commitdiff | tree |
| 2025-10-21 |
lhez | opencl: fix warnings and clean up profiling (#16688) |
commit | commitdiff | tree |
| 2025-10-21 |
Jeff Bolz | vulkan: Handle FA with all -inf mask values (#16447) |
commit | commitdiff | tree |
| 2025-10-20 |
YehuditE | sycl : add PAD_REFLECT_D1 operator support (#16145) |
commit | commitdiff | tree |
| 2025-10-20 |
Sigbjørn Skjæret | model : add BailingMoeV2 support (#16063) |
commit | commitdiff | tree |
| 2025-10-20 |
Aleksander... | Handle legacy 'context' attachments (#16687) |
commit | commitdiff | tree |
| 2025-10-20 |
Diego Devesa | ggml-alloc : fix leak when reusing a tensor with a... |
commit | commitdiff | tree |
| 2025-10-20 |
Aleksander... | Prevent premature submission on IME input (#16673) |
commit | commitdiff | tree |
| 2025-10-20 |
Aleksander... | Import/Export UX improvements (#16619) |
commit | commitdiff | tree |
| 2025-10-20 |
Aleksander... | Enable per-conversation loading states to allow having... |
commit | commitdiff | tree |
| 2025-10-20 |
takuya kodama | llama-batch: fix build fails with `-Werror=missing... |
commit | commitdiff | tree |
| 2025-10-20 |
Ron Evans | readme: update bindings (#16651) |
commit | commitdiff | tree |
| 2025-10-20 |
safranowith | SYCL: Add support for FLOOR,CEIL,ROUND and TRUNC unary... |
commit | commitdiff | tree |
| 2025-10-20 |
takuya kodama | llama-context: only warn on pooling_type when user... |
commit | commitdiff | tree |
| 2025-10-19 |
Giuseppe Scrivano | model : add Granite Hybrid types (#16635) |
commit | commitdiff | tree |
| 2025-10-19 |
Aaron Teo | ci : fix binaries release failure for s390x (binaries... |
commit | commitdiff | tree |
| 2025-10-19 |
Sigbjørn Skjæret | ci : avoid manual updates of docs/ops.md (#16663) |
commit | commitdiff | tree |
| 2025-10-19 |
Aaron Teo | ci: include s390x release binaries (#16648) |
commit | commitdiff | tree |
| 2025-10-19 |
Aman Gupta | CODEOWNERS: update for ggml-cuda/mmf (#16660) |
commit | commitdiff | tree |
| 2025-10-18 |
Johannes Gäßler | HIP: fix GPU_TARGETS (#16642) |
commit | commitdiff | tree |
| 2025-10-18 |
Jeff Bolz | vulkan: Implement topk_moe fused shader, ported from... |
commit | commitdiff | tree |
| 2025-10-18 |
Aman Gupta | CUDA: use registers instead of smem in topk-moe (#16647) |
commit | commitdiff | tree |
| 2025-10-18 |
Shawn Gu | opencl: transposed gemm/gemv moe kernel with mxfp4... |
commit | commitdiff | tree |
| 2025-10-17 |
Johannes Gäßler | llama-model: fix insonsistent ctxs <-> bufs order ... |
commit | commitdiff | tree |
| 2025-10-17 |
Radoslav Gerganov | rpc : report actual free memory (#16616) |
commit | commitdiff | tree |
| 2025-10-17 |
Giuseppe Scrivano | vulkan: Add State Space Model (SSM) Operations Support... |
commit | commitdiff | tree |
| 2025-10-17 |
muggle-stack | ggml : fix SpaceMit IME array out-of-bounds in task... |
commit | commitdiff | tree |
| 2025-10-17 |
Pascal | webui: reorganize settings layout (#16607) |
commit | commitdiff | tree |
| 2025-10-17 |
Jeff Bolz | vulkan: fix debug build (add_rms_len/data not found... |
commit | commitdiff | tree |
| 2025-10-17 |
Ilia Ilmer | metal : add `CONV_TRANSPOSE_2D` (#16542) |
commit | commitdiff | tree |
| 2025-10-17 |
Olivier Chafik | grammar : use int64_t to avoid int overflows in int... |
commit | commitdiff | tree |
| 2025-10-17 |
GittyBurstein | SYCL SET operator optimized for F32 tensors (#16350) |
commit | commitdiff | tree |
| 2025-10-16 |
Xuan-Son Nguyen | mtmd : support home-cooked Mistral Small Omni (#14928) |
commit | commitdiff | tree |
| 2025-10-16 |
Pascal | fix: added a normalization step for MathJax-style ... |
commit | commitdiff | tree |
| 2025-10-16 |
GittyBurstein | sycl : add ARANGE operator (#16362) |
commit | commitdiff | tree |
| 2025-10-16 |
Chenguang Li | CANN: format code using .clang-format (#15863) |
commit | commitdiff | tree |
| 2025-10-16 |
takasurazeem | common : Update the docs on -t --threads (#16236) |
commit | commitdiff | tree |
| 2025-10-16 |
takuya kodama | ggml-cpu: replace putenv with setenv for const-correctn... |
commit | commitdiff | tree |
| 2025-10-16 |
yael-works | SYCL: Add GGML_OP_MEAN operator support (#16009) |
commit | commitdiff | tree |
| 2025-10-15 |
Aleksei Nikiforov | gguf-py : add support for endian conversion of BF16... |
commit | commitdiff | tree |
| 2025-10-15 |
safranowith | cpu : add FLOOR, CEIL, ROUND and TRUNC unary operators... |
commit | commitdiff | tree |
| 2025-10-15 |
lhez | opencl: add q8_0 mm support (#16469) |
commit | commitdiff | tree |
| 2025-10-15 |
lhez | opencl: fix FA for f32 (#16584) |
commit | commitdiff | tree |
| 2025-10-15 |
Aleksander... | Add server-driven parameter defaults and syncing (... |
commit | commitdiff | tree |
| 2025-10-15 |
Sam/Samuel | metal: optimise `GGML_OP_SUM` (#16559) |
commit | commitdiff | tree |
| 2025-10-15 |
Georgi Gerganov | server : fix img token logs (#16595) |
commit | commitdiff | tree |
| 2025-10-15 |
Xuan-Son Nguyen | llama-quant: add support for mmproj (#16592) |
commit | commitdiff | tree |
| 2025-10-15 |
Julius Tischbein | CUDA: Changing the CUDA scheduling strategy to spin... |
commit | commitdiff | tree |
| 2025-10-15 |
Georgi Gerganov | server : fix mtmd checkpoints (#16591) |
commit | commitdiff | tree |
| 2025-10-14 |
Georgi Gerganov | metal : avoid using Metal's gpuAddress property (#16576) |
commit | commitdiff | tree |
| 2025-10-14 |
SavicStefan | vulkan: Add ACC_TYPE_VEC2 implementation (#16203) upstream/0.0.6764 |
commit | commitdiff | tree |
| 2025-10-14 |
Aman Gupta | CUDA + openCL: fix bug in accessing rms_norm->src while... |
commit | commitdiff | tree |
| 2025-10-14 |
Jeff Bolz | vulkan: Support FA with K/V in F32 (#16543) |
commit | commitdiff | tree |
| 2025-10-14 |
Jeff Bolz | vulkan: Improve build time for MSVC (#16545) |
commit | commitdiff | tree |
| 2025-10-14 |
Johannes Gäßler | CUDA: enable FA for FP32 KV cache (#16546) |
commit | commitdiff | tree |
| 2025-10-14 |
Aman Gupta | CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557) |
commit | commitdiff | tree |
| 2025-10-14 |
Aman Gupta | CUDA: add fp kernel for larger batch size MoE (#16512) |
commit | commitdiff | tree |
| 2025-10-14 |
Anav Prasad | cuda : remove legacy copy-op pointer indirection code... |
commit | commitdiff | tree |
| 2025-10-14 |
Georgi Gerganov | server : dynamic token limit for prompt cache (#16560) |
commit | commitdiff | tree |
| 2025-10-13 |
Georgi Gerganov | metal : FA support F32 K and V and head size = 32 ... |
commit | commitdiff | tree |
| 2025-10-13 |
Georgi Gerganov | graph : support cacheless embeddings with FA and iSWA... |
commit | commitdiff | tree |
| 2025-10-13 |
lhez | opencl: fix build targeting CL 2 (#16554) |
commit | commitdiff | tree |
| 2025-10-13 |
Johannes Gäßler | CUDA: fix numerical issues in tile FA kernel (#16540) |
commit | commitdiff | tree |
| 2025-10-13 |
Jie Fu (傅杰) | ggml : fix build broken with -march=armv9-a on MacOS... |
commit | commitdiff | tree |
| 2025-10-13 |
Chenguang Li | CANN: fix CPU memory leak in CANN backend (#16549) |
commit | commitdiff | tree |
| 2025-10-13 |
Pascal | fix: add remark plugin to render raw HTML as literal... |
commit | commitdiff | tree |
| 2025-10-13 |
Sam/Samuel | metal: add support for opt_step_sgd (#16539) |
commit | commitdiff | tree |
| 2025-10-13 |
Georgi Gerganov | ggml : fix scalar path for computing norm (#16558) |
commit | commitdiff | tree |
| 2025-10-13 |
hipudding | CANN: Update several operators to support FP16 data... |
commit | commitdiff | tree |
| 2025-10-12 |
Sam/Samuel | metal : add opt_step_adamw and op_sum (#16529) |
commit | commitdiff | tree |
| 2025-10-12 |
Pascal | webui: remove client-side context pre-check and rely... |
commit | commitdiff | tree |
| 2025-10-12 |
Neo Zhang Jianyu | [SYCL] fix UT fault cases: count-equal, argsort, pad... |
commit | commitdiff | tree |
| 2025-10-12 |
Mathieu Baudier | ci : add Vulkan on Ubuntu with default packages build... |
commit | commitdiff | tree |
| 2025-10-12 |
Aldehir Rojas | common : handle unicode during partial json parsing... |
commit | commitdiff | tree |
| 2025-10-12 |
Georgi Gerganov | common : update presets (#16504) |
commit | commitdiff | tree |
| 2025-10-12 |
sirus20x6 | ggml : Fix FP16 ELU positive branch (#16519) |
commit | commitdiff | tree |
| 2025-10-12 |
Daniel Bevenius | hparams : add check for layer index in is_recurrent... |
commit | commitdiff | tree |
| 2025-10-12 |
sirus20x6 | ggml: Correct SVE implementation in ggml_vec_dot_f16_un... |
commit | commitdiff | tree |
| 2025-10-11 |
Johannes Gäßler | CUDA: faster tile FA, add oob checks, more HSs (#16492) |
commit | commitdiff | tree |
| 2025-10-11 |
Georgi Gerganov | metal : fix mul-mm condition + fix mul-mv permuted... |
commit | commitdiff | tree |
| 2025-10-11 |
Pascal | feat: render user content as markdown option (#16358) |
commit | commitdiff | tree |
| 2025-10-11 |
Yann Follet | server / ranking : add sorting and management of top_n... |
commit | commitdiff | tree |
| 2025-10-11 |
Diego Devesa | cuda : avoid initializing unused devices (#16510) |
commit | commitdiff | tree |
| 2025-10-11 |
amirai21 | convert : correctly handle LLaMA tokenizer for Jamba... |
commit | commitdiff | tree |
| 2025-10-10 |
Georgi Gerganov | server : fix division by zero when reporting stats... |
commit | commitdiff | tree |
| 2025-10-10 |
Georgi Gerganov | vocab : mark EOT token for Granite models (#16499) |
commit | commitdiff | tree |
| 2025-10-10 |
Radoslav Gerganov | server : return HTTP 400 if prompt exceeds context... |
commit | commitdiff | tree |
| 2025-10-10 |
Radoslav Gerganov | server : log requests to /v1/completions (#16495) |
commit | commitdiff | tree |
| 2025-10-10 |
Prajwal B Mehendarkar | cmake : Dont define XOPENSOURCE on AIX (#16481) |
commit | commitdiff | tree |
| 2025-10-09 |
Pascal | webui: updated the chat service to only include max_tok... |
commit | commitdiff | tree |
| 2025-10-09 |
duduta | cpu : optimize the ggml NORM operation (#15953) |
commit | commitdiff | tree |
| 2025-10-09 |
Georgi Gerganov | server : host-memory prompt caching (#16391) |
commit | commitdiff | tree |
| 2025-10-09 |
Pascal | No markdown in cot (#16483) |
commit | commitdiff | tree |
| 2025-10-09 |
Daniel Bevenius | model-conversion : add support for SentenceTransformers... |
commit | commitdiff | tree |
| 2025-10-09 |
sudhiarm | ci: add ARM64 Kleidiai build and test support (#16462) |
commit | commitdiff | tree |
| 2025-10-09 |
Chenguang Li | CANN: Improve ACL graph matching (#16166) |
commit | commitdiff | tree |
| 2025-10-09 |
Charles Xu | kleidiai: kernel interface refactoring (#16460) |
commit | commitdiff | tree |
| 2025-10-09 |
Neo Zhang Jianyu | [SYCL] refactor soft_max, add soft_max_back (#16472) |
commit | commitdiff | tree |
| 2025-10-09 |
Saba Fallah | model: EmbeddingGemma Adding Support for SentenceTransf... |
commit | commitdiff | tree |
| 2025-10-08 |
Pascal | refactor: centralize CoT parsing in backend for streami... |
commit | commitdiff | tree |
| next |