2025-06-05 |
Jeff Bolz | vulkan: automatically deduce size of push constants... |
commit | commitdiff | tree |
2025-06-04 |
Ervin Áron... | ggml-vulkan: adds support for op CONV_TRANSPOSE_1D... |
commit | commitdiff | tree |
2025-06-04 |
Georgi Gerganov | kv-cache : refactor the update/defrag mechanism (#13988) |
commit | commitdiff | tree |
2025-06-04 |
Diego Devesa | ci : remove cuda 11.7 releases, switch runner to window... |
commit | commitdiff | tree |
2025-06-04 |
Diego Devesa | releases : use dl backend for linux release, remove... |
commit | commitdiff | tree |
2025-06-04 |
Xuan-Son Nguyen | llama-graph : use ggml_repeat_4d (#13998) |
commit | commitdiff | tree |
2025-06-04 |
Johannes Gäßler | CUDA: fix FTZ in FA for Gemma 3 (#13991) |
commit | commitdiff | tree |
2025-06-04 |
Georgi Gerganov | kv-cache : fix unified::seq_rm to work with seq_id... |
commit | commitdiff | tree |
2025-06-03 |
Jeff Bolz | vulkan: fix warnings in perf logger querypool code... |
commit | commitdiff | tree |
2025-06-03 |
Xuan-Son Nguyen | docs : add "Quick start" section for new users (#13862) |
commit | commitdiff | tree |
2025-06-02 |
lhez | opencl: add `backend_synchronize` (#13939) |
commit | commitdiff | tree |
2025-06-02 |
rmatif | OpenCL: Add concat, tsembd, upscale, tanh, pad and... |
commit | commitdiff | tree |
2025-06-02 |
Georgi Gerganov | server : disable speculative decoding for SWA models... |
commit | commitdiff | tree |
2025-06-02 |
Georgi Gerganov | metal : use F32 accumulators in FA kernels (#13975) |
commit | commitdiff | tree |
2025-06-02 |
Georgi Gerganov | gemma : more consistent attention scaling for v2 and... |
commit | commitdiff | tree |
2025-06-02 |
Olivier Chafik | `server`: update deepseek reasoning format (pass reason... |
commit | commitdiff | tree |
2025-06-02 |
Xuan-Son Nguyen | mtmd : fix memory leak in mtmd_helper_eval_chunk_single... |
commit | commitdiff | tree |
2025-06-02 |
shalinib-ibm | cmake : Handle mixed-case 'Power' strings in POWER... |
commit | commitdiff | tree |
2025-06-02 |
Atharva Dubey | sycl: quantize and reorder the input to q8_1 when reord... |
commit | commitdiff | tree |
2025-06-01 |
Johannes Gäßler | gguf: fix failure on version == 0 (#13956) |
commit | commitdiff | tree |
2025-06-01 |
Sigbjørn Skjæret | convert : fix nomic-bert-moe mask token (#13757) |
commit | commitdiff | tree |
2025-06-01 |
Sigbjørn Skjæret | convert : fix vocab padding code for bert models (... |
commit | commitdiff | tree |
2025-06-01 |
Aaron Teo | ggml: check if non-native endian model is being loaded... |
commit | commitdiff | tree |
2025-06-01 |
Georgi Gerganov | sync : ggml |
commit | commitdiff | tree |
2025-06-01 |
Kai Pastor | vulkan : Remove unexpected ; (ggml/1253) |
commit | commitdiff | tree |
2025-06-01 |
Kai Pastor | cmake : Fix broken CMake error messages (ggml/1252) |
commit | commitdiff | tree |
2025-06-01 |
Radoslav Gerganov | ggml : remove ggml_graph_import and ggml_graph_export... |
commit | commitdiff | tree |
2025-06-01 |
Georgi Gerganov | sync : whisper.cpp (ggml/1250) |
commit | commitdiff | tree |
2025-06-01 |
Radoslav Gerganov | ggml : install dynamic backends (ggml/1240) |
commit | commitdiff | tree |
2025-06-01 |
Daniel Tang | ggml : Print backtrace on uncaught C++ exceptions ... |
commit | commitdiff | tree |
2025-06-01 |
ddh0 | readme : update bindings (#13950) |
commit | commitdiff | tree |
2025-06-01 |
Georgi Gerganov | parallel : fix n_junk == 0 (#13952) |
commit | commitdiff | tree |
2025-06-01 |
Georgi Gerganov | kv-cache : split implementation in separate sources... |
commit | commitdiff | tree |
2025-05-31 |
Max Krasnyansky | threading: support for GGML_SCHED_PRIO_LOW, update... |
commit | commitdiff | tree |
2025-05-31 |
Jiří Podivín | docs : Note about necessity of having libcurl installed... |
commit | commitdiff | tree |
2025-05-31 |
Olivier Chafik | server: allow unclosed thinking tags (#13931) |
commit | commitdiff | tree |
2025-05-31 |
Georgi Gerganov | llama : deprecate explicit kv_self defrag/update calls... |
commit | commitdiff | tree |
2025-05-31 |
Georgi Gerganov | llama : use n_swa + n_ubatch cells for SWA cache (... |
commit | commitdiff | tree |
2025-05-31 |
igardev | webui : Replace alert and confirm with custom modals... |
commit | commitdiff | tree |
2025-05-31 |
Georgi Gerganov | llama : auto-batch preparation (#13845) |
commit | commitdiff | tree |
2025-05-31 |
Xuan-Son Nguyen | mtmd : drop `_shared` from `libmtmd` name, merge helper... |
commit | commitdiff | tree |
2025-05-31 |
Georgi Gerganov | kv-cache : refactor + add llama_memory_state_i (#13746) |
commit | commitdiff | tree |
2025-05-31 |
Shawn yang | CUDA: add a prop in ggml_cuda_device_infor for distingu... |
commit | commitdiff | tree |
2025-05-30 |
Johannes Gäßler | CUDA: fix typo in FlashAttention code (#13926) |
commit | commitdiff | tree |
2025-05-30 |
Diego Devesa | sched : avoid changing cur_copy when a graph is already... |
commit | commitdiff | tree |
2025-05-30 |
Georgi Gerganov | parallel : increase the variability of the prompt lengt... |
commit | commitdiff | tree |
2025-05-30 |
Diego Devesa | cuda : prevent using split buffers with 3d/4d matrices... |
commit | commitdiff | tree |
2025-05-30 |
Akarshan Biswas | SYCL: Add mrope kernel (#13755) |
commit | commitdiff | tree |
2025-05-30 |
Georgi Gerganov | sync : vendor (#13901) |
commit | commitdiff | tree |
2025-05-30 |
Sigbjørn Skjæret | convert : fix rwkv bos/eos token (#13844) |
commit | commitdiff | tree |
2025-05-30 |
Xuan-Son Nguyen | convert : allow partial update to the chkhsh pre-tokeni... |
commit | commitdiff | tree |
2025-05-30 |
Đinh Trọng Huy | llama : add support for DistilBert (#13907) |
commit | commitdiff | tree |
2025-05-30 |
zhangkaihuo | llama : use llm_build_granite for minicpm (#13911) |
commit | commitdiff | tree |
2025-05-29 |
Christian Kastner | cmake: Guard GGML_CPU_ALL_VARIANTS by architecture... |
commit | commitdiff | tree |
2025-05-29 |
Sigbjørn Skjæret | llama : add support for jina-reranker-v2 (#13900) |
commit | commitdiff | tree |
2025-05-29 |
Sigbjørn Skjæret | gguf-py : add support for sub_type (in arrays) in GGUFW... |
commit | commitdiff | tree |
2025-05-29 |
Yibo Cai | arm64: optimize q4_k_q8_k kernel with i8mm (#13886) |
commit | commitdiff | tree |
2025-05-29 |
Christian Kastner | cmake: Factor out CPU architecture detection (#13883) |
commit | commitdiff | tree |
2025-05-29 |
Vineel Abhinav | ggml: aarch64: Implement SVE F32 kernels for Mamba... |
commit | commitdiff | tree |
2025-05-29 |
Georgi Gerganov | tests : remove json.hpp from a test (#13880) |
commit | commitdiff | tree |
2025-05-29 |
Sigbjørn Skjæret | convert : workaround for AutoConfig dummy labels (... |
commit | commitdiff | tree |
2025-05-29 |
Sigbjørn Skjæret | llama : add RobertaForSequenceClassification reranker... |
commit | commitdiff | tree |
2025-05-29 |
Vineel Abhinav | ggml: aarch64: Implement SVE F32 kernels for vector... |
commit | commitdiff | tree |
2025-05-28 |
Beinsezii | gguf-py : fix SafetensorRemote return on undefined... |
commit | commitdiff | tree |
2025-05-28 |
Xuan-Son Nguyen | llama : fix KV shift for qwen2vl (#13870) |
commit | commitdiff | tree |
2025-05-28 |
Xuan-Son Nguyen | mtmd : move helpers to dedicated library (⚠️ breaking... |
commit | commitdiff | tree |
2025-05-28 |
bandoti | ci: disable LLAMA_CURL for Linux cross-builds (#13871) |
commit | commitdiff | tree |
2025-05-28 |
Đinh Trọng Huy | llama : add support for BertForSequenceClassification... |
commit | commitdiff | tree |
2025-05-28 |
Đinh Trọng Huy | convert: small addition to support LlamaModel (#13838) |
commit | commitdiff | tree |
2025-05-28 |
Sky | server: fix remove 'image_url'/'input_audio' json-objec... |
commit | commitdiff | tree |
2025-05-28 |
Xuan-Son Nguyen | convert : fix qwen omni conversion (#13859) |
commit | commitdiff | tree |
2025-05-28 |
Alex Fanthome | tests : change umlaut test (#11600) |
commit | commitdiff | tree |
2025-05-28 |
Johannes Gäßler | CUDA: fix FA tg at long context for CC >= 8.9 (#13852) |
commit | commitdiff | tree |
2025-05-28 |
Xuan-Son Nguyen | convert : fix tensor naming conflict for llama 4 vision... |
commit | commitdiff | tree |
2025-05-28 |
leo-pony | CANN: Add SOC TYPE printing in cmake configuration... |
commit | commitdiff | tree |
2025-05-27 |
lhez | opencl: add new ops - `argsort`, `div`, `sub`, `addrows... |
commit | commitdiff | tree |
2025-05-27 |
lhez | opencl: mark `mul_mat` `f32f32` as supporting non-conti... |
commit | commitdiff | tree |
2025-05-27 |
Jeff Bolz | vulkan: use timestamp queries for GGML_VULKAN_PERF... |
commit | commitdiff | tree |
2025-05-27 |
Georgi Gerganov | cmake : add llama-cparams.cpp to build (#13832) |
commit | commitdiff | tree |
2025-05-27 |
Akarshan Biswas | SYCL: add gelu_erf kernel (#13749) |
commit | commitdiff | tree |
2025-05-27 |
Georgi Gerganov | sync : ggml |
commit | commitdiff | tree |
2025-05-27 |
Xuan-Son Nguyen | ggml : add ggml_repeat_4d (#13824) |
commit | commitdiff | tree |
2025-05-27 |
xctan | ggml : riscv: add xtheadvector support (#13720) |
commit | commitdiff | tree |
2025-05-27 |
Xuan-Son Nguyen | mtmd : support Qwen 2.5 Omni (input audio+vision, no... |
commit | commitdiff | tree |
2025-05-27 |
bandoti | docs: remove link for llama-cli function calling (... |
commit | commitdiff | tree |
2025-05-27 |
Christian Kastner | ggml-cpu: x86 feature detection is specific to x86... |
commit | commitdiff | tree |
2025-05-27 |
Diego Devesa | ggml : allow CUDA graphs when using pipeline parallelis... |
commit | commitdiff | tree |
2025-05-27 |
Georgi Gerganov | kv-cells : track min/max used cells and per-sequence... |
commit | commitdiff | tree |
2025-05-27 |
Georgi Gerganov | sampling : make sure samplers return at least 1 token... |
commit | commitdiff | tree |
2025-05-27 |
Georgi Gerganov | llama : validate seq id batch input (#13809) |
commit | commitdiff | tree |
2025-05-26 |
Olivier Chafik | server: --offline mode (#13804) |
commit | commitdiff | tree |
2025-05-26 |
Georgi Gerganov | scripts : add option to compare commits in Debug (... |
commit | commitdiff | tree |
2025-05-26 |
Georgi Gerganov | cuda : avoid cuGetErrorString (#13791) |
commit | commitdiff | tree |
2025-05-26 |
Akarshan Biswas | SYCL: Add non contiguous support in RMS_NORM and NORM... |
commit | commitdiff | tree |
2025-05-26 |
Olivier Chafik | server: fix streaming crashes (#13786) |
commit | commitdiff | tree |
2025-05-26 |
standby24x7 | examples/training: Fix file name in README (#13803) |
commit | commitdiff | tree |
2025-05-26 |
Olivier Chafik | `server`: fix format of streamed tool call deltas ... |
commit | commitdiff | tree |
2025-05-26 |
Olivier Chafik | server: fix regression on streamed non-chat completion... |
commit | commitdiff | tree |
2025-05-26 |
Georgi Gerganov | examples : allow extracting embeddings from decoder... |
commit | commitdiff | tree |
2025-05-26 |
Georgi Gerganov | llama : clarify deprecation message (#13794) |
commit | commitdiff | tree |
next |