]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-06-05 Jeff Bolzvulkan: automatically deduce size of push constants...
2025-06-04 Ervin Áron... ggml-vulkan: adds support for op CONV_TRANSPOSE_1D...
2025-06-04 Georgi Gerganovkv-cache : refactor the update/defrag mechanism (#13988)
2025-06-04 Diego Devesaci : remove cuda 11.7 releases, switch runner to window...
2025-06-04 Diego Devesareleases : use dl backend for linux release, remove...
2025-06-04 Xuan-Son Nguyenllama-graph : use ggml_repeat_4d (#13998)
2025-06-04 Johannes GäßlerCUDA: fix FTZ in FA for Gemma 3 (#13991)
2025-06-04 Georgi Gerganovkv-cache : fix unified::seq_rm to work with seq_id...
2025-06-03 Jeff Bolzvulkan: fix warnings in perf logger querypool code...
2025-06-03 Xuan-Son Nguyendocs : add "Quick start" section for new users (#13862)
2025-06-02 lhezopencl: add `backend_synchronize` (#13939)
2025-06-02 rmatifOpenCL: Add concat, tsembd, upscale, tanh, pad and...
2025-06-02 Georgi Gerganovserver : disable speculative decoding for SWA models...
2025-06-02 Georgi Gerganovmetal : use F32 accumulators in FA kernels (#13975)
2025-06-02 Georgi Gerganovgemma : more consistent attention scaling for v2 and...
2025-06-02 Olivier Chafik`server`: update deepseek reasoning format (pass reason...
2025-06-02 Xuan-Son Nguyenmtmd : fix memory leak in mtmd_helper_eval_chunk_single...
2025-06-02 shalinib-ibmcmake : Handle mixed-case 'Power' strings in POWER...
2025-06-02 Atharva Dubeysycl: quantize and reorder the input to q8_1 when reord...
2025-06-01 Johannes Gäßlergguf: fix failure on version == 0 (#13956)
2025-06-01 Sigbjørn Skjæretconvert : fix nomic-bert-moe mask token (#13757)
2025-06-01 Sigbjørn Skjæretconvert : fix vocab padding code for bert models (...
2025-06-01 Aaron Teoggml: check if non-native endian model is being loaded...
2025-06-01 Georgi Gerganovsync : ggml
2025-06-01 Kai Pastorvulkan : Remove unexpected ; (ggml/1253)
2025-06-01 Kai Pastorcmake : Fix broken CMake error messages (ggml/1252)
2025-06-01 Radoslav Gerganovggml : remove ggml_graph_import and ggml_graph_export...
2025-06-01 Georgi Gerganovsync : whisper.cpp (ggml/1250)
2025-06-01 Radoslav Gerganovggml : install dynamic backends (ggml/1240)
2025-06-01 Daniel Tangggml : Print backtrace on uncaught C++ exceptions ...
2025-06-01 ddh0readme : update bindings (#13950)
2025-06-01 Georgi Gerganovparallel : fix n_junk == 0 (#13952)
2025-06-01 Georgi Gerganovkv-cache : split implementation in separate sources...
2025-05-31 Max Krasnyanskythreading: support for GGML_SCHED_PRIO_LOW, update...
2025-05-31 Jiří Podivíndocs : Note about necessity of having libcurl installed...
2025-05-31 Olivier Chafikserver: allow unclosed thinking tags (#13931)
2025-05-31 Georgi Gerganovllama : deprecate explicit kv_self defrag/update calls...
2025-05-31 Georgi Gerganovllama : use n_swa + n_ubatch cells for SWA cache (...
2025-05-31 igardevwebui : Replace alert and confirm with custom modals...
2025-05-31 Georgi Gerganovllama : auto-batch preparation (#13845)
2025-05-31 Xuan-Son Nguyenmtmd : drop `_shared` from `libmtmd` name, merge helper...
2025-05-31 Georgi Gerganovkv-cache : refactor + add llama_memory_state_i (#13746)
2025-05-31 Shawn yangCUDA: add a prop in ggml_cuda_device_infor for distingu...
2025-05-30 Johannes GäßlerCUDA: fix typo in FlashAttention code (#13926)
2025-05-30 Diego Devesasched : avoid changing cur_copy when a graph is already...
2025-05-30 Georgi Gerganovparallel : increase the variability of the prompt lengt...
2025-05-30 Diego Devesacuda : prevent using split buffers with 3d/4d matrices...
2025-05-30 Akarshan BiswasSYCL: Add mrope kernel (#13755)
2025-05-30 Georgi Gerganovsync : vendor (#13901)
2025-05-30 Sigbjørn Skjæretconvert : fix rwkv bos/eos token (#13844)
2025-05-30 Xuan-Son Nguyenconvert : allow partial update to the chkhsh pre-tokeni...
2025-05-30 Đinh Trọng Huyllama : add support for DistilBert (#13907)
2025-05-30 zhangkaihuollama : use llm_build_granite for minicpm (#13911)
2025-05-29 Christian Kastnercmake: Guard GGML_CPU_ALL_VARIANTS by architecture...
2025-05-29 Sigbjørn Skjæretllama : add support for jina-reranker-v2 (#13900)
2025-05-29 Sigbjørn Skjæretgguf-py : add support for sub_type (in arrays) in GGUFW...
2025-05-29 Yibo Caiarm64: optimize q4_k_q8_k kernel with i8mm (#13886)
2025-05-29 Christian Kastnercmake: Factor out CPU architecture detection (#13883)
2025-05-29 Vineel Abhinavggml: aarch64: Implement SVE F32 kernels for Mamba...
2025-05-29 Georgi Gerganovtests : remove json.hpp from a test (#13880)
2025-05-29 Sigbjørn Skjæretconvert : workaround for AutoConfig dummy labels (...
2025-05-29 Sigbjørn Skjæretllama : add RobertaForSequenceClassification reranker...
2025-05-29 Vineel Abhinavggml: aarch64: Implement SVE F32 kernels for vector...
2025-05-28 Beinseziigguf-py : fix SafetensorRemote return on undefined...
2025-05-28 Xuan-Son Nguyenllama : fix KV shift for qwen2vl (#13870)
2025-05-28 Xuan-Son Nguyenmtmd : move helpers to dedicated library (⚠️ breaking...
2025-05-28 bandotici: disable LLAMA_CURL for Linux cross-builds (#13871)
2025-05-28 Đinh Trọng Huyllama : add support for BertForSequenceClassification...
2025-05-28 Đinh Trọng Huyconvert: small addition to support LlamaModel (#13838)
2025-05-28 Skyserver: fix remove 'image_url'/'input_audio' json-objec...
2025-05-28 Xuan-Son Nguyenconvert : fix qwen omni conversion (#13859)
2025-05-28 Alex Fanthometests : change umlaut test (#11600)
2025-05-28 Johannes GäßlerCUDA: fix FA tg at long context for CC >= 8.9 (#13852)
2025-05-28 Xuan-Son Nguyenconvert : fix tensor naming conflict for llama 4 vision...
2025-05-28 leo-ponyCANN: Add SOC TYPE printing in cmake configuration...
2025-05-27 lhezopencl: add new ops - `argsort`, `div`, `sub`, `addrows...
2025-05-27 lhezopencl: mark `mul_mat` `f32f32` as supporting non-conti...
2025-05-27 Jeff Bolzvulkan: use timestamp queries for GGML_VULKAN_PERF...
2025-05-27 Georgi Gerganovcmake : add llama-cparams.cpp to build (#13832)
2025-05-27 Akarshan BiswasSYCL: add gelu_erf kernel (#13749)
2025-05-27 Georgi Gerganovsync : ggml
2025-05-27 Xuan-Son Nguyenggml : add ggml_repeat_4d (#13824)
2025-05-27 xctanggml : riscv: add xtheadvector support (#13720)
2025-05-27 Xuan-Son Nguyenmtmd : support Qwen 2.5 Omni (input audio+vision, no...
2025-05-27 bandotidocs: remove link for llama-cli function calling (...
2025-05-27 Christian Kastnerggml-cpu: x86 feature detection is specific to x86...
2025-05-27 Diego Devesaggml : allow CUDA graphs when using pipeline parallelis...
2025-05-27 Georgi Gerganovkv-cells : track min/max used cells and per-sequence...
2025-05-27 Georgi Gerganovsampling : make sure samplers return at least 1 token...
2025-05-27 Georgi Gerganovllama : validate seq id batch input (#13809)
2025-05-26 Olivier Chafikserver: --offline mode (#13804)
2025-05-26 Georgi Gerganovscripts : add option to compare commits in Debug (...
2025-05-26 Georgi Gerganovcuda : avoid cuGetErrorString (#13791)
2025-05-26 Akarshan BiswasSYCL: Add non contiguous support in RMS_NORM and NORM...
2025-05-26 Olivier Chafikserver: fix streaming crashes (#13786)
2025-05-26 standby24x7examples/training: Fix file name in README (#13803)
2025-05-26 Olivier Chafik`server`: fix format of streamed tool call deltas ...
2025-05-26 Olivier Chafikserver: fix regression on streamed non-chat completion...
2025-05-26 Georgi Gerganovexamples : allow extracting embeddings from decoder...
2025-05-26 Georgi Gerganovllama : clarify deprecation message (#13794)
next