git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-04-02  Diego Devesa  llama : add option to override model tensor buffers...  upstream/0.0.5028
2025-04-02  Georgi Gerganov  llama : refactor kv cache guard (#12695)
2025-04-02  Sigbjørn Skjæret  vocab : BailingMoE : change possessive quantifiers...
2025-04-02  Xuan-Son Nguyen  common : remove json.hpp from common.cpp (#12697)
2025-04-02  Chenguang Li  [CANN] get_rows and dup optimization (#12671)
2025-04-01  Xuan-Son Nguyen  common : refactor downloading system, handle mmproj...
2025-04-01  Junil Kim  opencl : fix memory allocation size (#12649)
2025-04-01  jklincn  llama : use LLM_KV_GENERAL_FILE_TYPE instead of gguf_fi...
2025-04-01  Sigbjørn Skjæret  convert : BailingMoE : fix qkv split when head_dim...
2025-04-01  Georgi Gerganov  metal : use F32 prec in FA kernels (#12688)
2025-04-01  R0CKSTAR  Fix clang warning in gguf_check_reserved_keys (#12686)
2025-04-01  Wagner Bruna  vulkan: fix build when glslc doesn't support coopmat...
2025-04-01  Romain Biessy  SYCL: Rename oneMKL to oneMath (#12192)
2025-04-01  Akarshan Biswas  SYCL: switch to SYCL namespace (#12674)
2025-03-31  Sigbjørn Skjæret  convert : BailingMoE : avoid setting rope_dim to 0...
2025-03-31  Daniel Bevenius  vocab : add special infill tokens for CodeLlama (#11850)
2025-03-31  a3sh  ggml : faster ssm scan (#10558)
2025-03-31  Sigbjørn Skjæret  convert : Qwerky : use lora_rank_tokenshift and lora_ra...
2025-03-31  0cc4m  Vulkan: Add DP4A MMQ and Q8_1 quantization shader ...
2025-03-31  Georgi Gerganov  cmake : fix whitespace (#0)
2025-03-31  Georgi Gerganov  sync : ggml
2025-03-31  Sandro Hanea  cmake: improve Vulkan cooperative matrix support checks...
2025-03-31  Sigbjørn Skjæret  llava : proper description fix (#12668)
2025-03-31  Akarshan Biswas  SYCL: Remove misleading ggml_sycl_op_flatten function...
2025-03-31  Sigbjørn Skjæret  llava : fix clip loading GGUFs with missing description...
2025-03-31  marcoStocchi  tts : remove printfs (#12640)
2025-03-30  Sigbjørn Skjæret  llama : support BailingMoE (Ling) (#12634)
2025-03-30  Georgi Gerganov  metal : use constexpr in FA kernels + fix typedef ...
2025-03-30  Juyoung Suk  llama : add Trillion 7B model support (#12556)
2025-03-30  Sergei Vorobyov  llama-chat : Add Yandex instruct model template support...
2025-03-30  R0CKSTAR  musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNIN...
2025-03-30  Georgi Gerganov  sync : ggml
2025-03-30  Xuan-Son Nguyen  cpu : rm unused variable (ggml/1166)
2025-03-30  cmdr2  cpu: de-duplicate some of the operators and refactor...
2025-03-30  Daniel Bevenius  ggml : add logging for native build options/vars (whisp...
2025-03-30  Daniel Bevenius  examples : command.wasm updates (whisper/2904)
2025-03-29  Xuan-Son Nguyen  llama : fix non-causal mask for gemma 3 (#12615)
2025-03-29  Djip007  llama : change cpu_buft_list order: ACCEL -> GPU host...
2025-03-29  Jay  cmake : fix ccache conflict (#12522)
2025-03-29  hipudding  CANN : remove clang-format in ggml-cann (#12607)
2025-03-28  Sigbjørn Skjæret  llama : fix incorrect Qwen2Moe ffn_moe_out graph callba...
2025-03-28  Georgi Gerganov  metal : improve FA + improve MoE (#12612)
2025-03-28  Icenowy Zheng  vulkan: fix coopmat shader generation when cross-compil...
2025-03-28  Johannes Gäßler  llama: fix error on bad grammar (#12628)
2025-03-28  Benson Wong  server : include speculative decoding stats when timing...
2025-03-28  Radoslav Gerganov  rpc : update README for cache usage (#12620)
2025-03-28  amritahs-ibm  llamafile : ppc64le GEMV forwarding for FP32. (#12594)
2025-03-28  Radoslav Gerganov  rpc : send hash when tensor data is above some fixed...
2025-03-27  Piotr  server : Support listening on a unix socket (#12613)
2025-03-27  Georgi Gerganov  media : add SVG logo [no ci] (#12616)
2025-03-27  lhez  opencl: add multi and vision rope, `gelu_quick` and...
2025-03-27  Si1w  llama : add PLM GGUF Conversion & Inference Support...
2025-03-27  HighDoping  model : restore support for T5Encoder (#12590)
2025-03-27  Csaba Kecskemeti  convert : Support Qwen2_5_VLForConditionalGeneration...
2025-03-27  Georgi Gerganov  sync : ggml
2025-03-27  Georgi Gerganov  scripts : update sync + fix cmake merge
2025-03-27  Georgi Gerganov  sync : ggml
2025-03-27  Georgi Gerganov  cmake : sync/merge PowerPC build commands (#0)
2025-03-27  amritahs-ibm  llamafile : ppc64le MMA implementation for Q4_0. (...
2025-03-27  xctan  ggml : riscv: add 128-bit RVV support (#12530)
2025-03-27  Georgi Gerganov  llama : make loras compatible with repacking (#12593)
2025-03-27  Akarshan Biswas  SYCL: implement memset ggml backend buffer interface...
2025-03-26  Slobodan Josic  HIP: Add support for RDNA4 targets (#12372)
2025-03-26  Georgi Gerganov  metal : refactor mat-vec code (#12569)
2025-03-26  Michał Moskal  upgrade to llguidance 0.7.10 (#12576)
2025-03-26  Ivy233  clip: Fix llama-llava-clip-quantize-cli quantization...
2025-03-26  Georgi Gerganov  convert : fix squeeze for ssm_conv tensors (#12573)
2025-03-26  Georgi Gerganov  ggml : fix MUL_MAT_ID repack with Q8_K (#12544)
2025-03-26  R0CKSTAR  doc: [MUSA] minor changes (#12583)
2025-03-25  Sigbjørn Skjæret  convert: fix Mistral3/Gemma3 model hparams init (#12571)
2025-03-25  Eric Curtin  run: de-duplicate fmt and format functions and optimize...
2025-03-25  Dan Johansson  ggml-cpu : update KleidiAI to v1.5.0 (#12568)
2025-03-25  Akarshan Biswas  SYCL: disable Q4_0 reorder optimization (#12560)
2025-03-25  Dan Johansson  docs : add build instructions for KleidiAI (#12563)
2025-03-25  R0CKSTAR  ci: [MUSA] add CI and update doc (#12562)
2025-03-25  Georgi Gerganov  context : fix worst-case reserve outputs (#12545)
2025-03-24  Akarshan Biswas  ci: [SYCL] ggml-ci Use main GPU and enable sysman ...
2025-03-24  lhez  opencl: simplify kernel embedding logic in cmakefile...
2025-03-24  Akarshan Biswas  CI: fix SYCL build (#12546)
2025-03-24  Tei Home  docs: update: improve the Fedoa CUDA guide (#12536)
2025-03-24  compilade  llama-vocab : add SuperBPE pre-tokenizer (#12532)
2025-03-24  R0CKSTAR  CUDA: Fix clang warnings (#12540)
2025-03-24  Prajwal B Mehendarkar  mmap : skip resource limit checks on AIX (#12541)
2025-03-24  Jeff Bolz  vulkan: fix mul_mat_vec failure in backend tests (...
2025-03-23  Marius Gerdes  server : Add verbose output to OAI compatible chat...
2025-03-23  Lars Sonchocky...  install : add macports (#12518)
2025-03-22  Xuan-Son Nguyen  llama : gemma3 : use output tensor if it exists in...
2025-03-22  Georgi Gerganov  ggml : fix quantized cpy op (#12310)
2025-03-22  R0CKSTAR  musa: refine compute capability (#12493)
2025-03-22  Jeff Bolz  vulkan: Optimize mul_mat_vec p021 and nc shaders (...
2025-03-21  stduhpf  Vulkan: RTE rounding for cpy to quant (#12480)
2025-03-21  Eve  vulkan: workaround for AMD Windows driver 16 bit unpack...
2025-03-21  Georgi Gerganov  model : do not repack if a GPU device is present (...
2025-03-21  Sigbjørn Skjæret  chore : cleanup llama_model_loader::TENSOR_ usage ...
2025-03-21  marcoStocchi  llama-tts : avoid crashes related to bad model file...
2025-03-21  蕭澧邦  [SYCL] Fix build on Windows when ccache enabled (#9954...
2025-03-21  Svetlozar Georgiev  sycl: cleanup oneDNN related code (#12097)
2025-03-20  Woof Dog  webui : Prevent rerendering on textarea input (#12299)
2025-03-20  Sigbjørn Skjæret  llama : make Qwen2MoE QKV bias optional (#12477)
2025-03-20  Srihari-mcw  ggml : block interleaving support for Q4_K quantization...