git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-03-27  Georgi Gerganov  media : add SVG logo [no ci] (#12616)
2025-03-27  lhez  opencl: add multi and vision rope, `gelu_quick` and...
2025-03-27  Si1w  llama : add PLM GGUF Conversion & Inference Support...
2025-03-27  HighDoping  model : restore support for T5Encoder (#12590)
2025-03-27  Csaba Kecskemeti  convert : Support Qwen2_5_VLForConditionalGeneration...
2025-03-27  Georgi Gerganov  sync : ggml
2025-03-27  Georgi Gerganov  scripts : update sync + fix cmake merge
2025-03-27  Georgi Gerganov  sync : ggml
2025-03-27  Georgi Gerganov  cmake : sync/merge PowerPC build commands (#0)
2025-03-27  amritahs-ibm  llamafile : ppc64le MMA implementation for Q4_0. (...
2025-03-27  xctan  ggml : riscv: add 128-bit RVV support (#12530)
2025-03-27  Georgi Gerganov  llama : make loras compatible with repacking (#12593)
2025-03-27  Akarshan Biswas  SYCL: implement memset ggml backend buffer interface...
2025-03-26  Slobodan Josic  HIP: Add support for RDNA4 targets (#12372)
2025-03-26  Georgi Gerganov  metal : refactor mat-vec code (#12569)
2025-03-26  Michał Moskal  upgrade to llguidance 0.7.10 (#12576)
2025-03-26  Ivy233  clip: Fix llama-llava-clip-quantize-cli quantization...
2025-03-26  Georgi Gerganov  convert : fix squeeze for ssm_conv tensors (#12573)
2025-03-26  Georgi Gerganov  ggml : fix MUL_MAT_ID repack with Q8_K (#12544)
2025-03-26  R0CKSTAR  doc: [MUSA] minor changes (#12583)
2025-03-25  Sigbjørn Skjæret  convert: fix Mistral3/Gemma3 model hparams init (#12571)
2025-03-25  Eric Curtin  run: de-duplicate fmt and format functions and optimize...
2025-03-25  Dan Johansson  ggml-cpu : update KleidiAI to v1.5.0 (#12568)
2025-03-25  Akarshan Biswas  SYCL: disable Q4_0 reorder optimization (#12560)
2025-03-25  Dan Johansson  docs : add build instructions for KleidiAI (#12563)
2025-03-25  R0CKSTAR  ci: [MUSA] add CI and update doc (#12562)
2025-03-25  Georgi Gerganov  context : fix worst-case reserve outputs (#12545)
2025-03-24  Akarshan Biswas  ci: [SYCL] ggml-ci Use main GPU and enable sysman ...
2025-03-24  lhez  opencl: simplify kernel embedding logic in cmakefile...
2025-03-24  Akarshan Biswas  CI: fix SYCL build (#12546)
2025-03-24  Tei Home  docs: update: improve the Fedoa CUDA guide (#12536)
2025-03-24  compilade  llama-vocab : add SuperBPE pre-tokenizer (#12532)
2025-03-24  R0CKSTAR  CUDA: Fix clang warnings (#12540)
2025-03-24  Prajwal B Mehendarkar  mmap : skip resource limit checks on AIX (#12541)
2025-03-24  Jeff Bolz  vulkan: fix mul_mat_vec failure in backend tests (...
2025-03-23  Marius Gerdes  server : Add verbose output to OAI compatible chat...
2025-03-23  Lars Sonchocky...  install : add macports (#12518)
2025-03-22  Xuan-Son Nguyen  llama : gemma3 : use output tensor if it exists in...
2025-03-22  Georgi Gerganov  ggml : fix quantized cpy op (#12310)
2025-03-22  R0CKSTAR  musa: refine compute capability (#12493)
2025-03-22  Jeff Bolz  vulkan: Optimize mul_mat_vec p021 and nc shaders (...
2025-03-21  stduhpf  Vulkan: RTE rounding for cpy to quant (#12480)
2025-03-21  Eve  vulkan: workaround for AMD Windows driver 16 bit unpack...
2025-03-21  Georgi Gerganov  model : do not repack if a GPU device is present (...
2025-03-21  Sigbjørn Skjæret  chore : cleanup llama_model_loader::TENSOR_ usage ...
2025-03-21  marcoStocchi  llama-tts : avoid crashes related to bad model file...
2025-03-21  蕭澧邦  [SYCL] Fix build on Windows when ccache enabled (#9954...
2025-03-21  Svetlozar Georgiev  sycl: cleanup oneDNN related code (#12097)
2025-03-20  Woof Dog  webui : Prevent rerendering on textarea input (#12299)
2025-03-20  Sigbjørn Skjæret  llama : make Qwen2MoE QKV bias optional (#12477)
2025-03-20  Srihari-mcw  ggml : block interleaving support for Q4_K quantization...
2025-03-20  Bartowski  convert : avoid calls to tokenizer.added_tokens_decoder...
2025-03-19  fairydreaming  context : clear sets containing encoder output sequence...
2025-03-19  Gaurav Garg  CUDA: Improve flash decoding kernel GPU occupancy for...
2025-03-19  Jeff Bolz  vulkan: optimize iq1 coopmat2 dequant functions (#12427)
2025-03-19  Guus Waals  Fix visionOS build and add CI (#12415)
2025-03-19  Sigbjørn Skjæret  llama : add support for GPT2, Bloom and CodeShell tied...
2025-03-19  Sigbjørn Skjæret  convert : Support chat_template.json (#12460)
2025-03-19  Jeff Bolz  vulkan: Submit once enough matmul work has been recorde...
2025-03-18  lhez  opencl: improve profiling (#12442)
2025-03-18  Georgi Gerganov  graph : normalize Q, K, V shapes + sync cross attention...
2025-03-18  R0CKSTAR  musa: override warp_size of musa device to 32 (#12445)
2025-03-18  Xuan-Son Nguyen  llama : support converting Mistral Small text-only...
2025-03-18  Georgi Gerganov  speculative : fix seg fault in certain cases (#12454)
2025-03-18  Xuan-Son Nguyen  llama : add support for EXAONE tied word embeddings...
2025-03-18  Georgi Gerganov  context : always use non-causal attention for encoder...
2025-03-18  Łukasz Ślusarczyk  SYCL: using graphs is configurable by environment varia...
2025-03-18  Georgi Gerganov  server : fix warmup draft cache type (#12446)
2025-03-18  Prajwal B Mehendarkar  cmake : fix PowerPC build (#12241)
2025-03-18  fj-y-saito  ggml : add SVE support for q6_K_q8_K (#12361)
2025-03-18  0cc4m  Vulkan: Default to 1GB allocations instead of 4GB to...
2025-03-18  Łukasz Ślusarczyk  fixed compilation warnings in ggml-sycl (#12424)
2025-03-17  Molly Sophia  llama: Add support for RWKV v7 architecture (#12412)
2025-03-17  Sigbjørn Skjæret  docs : bring llama-cli conversation/template docs up...
2025-03-17  Gaurav Garg  cuda : enable CUDA Graph on CUDA Toolkit < 12.x (#12394)
2025-03-17  Guus Waals  ggml-vulkan: remove unused find_program(glslc) (#12416)
2025-03-17  Jeff Bolz  vulkan: Add N/2 and N/4 optimized paths in coopmat2...
2025-03-17  Daniele  vulkan: subgroup size tuning (#12087)
2025-03-17  Jeff Bolz  vulkan: use fp32 in coopmat2 q4_k dequant function...
2025-03-17  Jeff Bolz  vulkan: Pad N dimension of B matrix for coopmat2 perf...
2025-03-17  Jeff Bolz  vulkan: Adjust coopmat2 tile sizes and selection heuris...
2025-03-17  Christian Kastner  cmake : enable building llama.cpp using system libggml...
2025-03-17  Akarshan Biswas  SYCL: set extras only on GGML_TYPE_Q4_0 (#12366)
2025-03-16  Sigbjørn Skjæret  llama : fix OLMo-2-0325-32B-Instruct K-norm size (...
2025-03-16  Georgi Gerganov  context : fix init of n_outputs (#12397)
2025-03-16  Daniel Bevenius  ci : add --symlinks to xcframework zip command (#12409)
2025-03-15  marcoStocchi  llama-tts : add '-o' option (#12398)
2025-03-15  aubreyli  SYCL: Delete redundant plus sign and space (#12391)
2025-03-15  fairydreaming  SYCL : support non-contiguous tensors in binary ops...
2025-03-15  Chenguang Li  [CANN]MUL_MAT optimization (#12382)
2025-03-14  Eric Curtin  Add CLI arg to llama-run to adjust the number of thread...
2025-03-14  Sigbjørn Skjæret  main : add -sysf / --system-prompt-file (#12249) (...
2025-03-14  fairydreaming  Load all MoE experts during warmup (#11571)
2025-03-14  Victor  server: fix "--grammar-file" parameter (#12285)
2025-03-14  Georgi Gerganov  graph : simplify attn input build for unified KV cache...
2025-03-14  Georgi Gerganov  hparams : add SWA rope parameters (#12374)
2025-03-13  Georgi Gerganov  llama : fix Gemma3 SWA KV cache shift (#12373)
2025-03-13  Xuan-Son Nguyen  arg : no n_predict = -2 for examples except for main...
2025-03-13  Georgi Gerganov  llama : refactor llama_context, llama_kv_cache, llm_bui...
2025-03-13  Ishaan Gandhi  server : fix crash when using verbose output with input...