]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-03-30 Georgi Gerganovmetal : use constexpr in FA kernels + fix typedef ...
2025-03-30 Juyoung Sukllama : add Trillion 7B model support (#12556)
2025-03-30 Sergei Vorobyovllama-chat : Add Yandex instruct model template support...
2025-03-30 R0CKSTARmusa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNIN...
2025-03-30 Georgi Gerganovsync : ggml
2025-03-30 Xuan-Son Nguyencpu : rm unused variable (ggml/1166)
2025-03-30 cmdr2cpu: de-duplicate some of the operators and refactor...
2025-03-30 Daniel Beveniusggml : add logging for native build options/vars (whisp...
2025-03-30 Daniel Beveniusexamples : command.wasm updates (whisper/2904)
2025-03-29 Xuan-Son Nguyenllama : fix non-causal mask for gemma 3 (#12615)
2025-03-29 Djip007llama : change cpu_buft_list order: ACCEL -> GPU host...
2025-03-29 Jaycmake : fix ccache conflict (#12522)
2025-03-29 hipuddingCANN : remove clang-format in ggml-cann (#12607)
2025-03-28 Sigbjørn Skjæretllama : fix incorrect Qwen2Moe ffn_moe_out graph callba...
2025-03-28 Georgi Gerganovmetal : improve FA + improve MoE (#12612)
2025-03-28 Icenowy Zhengvulkan: fix coopmat shader generation when cross-compil...
2025-03-28 Johannes Gäßlerllama: fix error on bad grammar (#12628)
2025-03-28 Benson Wongserver : include speculative decoding stats when timing...
2025-03-28 Radoslav Gerganovrpc : update README for cache usage (#12620)
2025-03-28 amritahs-ibmllamafile : ppc64le GEMV forwarding for FP32. (#12594)
2025-03-28 Radoslav Gerganovrpc : send hash when tensor data is above some fixed...
2025-03-27 Piotrserver : Support listening on a unix socket (#12613)
2025-03-27 Georgi Gerganovmedia : add SVG logo [no ci] (#12616)
2025-03-27 lhezopencl: add multi and vision rope, `gelu_quick` and...
2025-03-27 Si1wllama : add PLM GGUF Conversion & Inference Support...
2025-03-27 HighDopingmodel : restore support for T5Encoder (#12590)
2025-03-27 Csaba Kecskemeticonvert : Support Qwen2_5_VLForConditionalGeneration...
2025-03-27 Georgi Gerganovsync : ggml
2025-03-27 Georgi Gerganovscripts : update sync + fix cmake merge
2025-03-27 Georgi Gerganovsync : ggml
2025-03-27 Georgi Gerganovcmake : sync/merge PowerPC build commands (#0)
2025-03-27 amritahs-ibmllamafile : ppc64le MMA implementation for Q4_0. (...
2025-03-27 xctanggml : riscv: add 128-bit RVV support (#12530)
2025-03-27 Georgi Gerganovllama : make loras compatible with repacking (#12593)
2025-03-27 Akarshan BiswasSYCL: implement memset ggml backend buffer interface...
2025-03-26 Slobodan JosicHIP: Add support for RDNA4 targets (#12372)
2025-03-26 Georgi Gerganovmetal : refactor mat-vec code (#12569)
2025-03-26 Michał Moskalupgrade to llguidance 0.7.10 (#12576)
2025-03-26 Ivy233clip: Fix llama-llava-clip-quantize-cli quantization...
2025-03-26 Georgi Gerganovconvert : fix squeeze for ssm_conv tensors (#12573)
2025-03-26 Georgi Gerganovggml : fix MUL_MAT_ID repack with Q8_K (#12544)
2025-03-26 R0CKSTARdoc: [MUSA] minor changes (#12583)
2025-03-25 Sigbjørn Skjæretconvert: fix Mistral3/Gemma3 model hparams init (#12571)
2025-03-25 Eric Curtinrun: de-duplicate fmt and format functions and optimize...
2025-03-25 Dan Johanssonggml-cpu : update KleidiAI to v1.5.0 (#12568)
2025-03-25 Akarshan BiswasSYCL: disable Q4_0 reorder optimization (#12560)
2025-03-25 Dan Johanssondocs : add build instructions for KleidiAI (#12563)
2025-03-25 R0CKSTARci: [MUSA] add CI and update doc (#12562)
2025-03-25 Georgi Gerganovcontext : fix worst-case reserve outputs (#12545)
2025-03-24 Akarshan Biswasci: [SYCL] ggml-ci Use main GPU and enable sysman ...
2025-03-24 lhezopencl: simplify kernel embedding logic in cmakefile...
2025-03-24 Akarshan BiswasCI: fix SYCL build (#12546)
2025-03-24 Tei Homedocs: update: improve the Fedoa CUDA guide (#12536)
2025-03-24 compiladellama-vocab : add SuperBPE pre-tokenizer (#12532)
2025-03-24 R0CKSTARCUDA: Fix clang warnings (#12540)
2025-03-24 Prajwal B Mehendarkarmmap : skip resource limit checks on AIX (#12541)
2025-03-24 Jeff Bolzvulkan: fix mul_mat_vec failure in backend tests (...
2025-03-23 Marius Gerdesserver : Add verbose output to OAI compatible chat...
2025-03-23 Lars Sonchocky... install : add macports (#12518)
2025-03-22 Xuan-Son Nguyenllama : gemma3 : use output tensor if it exists in...
2025-03-22 Georgi Gerganovggml : fix quantized cpy op (#12310)
2025-03-22 R0CKSTARmusa: refine compute capability (#12493)
2025-03-22 Jeff Bolzvulkan: Optimize mul_mat_vec p021 and nc shaders (...
2025-03-21 stduhpfVulkan: RTE rounding for cpy to quant (#12480)
2025-03-21 Evevulkan: workaround for AMD Windows driver 16 bit unpack...
2025-03-21 Georgi Gerganovmodel : do not repack if a GPU device is present (...
2025-03-21 Sigbjørn Skjæretchore : cleanup llama_model_loader::TENSOR_ usage ...
2025-03-21 marcoStocchillama-tts : avoid crashes related to bad model file...
2025-03-21 蕭澧邦[SYCL] Fix build on Windows when ccache enabled (#9954...
2025-03-21 Svetlozar Georgievsycl: cleanup oneDNN related code (#12097)
2025-03-20 Woof Dogwebui : Prevent rerendering on textarea input (#12299)
2025-03-20 Sigbjørn Skjæretllama : make Qwen2MoE QKV bias optional (#12477)
2025-03-20 Srihari-mcwggml : block interleaving support for Q4_K quantization...
2025-03-20 Bartowskiconvert : avoid calls to tokenizer.added_tokens_decoder...
2025-03-19 fairydreamingcontext : clear sets containing encoder output sequence...
2025-03-19 Gaurav GargCUDA: Improve flash decoding kernel GPU occupancy for...
2025-03-19 Jeff Bolzvulkan: optimize iq1 coopmat2 dequant functions (#12427)
2025-03-19 Guus WaalsFix visionOS build and add CI (#12415)
2025-03-19 Sigbjørn Skjæretllama : add support for GPT2, Bloom and CodeShell tied...
2025-03-19 Sigbjørn Skjæretconvert : Support chat_template.json (#12460)
2025-03-19 Jeff Bolzvulkan: Submit once enough matmul work has been recorde...
2025-03-18 lhezopencl: improve profiling (#12442)
2025-03-18 Georgi Gerganovgraph : normalize Q, K, V shapes + sync cross attention...
2025-03-18 R0CKSTARmusa: override warp_size of musa device to 32 (#12445)
2025-03-18 Xuan-Son Nguyenllama : support converting Mistral Small text-only...
2025-03-18 Georgi Gerganovspeculative : fix seg fault in certain cases (#12454)
2025-03-18 Xuan-Son Nguyenllama : add support for EXAONE tied word embeddings...
2025-03-18 Georgi Gerganovcontext : always use non-causal attention for encoder...
2025-03-18 Łukasz ŚlusarczykSYCL: using graphs is configurable by environment varia...
2025-03-18 Georgi Gerganovserver : fix warmup draft cache type (#12446)
2025-03-18 Prajwal B Mehendarkarcmake : fix PowerPC build (#12241)
2025-03-18 fj-y-saitoggml : add SVE support for q6_K_q8_K (#12361)
2025-03-18 0cc4mVulkan: Default to 1GB allocations instead of 4GB to...
2025-03-18 Łukasz Ślusarczykfixed compilation warnings in ggml-sycl (#12424)
2025-03-17 Molly Sophiallama: Add support for RWKV v7 architecture (#12412)
2025-03-17 Sigbjørn Skjæretdocs : bring llama-cli conversation/template docs up...
2025-03-17 Gaurav Gargcuda : enable CUDA Graph on CUDA Toolkit < 12.x (#12394)
2025-03-17 Guus Waalsggml-vulkan: remove unused find_program(glslc) (#12416)
2025-03-17 Jeff Bolzvulkan: Add N/2 and N/4 optimized paths in coopmat2...
2025-03-17 Danielevulkan: subgroup size tuning (#12087)
next