]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-03-26 Georgi Gerganovggml : fix MUL_MAT_ID repack with Q8_K (#12544)
2025-03-26 R0CKSTARdoc: [MUSA] minor changes (#12583)
2025-03-25 Sigbjørn Skjæretconvert: fix Mistral3/Gemma3 model hparams init (#12571)
2025-03-25 Eric Curtinrun: de-duplicate fmt and format functions and optimize...
2025-03-25 Dan Johanssonggml-cpu : update KleidiAI to v1.5.0 (#12568)
2025-03-25 Akarshan BiswasSYCL: disable Q4_0 reorder optimization (#12560)
2025-03-25 Dan Johanssondocs : add build instructions for KleidiAI (#12563)
2025-03-25 R0CKSTARci: [MUSA] add CI and update doc (#12562)
2025-03-25 Georgi Gerganovcontext : fix worst-case reserve outputs (#12545)
2025-03-24 Akarshan Biswasci: [SYCL] ggml-ci Use main GPU and enable sysman ...
2025-03-24 lhezopencl: simplify kernel embedding logic in cmakefile...
2025-03-24 Akarshan BiswasCI: fix SYCL build (#12546)
2025-03-24 Tei Homedocs: update: improve the Fedoa CUDA guide (#12536)
2025-03-24 compiladellama-vocab : add SuperBPE pre-tokenizer (#12532)
2025-03-24 R0CKSTARCUDA: Fix clang warnings (#12540)
2025-03-24 Prajwal B Mehendarkarmmap : skip resource limit checks on AIX (#12541)
2025-03-24 Jeff Bolzvulkan: fix mul_mat_vec failure in backend tests (...
2025-03-23 Marius Gerdesserver : Add verbose output to OAI compatible chat...
2025-03-23 Lars Sonchocky... install : add macports (#12518)
2025-03-22 Xuan-Son Nguyenllama : gemma3 : use output tensor if it exists in...
2025-03-22 Georgi Gerganovggml : fix quantized cpy op (#12310)
2025-03-22 R0CKSTARmusa: refine compute capability (#12493)
2025-03-22 Jeff Bolzvulkan: Optimize mul_mat_vec p021 and nc shaders (...
2025-03-21 stduhpfVulkan: RTE rounding for cpy to quant (#12480)
2025-03-21 Evevulkan: workaround for AMD Windows driver 16 bit unpack...
2025-03-21 Georgi Gerganovmodel : do not repack if a GPU device is present (...
2025-03-21 Sigbjørn Skjæretchore : cleanup llama_model_loader::TENSOR_ usage ...
2025-03-21 marcoStocchillama-tts : avoid crashes related to bad model file...
2025-03-21 蕭澧邦[SYCL] Fix build on Windows when ccache enabled (#9954...
2025-03-21 Svetlozar Georgievsycl: cleanup oneDNN related code (#12097)
2025-03-20 Woof Dogwebui : Prevent rerendering on textarea input (#12299)
2025-03-20 Sigbjørn Skjæretllama : make Qwen2MoE QKV bias optional (#12477)
2025-03-20 Srihari-mcwggml : block interleaving support for Q4_K quantization...
2025-03-20 Bartowskiconvert : avoid calls to tokenizer.added_tokens_decoder...
2025-03-19 fairydreamingcontext : clear sets containing encoder output sequence...
2025-03-19 Gaurav GargCUDA: Improve flash decoding kernel GPU occupancy for...
2025-03-19 Jeff Bolzvulkan: optimize iq1 coopmat2 dequant functions (#12427)
2025-03-19 Guus WaalsFix visionOS build and add CI (#12415)
2025-03-19 Sigbjørn Skjæretllama : add support for GPT2, Bloom and CodeShell tied...
2025-03-19 Sigbjørn Skjæretconvert : Support chat_template.json (#12460)
2025-03-19 Jeff Bolzvulkan: Submit once enough matmul work has been recorde...
2025-03-18 lhezopencl: improve profiling (#12442)
2025-03-18 Georgi Gerganovgraph : normalize Q, K, V shapes + sync cross attention...
2025-03-18 R0CKSTARmusa: override warp_size of musa device to 32 (#12445)
2025-03-18 Xuan-Son Nguyenllama : support converting Mistral Small text-only...
2025-03-18 Georgi Gerganovspeculative : fix seg fault in certain cases (#12454)
2025-03-18 Xuan-Son Nguyenllama : add support for EXAONE tied word embeddings...
2025-03-18 Georgi Gerganovcontext : always use non-causal attention for encoder...
2025-03-18 Łukasz ŚlusarczykSYCL: using graphs is configurable by environment varia...
2025-03-18 Georgi Gerganovserver : fix warmup draft cache type (#12446)
2025-03-18 Prajwal B Mehendarkarcmake : fix PowerPC build (#12241)
2025-03-18 fj-y-saitoggml : add SVE support for q6_K_q8_K (#12361)
2025-03-18 0cc4mVulkan: Default to 1GB allocations instead of 4GB to...
2025-03-18 Łukasz Ślusarczykfixed compilation warnings in ggml-sycl (#12424)
2025-03-17 Molly Sophiallama: Add support for RWKV v7 architecture (#12412)
2025-03-17 Sigbjørn Skjæretdocs : bring llama-cli conversation/template docs up...
2025-03-17 Gaurav Gargcuda : enable CUDA Graph on CUDA Toolkit < 12.x (#12394)
2025-03-17 Guus Waalsggml-vulkan: remove unused find_program(glslc) (#12416)
2025-03-17 Jeff Bolzvulkan: Add N/2 and N/4 optimized paths in coopmat2...
2025-03-17 Danielevulkan: subgroup size tuning (#12087)
2025-03-17 Jeff Bolzvulkan: use fp32 in coopmat2 q4_k dequant function...
2025-03-17 Jeff Bolzvulkan: Pad N dimension of B matrix for coopmat2 perf...
2025-03-17 Jeff Bolzvulkan: Adjust coopmat2 tile sizes and selection heuris...
2025-03-17 Christian Kastnercmake : enable building llama.cpp using system libggml...
2025-03-17 Akarshan BiswasSYCL: set extras only on GGML_TYPE_Q4_0 (#12366)
2025-03-16 Sigbjørn Skjæretllama : fix OLMo-2-0325-32B-Instruct K-norm size (...
2025-03-16 Georgi Gerganovcontext : fix init of n_outputs (#12397)
2025-03-16 Daniel Beveniusci : add --symlinks to xcframework zip command (#12409)
2025-03-15 marcoStocchillama-tts : add '-o' option (#12398)
2025-03-15 aubreyliSYCL: Delete redundant plus sign and space (#12391)
2025-03-15 fairydreamingSYCL : support non-contiguous tensors in binary ops...
2025-03-15 Chenguang Li[CANN]MUL_MAT optimization (#12382)
2025-03-14 Eric CurtinAdd CLI arg to llama-run to adjust the number of thread...
2025-03-14 Sigbjørn Skjæretmain : add -sysf / --system-prompt-file (#12249) (...
2025-03-14 fairydreamingLoad all MoE experts during warmup (#11571)
2025-03-14 Victorserver: fix "--grammar-file" parameter (#12285)
2025-03-14 Georgi Gerganovgraph : simplify attn input build for unified KV cache...
2025-03-14 Georgi Gerganovhparams : add SWA rope parameters (#12374)
2025-03-13 Georgi Gerganovllama : fix Gemma3 SWA KV cache shift (#12373)
2025-03-13 Xuan-Son Nguyenarg : no n_predict = -2 for examples except for main...
2025-03-13 Georgi Gerganovllama : refactor llama_context, llama_kv_cache, llm_bui...
2025-03-13 Ishaan Gandhiserver : fix crash when using verbose output with input...
2025-03-12 Oscar BarenysUpdate build.yml for Windows Vulkan builder to use...
2025-03-12 Daniel Beveniusllama.swiftui : fix xcframework dir in README [no ci...
2025-03-12 Alberto Cabrera... sycl : variable sg_size support for mmvq kernels (...
2025-03-12 uvosCUDA/HIP: Fix fattn-vec-* when device warp size is...
2025-03-12 Xuan-Son Nguyenllama : Add Gemma 3 support (+ experimental vision...
2025-03-12 Jeff Bolzvulkan: fix bug in coopmat1 mul_mat_id (#12316)
2025-03-11 uvosCUDA/HIP: refractor mmqv to unify the calculation of...
2025-03-11 jklincnggml-backend : fix backend search path (#12330)
2025-03-11 BB-fatmetal : Cache the Metal library at the device context...
2025-03-11 Xuan-Son Nguyenclip : bring back GPU support (#12322)
2025-03-10 Evemat vec double buffer (#12188)
2025-03-10 R0CKSTARmusa: support new arch mp_31 and update doc (#12296)
2025-03-10 Henry Linjamäkiopencl: use OpenCL C standard supported by the device...
2025-03-10 John Beanreadme: added Sidekick to available UIs (#12311)
2025-03-10 Georgi Gerganovtests : fix test-quantize-fns to init the CPU backend...
2025-03-10 marcoStocchicommon : refactor '-o' option (#12278)
2025-03-10 Olivier Chafik`server`: extract <think> tags from qwq outputs (#12297)
2025-03-10 Olivier Chafik`tool-call`: ensure there's always a non-empty tool...
next