]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-10-31 Piotr Wilkin... model : Minimax M2 (#16831)
2025-10-31 Giuseppe Scrivanomodel : add Granite Hybrid nano types (#16896)
2025-10-31 Johannes GäßlerCUDA: Volta tensor core support for MMF (#16843)
2025-10-31 Georgi Gerganovsync : ggml
2025-10-31 Aman GuptaCUDA: add expert reduce kernel (#16857)
2025-10-31 Georgi Gerganovbatch : fix consistency checks for the input positions...
2025-10-31 Georgi Gerganovserver : don't print user inputs to console (#16871)
2025-10-31 Daniel Beveniusserver : fix typos in server.cpp comments [no ci] ...
2025-10-31 Jeff Bolzvulkan: disable spirv-opt for rope shaders (#16872)
2025-10-31 Masato Nakasakavulkan: Fix crash when FP16 mul_mat accumulation is...
2025-10-31 Ruben Ortlamvulkan: fix shmem overrun in mmq id shader (#16873)
2025-10-31 l3utterflyggml-hexagon: respect input size when getting/setting...
2025-10-30 Sigbjørn Skjæretci : enable free-disk-space on cuda docker build (...
2025-10-30 lhezopencl: fix boundary handling for mul_mm (#16875)
2025-10-30 RodriMoraconvert : update transformers requirements (#16866)
2025-10-30 chansikparkserver : bump request URI max length to 32768 (#16862)
2025-10-30 Georgi Gerganovserver : remove n_past (#16818)
2025-10-30 Max Krasnyanskycpu: introduce chunking for repack matmuls and enable...
2025-10-30 Shagun Beracommon: fix typo in cli help text (#16864)
2025-10-30 JJJYmmmmodel: add support for qwen3vl series (#16780)
2025-10-30 Max Krasnyanskycpu: introduce chunking for flash attention (#16829)
2025-10-30 Tianyue-Zhaomodel: Add support for CogVLM model (#15002)
2025-10-30 Sigbjørn Skjæretcuda : fix argsort with 64k+ rows (#16849)
2025-10-30 Jan Boonllama : use std::abs instead of abs (#16853)
2025-10-30 Jeff Bolzvulkan: Handle argsort with a large number of rows...
2025-10-30 Oliver SimonsHide latency of bias and gate-loading (#16847)
2025-10-29 Jeff Bolzvulkan: Fuse rope+set_rows (#16769)
2025-10-29 Xuan-Son Nguyenllama: fix ASAN error with M-RoPE (#16848)
2025-10-29 Xuan-Son Nguyenllama: store mrope data in KV cell (#16825)
2025-10-29 Jeff Bolzvulkan: Update topk_moe fusion to handle gpt's late...
2025-10-29 Ruben OrtlamVulkan MMQ Integer Dot Refactor and K-Quant support...
2025-10-29 Max KrasnyanskyHexagon Op queue & dispatch optimizations (#16820)
2025-10-29 Aman GuptaCUDA: use fastdiv in set-rows (#16834)
2025-10-29 Sigbjørn Skjæretvendor : sync minja (#16500)
2025-10-29 Jeff Bolzvulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffe...
2025-10-29 Aman GuptaCUDA: Fix bug in topk-moe for gpt-oss (#16821)
2025-10-29 YaelLogicsycl: add RMS_NORM_BACK operation support (#16808)
2025-10-28 YaelGitAccountcuda: add SET operation support (#16804)
2025-10-28 Georgi Gerganovmemory : remove KV cache size padding (#16812)
2025-10-28 Georgi Gerganovllama-bench : clarify benchmarked parts of the computat...
2025-10-28 l3utterflyinitialise buffer.device in ggml_hexagon_session (...
2025-10-28 Sam Malayekembedding: add raw option for --embd-output-format...
2025-10-28 Johannes Gäßlerllama: consistent ctx <-> buf order for KV cache (...
2025-10-28 Aldehir Rojasgrammar : support array references in json schema ...
2025-10-28 Chenguang LiCANN: Improve device ID handling and aclnnArange checks...
2025-10-28 Aman GuptaCUDA: add unused vars to mmvf and mmvq (#16807)
2025-10-28 tamarPalsycl: add SSM_CONV operation support (#16800)
2025-10-27 Yuri Khrustalevchat: Add LFM2 tool handling (#16763)
2025-10-27 Xuan-Son Nguyenmtmd : fix idefics3 preprocessing (#16806)
2025-10-27 Diego Devesallama : disable pipeline parallelism if compute buffer...
2025-10-27 Aclyggml : fix interpolate with align-corners and ne=1...
2025-10-27 Johannes GäßlerHIP: fix AMDGPU_TARGETS, update documentation (#16803)
2025-10-27 Xuan-Son Nguyenmodel : add LightOnOCR-1B model (#16764)
2025-10-27 Johannes Gäßlerllama: fix leaked buffers for mmap + split files (...
2025-10-27 Aman Guptatest-backend-ops: print failed tests at the end (#16785)
2025-10-27 tamarPalsycl: add ROLL operation support (#16665)
2025-10-27 shani-fsycl: add REPEAT_BACK operation support (#16734)
2025-10-27 Aman GuptaCUDA: support for weight clamp in top-k norm (#16702)
2025-10-26 Aclyggml-alloc : make gallocr prefer chunks that allow...
2025-10-26 Sigbjørn Skjæretcuda : use fast copy when src and dst are of different...
2025-10-26 leejetggml: fix cuda kernel launch configuration for k_comput...
2025-10-26 Sigbjørn Skjæretconvert : enable expert group selection for all models...
2025-10-26 Sigbjørn Skjæretgraph : add clamping to ffn_moe_weights_sum to avoid...
2025-10-26 Sigbjørn Skjæretmodel : set res->t_embd in SmallThinker models (#16782)
2025-10-26 amirai21docs : add Jamba to Text-only models list (#16778)
2025-10-26 Aman GuptaCUDA: General GEMV fusion (#16715)
2025-10-26 Gilad S.vulkan: deduplicate Microsoft Direct3D12 devices (...
2025-10-25 Galunidconvert : handle mmproj filename/path properly (#16760)
2025-10-25 Shunta Saitomodel : set res->t_embd in PLaMo2 models (#16766)
2025-10-25 Giuseppe Scrivanovulkan: delete dead code (#16732)
2025-10-25 Jeff Bolzvulkan: Optimize SSM_SCAN (#16645)
2025-10-25 compiladeconvert : avoid dequantizing mxfp4 for GPT-OSS (#16756)
2025-10-24 leejetggml: fix CUDA grid launch condition for large block_nu...
2025-10-24 Aman GuptaCUDA: use CUB for arbitary size argsort (#16754)
2025-10-24 Florian Badiewebui: support q URL parameter (#16728)
2025-10-24 Daniel Beveniusmodel-conversion : add trust_remote_code for orig model...
2025-10-23 compiladeconvert : handle pre-quantized models (#14810)
2025-10-23 Johannes Gäßlerserver: add memory breakdown print (#16740)
2025-10-23 Julien Denizeconvert : Make mistral-common dependency optional ...
2025-10-23 Xuan-Son Nguyenmtmd-cli : allow using --jinja (#16718)
2025-10-23 Prajwal B MehendarkarManually link -lbsd to resolve flock symbol on AIX...
2025-10-23 Aman Guptaggml-cuda: use passed ops instead of hardcoded ops...
2025-10-23 matteoserver : send partial stop string when <EOG> is reached...
2025-10-23 Matthew Michelsycl: use async memory allocation to fix crashes during...
2025-10-22 Max KrasnyanskyAdd experimental ggml-hexagon backend for the Hexagon...
2025-10-22 Diego DevesaRevert "ggml : Leverage the existing GGML_F32_VEC helpe...
2025-10-22 Pascalwebui: introduce OpenAI-compatible model selector in...
2025-10-22 sirus20x6ggml : Leverage the existing GGML_F32_VEC helpers to...
2025-10-22 Aclytests : fix test-thread-safety when compiling with...
2025-10-22 Aman GuptaCUDA: fix bug in topk-moe softmax (#16711)
2025-10-21 Aman GuptaCUDA: topk-moe: add optional parameter for gpt-oss...
2025-10-21 Johannes GäßlerCUDA: better error for FA kernel with 0 occupancy ...
2025-10-21 Aman Guptaggml: add ggml_can_fuse_subgraph (#16662)
2025-10-21 lhezopencl: fix warnings and clean up profiling (#16688)
2025-10-21 Jeff Bolzvulkan: Handle FA with all -inf mask values (#16447)
2025-10-20 YehuditEsycl : add PAD_REFLECT_D1 operator support (#16145)
2025-10-20 Sigbjørn Skjæretmodel : add BailingMoeV2 support (#16063)
2025-10-20 Aleksander... Handle legacy 'context' attachments (#16687)
2025-10-20 Diego Devesaggml-alloc : fix leak when reusing a tensor with a...
2025-10-20 Aleksander... Prevent premature submission on IME input (#16673)
next