git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-10-23  compilade  convert : handle pre-quantized models (#14810)
2025-10-23  Johannes Gäßler  server: add memory breakdown print (#16740)
2025-10-23  Julien Denize  convert : Make mistral-common dependency optional ...
2025-10-23  Xuan-Son Nguyen  mtmd-cli : allow using --jinja (#16718)
2025-10-23  Prajwal B Mehendarkar  Manually link -lbsd to resolve flock symbol on AIX...
2025-10-23  Aman Gupta  ggml-cuda: use passed ops instead of hardcoded ops...
2025-10-23  matteo  server : send partial stop string when <EOG> is reached...
2025-10-23  Matthew Michel  sycl: use async memory allocation to fix crashes during...
2025-10-22  Max Krasnyansky  Add experimental ggml-hexagon backend for the Hexagon...
2025-10-22  Diego Devesa  Revert "ggml : Leverage the existing GGML_F32_VEC helpe...
2025-10-22  Pascal  webui: introduce OpenAI-compatible model selector in...
2025-10-22  sirus20x6  ggml : Leverage the existing GGML_F32_VEC helpers to...
2025-10-22  Acly  tests : fix test-thread-safety when compiling with...
2025-10-22  Aman Gupta  CUDA: fix bug in topk-moe softmax (#16711)
2025-10-21  Aman Gupta  CUDA: topk-moe: add optional parameter for gpt-oss...
2025-10-21  Johannes Gäßler  CUDA: better error for FA kernel with 0 occupancy ...
2025-10-21  Aman Gupta  ggml: add ggml_can_fuse_subgraph (#16662)
2025-10-21  lhez  opencl: fix warnings and clean up profiling (#16688)
2025-10-21  Jeff Bolz  vulkan: Handle FA with all -inf mask values (#16447)
2025-10-20  YehuditE  sycl : add PAD_REFLECT_D1 operator support (#16145)
2025-10-20  Sigbjørn Skjæret  model : add BailingMoeV2 support (#16063)
2025-10-20  Aleksander...  Handle legacy 'context' attachments (#16687)
2025-10-20  Diego Devesa  ggml-alloc : fix leak when reusing a tensor with a...
2025-10-20  Aleksander...  Prevent premature submission on IME input (#16673)
2025-10-20  Aleksander...  Import/Export UX improvements (#16619)
2025-10-20  Aleksander...  Enable per-conversation loading states to allow having...
2025-10-20  takuya kodama  llama-batch: fix build fails with `-Werror=missing...
2025-10-20  Ron Evans  readme: update bindings (#16651)
2025-10-20  safranowith  SYCL: Add support for FLOOR,CEIL,ROUND and TRUNC unary...
2025-10-20  takuya kodama  llama-context: only warn on pooling_type when user...
2025-10-19  Giuseppe Scrivano  model : add Granite Hybrid types (#16635)
2025-10-19  Aaron Teo  ci : fix binaries release failure for s390x (binaries...
2025-10-19  Sigbjørn Skjæret  ci : avoid manual updates of docs/ops.md (#16663)
2025-10-19  Aaron Teo  ci: include s390x release binaries (#16648)
2025-10-19  Aman Gupta  CODEOWNERS: update for ggml-cuda/mmf (#16660)
2025-10-18  Johannes Gäßler  HIP: fix GPU_TARGETS (#16642)
2025-10-18  Jeff Bolz  vulkan: Implement topk_moe fused shader, ported from...
2025-10-18  Aman Gupta  CUDA: use registers instead of smem in topk-moe (#16647)
2025-10-18  Shawn Gu  opencl: transposed gemm/gemv moe kernel with mxfp4...
2025-10-17  Johannes Gäßler  llama-model: fix insonsistent ctxs <-> bufs order ...
2025-10-17  Radoslav Gerganov  rpc : report actual free memory (#16616)
2025-10-17  Giuseppe Scrivano  vulkan: Add State Space Model (SSM) Operations Support...
2025-10-17  muggle-stack  ggml : fix SpaceMit IME array out-of-bounds in task...
2025-10-17  Pascal  webui: reorganize settings layout (#16607)
2025-10-17  Jeff Bolz  vulkan: fix debug build (add_rms_len/data not found...
2025-10-17  Ilia Ilmer  metal : add `CONV_TRANSPOSE_2D` (#16542)
2025-10-17  Olivier Chafik  grammar : use int64_t to avoid int overflows in int...
2025-10-17  GittyBurstein  SYCL SET operator optimized for F32 tensors (#16350)
2025-10-16  Xuan-Son Nguyen  mtmd : support home-cooked Mistral Small Omni (#14928)
2025-10-16  Pascal  fix: added a normalization step for MathJax-style ...
2025-10-16  GittyBurstein  sycl : add ARANGE operator (#16362)
2025-10-16  Chenguang Li  CANN: format code using .clang-format (#15863)
2025-10-16  takasurazeem  common : Update the docs on -t --threads (#16236)
2025-10-16  takuya kodama  ggml-cpu: replace putenv with setenv for const-correctn...
2025-10-16  yael-works  SYCL: Add GGML_OP_MEAN operator support (#16009)
2025-10-15  Aleksei Nikiforov  gguf-py : add support for endian conversion of BF16...
2025-10-15  safranowith  cpu : add FLOOR, CEIL, ROUND and TRUNC unary operators...
2025-10-15  lhez  opencl: add q8_0 mm support (#16469)
2025-10-15  lhez  opencl: fix FA for f32 (#16584)
2025-10-15  Aleksander...  Add server-driven parameter defaults and syncing (...
2025-10-15  Sam/Samuel  metal: optimise `GGML_OP_SUM` (#16559)
2025-10-15  Georgi Gerganov  server : fix img token logs (#16595)
2025-10-15  Xuan-Son Nguyen  llama-quant: add support for mmproj (#16592)
2025-10-15  Julius Tischbein  CUDA: Changing the CUDA scheduling strategy to spin...
2025-10-15  Georgi Gerganov  server : fix mtmd checkpoints (#16591)
2025-10-14  Georgi Gerganov  metal : avoid using Metal's gpuAddress property (#16576)
2025-10-14  SavicStefan  vulkan: Add ACC_TYPE_VEC2 implementation (#16203)  [upstream/0.0.6764]
2025-10-14  Aman Gupta  CUDA + openCL: fix bug in accessing rms_norm->src while...
2025-10-14  Jeff Bolz  vulkan: Support FA with K/V in F32 (#16543)
2025-10-14  Jeff Bolz  vulkan: Improve build time for MSVC (#16545)
2025-10-14  Johannes Gäßler  CUDA: enable FA for FP32 KV cache (#16546)
2025-10-14  Aman Gupta  CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557)
2025-10-14  Aman Gupta  CUDA: add fp kernel for larger batch size MoE (#16512)
2025-10-14  Anav Prasad  cuda : remove legacy copy-op pointer indirection code...
2025-10-14  Georgi Gerganov  server : dynamic token limit for prompt cache (#16560)
2025-10-13  Georgi Gerganov  metal : FA support F32 K and V and head size = 32 ...
2025-10-13  Georgi Gerganov  graph : support cacheless embeddings with FA and iSWA...
2025-10-13  lhez  opencl: fix build targeting CL 2 (#16554)
2025-10-13  Johannes Gäßler  CUDA: fix numerical issues in tile FA kernel (#16540)
2025-10-13  Jie Fu (傅杰)  ggml : fix build broken with -march=armv9-a on MacOS...
2025-10-13  Chenguang Li  CANN: fix CPU memory leak in CANN backend (#16549)
2025-10-13  Pascal  fix: add remark plugin to render raw HTML as literal...
2025-10-13  Sam/Samuel  metal: add support for opt_step_sgd (#16539)
2025-10-13  Georgi Gerganov  ggml : fix scalar path for computing norm (#16558)
2025-10-13  hipudding  CANN: Update several operators to support FP16 data...
2025-10-12  Sam/Samuel  metal : add opt_step_adamw and op_sum (#16529)
2025-10-12  Pascal  webui: remove client-side context pre-check and rely...
2025-10-12  Neo Zhang Jianyu  [SYCL] fix UT fault cases: count-equal, argsort, pad...
2025-10-12  Mathieu Baudier  ci : add Vulkan on Ubuntu with default packages build...
2025-10-12  Aldehir Rojas  common : handle unicode during partial json parsing...
2025-10-12  Georgi Gerganov  common : update presets (#16504)
2025-10-12  sirus20x6  ggml : Fix FP16 ELU positive branch (#16519)
2025-10-12  Daniel Bevenius  hparams : add check for layer index in is_recurrent...
2025-10-12  sirus20x6  ggml: Correct SVE implementation in ggml_vec_dot_f16_un...
2025-10-11  Johannes Gäßler  CUDA: faster tile FA, add oob checks, more HSs (#16492)
2025-10-11  Georgi Gerganov  metal : fix mul-mm condition + fix mul-mv permuted...
2025-10-11  Pascal  feat: render user content as markdown option (#16358)
2025-10-11  Yann Follet  server / ranking : add sorting and management of top_n...
2025-10-11  Diego Devesa  cuda : avoid initializing unused devices (#16510)
2025-10-11  amirai21  convert : correctly handle LLaMA tokenizer for Jamba...