git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-10-17  GittyBurstein  SYCL SET operator optimized for F32 tensors (#16350)
2025-10-16  Xuan-Son Nguyen  mtmd : support home-cooked Mistral Small Omni (#14928)
2025-10-16  Pascal  fix: added a normalization step for MathJax-style ...
2025-10-16  GittyBurstein  sycl : add ARANGE operator (#16362)
2025-10-16  Chenguang Li  CANN: format code using .clang-format (#15863)
2025-10-16  takasurazeem  common : Update the docs on -t --threads (#16236)
2025-10-16  takuya kodama  ggml-cpu: replace putenv with setenv for const-correctn...
2025-10-16  yael-works  SYCL: Add GGML_OP_MEAN operator support (#16009)
2025-10-15  Aleksei Nikiforov  gguf-py : add support for endian conversion of BF16...
2025-10-15  safranowith  cpu : add FLOOR, CEIL, ROUND and TRUNC unary operators...
2025-10-15  lhez  opencl: add q8_0 mm support (#16469)
2025-10-15  lhez  opencl: fix FA for f32 (#16584)
2025-10-15  Aleksander...  Add server-driven parameter defaults and syncing (...
2025-10-15  Sam/Samuel  metal: optimise `GGML_OP_SUM` (#16559)
2025-10-15  Georgi Gerganov  server : fix img token logs (#16595)
2025-10-15  Xuan-Son Nguyen  llama-quant: add support for mmproj (#16592)
2025-10-15  Julius Tischbein  CUDA: Changing the CUDA scheduling strategy to spin...
2025-10-15  Georgi Gerganov  server : fix mtmd checkpoints (#16591)
2025-10-14  Georgi Gerganov  metal : avoid using Metal's gpuAddress property (#16576)
2025-10-14  SavicStefan  vulkan: Add ACC_TYPE_VEC2 implementation (#16203) upstream/0.0.6764
2025-10-14  Aman Gupta  CUDA + openCL: fix bug in accessing rms_norm->src while...
2025-10-14  Jeff Bolz  vulkan: Support FA with K/V in F32 (#16543)
2025-10-14  Jeff Bolz  vulkan: Improve build time for MSVC (#16545)
2025-10-14  Johannes Gäßler  CUDA: enable FA for FP32 KV cache (#16546)
2025-10-14  Aman Gupta  CUDA: use fastdiv + ggml_cuda_mad for mmvf (#16557)
2025-10-14  Aman Gupta  CUDA: add fp kernel for larger batch size MoE (#16512)
2025-10-14  Anav Prasad  cuda : remove legacy copy-op pointer indirection code...
2025-10-14  Georgi Gerganov  server : dynamic token limit for prompt cache (#16560)
2025-10-13  Georgi Gerganov  metal : FA support F32 K and V and head size = 32 ...
2025-10-13  Georgi Gerganov  graph : support cacheless embeddings with FA and iSWA...
2025-10-13  lhez  opencl: fix build targeting CL 2 (#16554)
2025-10-13  Johannes Gäßler  CUDA: fix numerical issues in tile FA kernel (#16540)
2025-10-13  Jie Fu (傅杰)  ggml : fix build broken with -march=armv9-a on MacOS...
2025-10-13  Chenguang Li  CANN: fix CPU memory leak in CANN backend (#16549)
2025-10-13  Pascal  fix: add remark plugin to render raw HTML as literal...
2025-10-13  Sam/Samuel  metal: add support for opt_step_sgd (#16539)
2025-10-13  Georgi Gerganov  ggml : fix scalar path for computing norm (#16558)
2025-10-13  hipudding  CANN: Update several operators to support FP16 data...
2025-10-12  Sam/Samuel  metal : add opt_step_adamw and op_sum (#16529)
2025-10-12  Pascal  webui: remove client-side context pre-check and rely...
2025-10-12  Neo Zhang Jianyu  [SYCL] fix UT fault cases: count-equal, argsort, pad...
2025-10-12  Mathieu Baudier  ci : add Vulkan on Ubuntu with default packages build...
2025-10-12  Aldehir Rojas  common : handle unicode during partial json parsing...
2025-10-12  Georgi Gerganov  common : update presets (#16504)
2025-10-12  sirus20x6  ggml : Fix FP16 ELU positive branch (#16519)
2025-10-12  Daniel Bevenius  hparams : add check for layer index in is_recurrent...
2025-10-12  sirus20x6  ggml: Correct SVE implementation in ggml_vec_dot_f16_un...
2025-10-11  Johannes Gäßler  CUDA: faster tile FA, add oob checks, more HSs (#16492)
2025-10-11  Georgi Gerganov  metal : fix mul-mm condition + fix mul-mv permuted...
2025-10-11  Pascal  feat: render user content as markdown option (#16358)
2025-10-11  Yann Follet  server / ranking : add sorting and management of top_n...
2025-10-11  Diego Devesa  cuda : avoid initializing unused devices (#16510)
2025-10-11  amirai21  convert : correctly handle LLaMA tokenizer for Jamba...
2025-10-10  Georgi Gerganov  server : fix division by zero when reporting stats...
2025-10-10  Georgi Gerganov  vocab : mark EOT token for Granite models (#16499)
2025-10-10  Radoslav Gerganov  server : return HTTP 400 if prompt exceeds context...
2025-10-10  Radoslav Gerganov  server : log requests to /v1/completions (#16495)
2025-10-10  Prajwal B Mehendarkar  cmake : Dont define XOPENSOURCE on AIX (#16481)
2025-10-09  Pascal  webui: updated the chat service to only include max_tok...
2025-10-09  duduta  cpu : optimize the ggml NORM operation (#15953)
2025-10-09  Georgi Gerganov  server : host-memory prompt caching (#16391)
2025-10-09  Pascal  No markdown in cot (#16483)
2025-10-09  Daniel Bevenius  model-conversion : add support for SentenceTransformers...
2025-10-09  sudhiarm  ci: add ARM64 Kleidiai build and test support (#16462)
2025-10-09  Chenguang Li  CANN: Improve ACL graph matching (#16166)
2025-10-09  Charles Xu  kleidiai: kernel interface refactoring (#16460)
2025-10-09  Neo Zhang Jianyu  [SYCL] refactor soft_max, add soft_max_back (#16472)
2025-10-09  Saba Fallah  model: EmbeddingGemma Adding Support for SentenceTransf...
2025-10-08  Pascal  refactor: centralize CoT parsing in backend for streami...
2025-10-08  ai-fonsi  Disable CUDA host buffers on integrated GPUs (#16308)
2025-10-08  issixx  server : fix cancel pending task (#16467)
2025-10-08  Georgi Gerganov  metal : mark FA blocks (#16372)
2025-10-08  Georgi Gerganov  server : improve context checkpoint logic (#16440)
2025-10-07  Reese Levine  ggml webgpu: profiling, CI updates, reworking of comman...
2025-10-07  Tarek Dakhran  llama : support LiquidAI LFM2-MoE hybrid model (#16464)
2025-10-07  Georgi Gerganov  server : add `/v1/health` endpoint (#16461)
2025-10-07  Sascha Rogmann  webui : added download action (#13552) (#16282)
2025-10-07  Georgi Gerganov  presets : fix pooling param for embedding models (...
2025-10-07  Radoslav Gerganov  rpc : update documentation (#16441)
2025-10-07  Georgi Gerganov  memory : use sequential equal splits for recurrent...
2025-10-07  Georgi Gerganov  metal : add support for non-padded FA KV (#16148)
2025-10-07  Georgi Gerganov  tests : add -INF blocks to the KQ mask in the FA tests...
2025-10-07  Georgi Gerganov  metal : various optimizations + refactoring (#16446)
2025-10-06  Gadflyii  llama : add --no-host to disable host buffers (#16310)
2025-10-06  Gabe Goodhart  chat : Granite Docling stopping (#16438)
2025-10-06  Sigbjørn Skjæret  ci : refactor sdk caching to minimize storage (#16414)
2025-10-06  Georgi Gerganov  ggml : fix unaligned access in AMX code (#16315)
2025-10-06  Daniel Bevenius  ci : remove missing reranker model files (#16444)
2025-10-06  Daniel Bevenius  ggml-cpu : fix leftover handling in ggml_vec_scale_f32...
2025-10-06  Yuannan  nix : removed metal for nix (#16118)
2025-10-06  Oleksandr Kuvshynov  server: update readme to mention n_past_max metric...
2025-10-05  Gabe Goodhart  model : Granite docling + Idefics3 preprocessing (SmolV...
2025-10-05  Reese Levine  ggml webgpu: actually add softmax, fix rms_norm offset...
2025-10-04  Eve  vulkan: use a more appropriate amount of threads when...
2025-10-04  Radoslav Gerganov  rpc : check src buffer when copying tensor (#16421)
2025-10-04  Radoslav Gerganov  rpc : add support for multiple devices (#16276)
2025-10-04  Acly  vulkan : incremental shader builds (#16341)
2025-10-03  Pascal  chat : support Magistral thinking (#16413)
2025-10-03  ddh0  server : context checkpointing for hybrid and recurrent...
2025-10-03  Georgi Gerganov  metal : fix loop bound in ggml_mem_ranges (#16412)