git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-09-05  Erik Scholz  gguf: gguf_writer refactor (#15691)
2025-09-05  Georgi Gerganov  kv-cache : fix SWA checks + disable cacheless iSWA...
2025-09-05  Daniel Bevenius  model-conversion : add --embeddings flag to modelcard...
2025-09-04  ExtReMLapin  chat : fixed crash when Hermes 2 <tool_call> had a...
2025-09-04  Piotr Wilkin...  chat : nemotron thinking & toolcalling support (#15676)
2025-09-04  Piotr Wilkin...  scripts : add Jinja tester PySide6 simple app (#15756)
2025-09-04  Daniel Bevenius  llama : add support for EmbeddingGemma 300m (#15798)
2025-09-04  Gabe Goodhart  metal : Add template specialization for mul_mm_id w...
2025-09-04  Daniel Bevenius  llama : set n_outputs to 1 to avoid 0 outputs mean...
2025-09-04  Chenguang Li  CANN: Refactor ND to NZ workspace to be per-device...
2025-09-04  Xuan-Son Nguyen  server: add exceed_context_size_error type (#15780)
2025-09-04  Eric Curtin  Document the new max GPU layers default in help (#15771)
2025-09-04  leejet  ggml: add ops for WAN video model (cuda && cpu) (#15669)
2025-09-04  hipudding  CANN: Fix precision issue on 310I DUO multi-devices...
2025-09-04  rmatif  opencl: add hs=40 to FA (#15758)
2025-09-04  Chenguang Li  CANN: fix acl_rstd allocation size in ggml_cann_rms_nor...
2025-09-03  Ruben Ortlam  vulkan: fix mmv subgroup16 selection (#15775)
2025-09-03  Jeff Bolz  vulkan: don't use std::string in load_shaders, to impro...
2025-09-03  Daniel Bevenius  vulkan : update ggml_vk_instance_validation_ext_availab...
2025-09-03  Shin-myoung...  ggml vulkan: add hardsigmoid and hardswish operations...
2025-09-03  Oliver Simons  CUDA: Optimize `rms_norm_f32` kernel and its fused...
2025-09-03  Daniel Bevenius  model-conversion : fix pyright errors (#15770)
2025-09-03  Georgi Gerganov  sampling : optimize dist sampler (#15704)
2025-09-03  Daniel Bevenius  llama : fix incorrect model type for Gemma 270M (...
2025-09-03  Daniel Bevenius  model-conversion : remove hardcoded /bin/bash shebangs...
2025-09-03  hipudding  CANN: Add RoPE contiguous check for 310I DUP device...
2025-09-03  xctan  ggml-cpu : optimize RVV kernels (#15720)
2025-09-03  Daniel Bevenius  model-conversion : add missing curl script [no ci]...
2025-09-03  hipudding  CANN: Mask unsupported TRANSPOSE_1D operator (#15733)
2025-09-03  Chenguang Li  CANN: Fix type float_t to float (#15736)
2025-09-02  SnA1lGo  fix: resolve unsigned int initialization warning for...
2025-09-02  Oliver Simons  chore: Update `.clang-format` to use `BinPackArguments...
2025-09-02  Johannes Gäßler  llama: -fa 1/0/-1 aliases for -fa on/off/auto (#15746)
2025-09-02  Ruben Ortlam  vulkan: fix shaders gen when no integer dot is availabl...
2025-09-02  hipudding  CANN: Resolve soft_max precision issue (#15730)
2025-09-02  Jeff Bolz  vulkan: Fix macro parameter order for f32 matmul shader...
2025-09-02  rmatif  opencl: add attn sinks support for FA kernels (#15706)
2025-09-02  Chenguang Li  CANN: Support eager execution mode under ACL graph...
2025-09-02  hipudding  CANN: Support ext_factor in rope (#15710)
2025-09-01  Johannes Gäßler  ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722)
2025-09-01  Gilad S.  vulkan: use memory budget extension to read memory...
2025-09-01  Jeff Bolz  vulkan: add missing clamps in new mul_mat_id paths...
2025-09-01  Ruben Ortlam  vulkan: disable large mmv subgroups on older Nvidia...
2025-09-01  s-goto-11  ggml: SVE support for exponential functions (#15145)
2025-09-01  Prashant Vithule  ggml: aarch64: Implement SVE F16 kernels for vector...
2025-09-01  Jie Fu (傅杰)  convert : remove redundant code (#15708)
2025-09-01  Ruben Ortlam  Vulkan: Add Integer Dot Product mul_mat_vec shader...
2025-09-01  Daniel Bevenius  ggml : WebGPU add TRANSPOSE and RESHAPE to supported...
2025-09-01  Jie Fu (傅杰)  docs : add Hunyuan to models section (#15707)
2025-09-01  Akarshan Biswas  CUDA: fix build error from ambiguous __half conversions...
2025-09-01  hipudding  CANN: Optimize MUL_MAT_ID (#15658)
2025-09-01  hipudding  CANN: fix RoPE cache issue on multi-device (#15629)
2025-08-31  Georgi Gerganov  sampling : optimize samplers by reusing bucket sort...
2025-08-31  Georgi Gerganov  server : enable /slots by default and make it secure...
2025-08-31  Georgi Gerganov  metal : fix checks for available FA kernels (#15700)
2025-08-31  Diego Devesa  llama : fix fattn reserve call n_seqs parameter (#15699)
2025-08-31  Diego Devesa  llama : separate compute buffer reserve from fattn...
2025-08-31  Sigbjørn Skjæret  ci : explicitly set fa off or on (#15692)
2025-08-31  Jeff Bolz  vulkan: handle large sizes for get_rows (#15686)
2025-08-31  Jeff Bolz  vulkan: mul_mat_id coopmat2 optimizations (#15546)
2025-08-31  Daniel Bevenius  vulkan : remove unused portability_enumeration_ext...
2025-08-31  Jeff Bolz  vulkan: Allow fallback to sysmem memory when vidmem...
2025-08-31  Jeff Bolz  vulkan: clamp matmul and FA results to the max finite...
2025-08-30  Charles Xu  ggml: update kleidiai to v1.13.0 (#15663)
2025-08-30  Diego Devesa  Update build.md to remove MSVC arm64 notes (#15684)
2025-08-30  Johannes Gäßler  llama: use FA + max. GPU layers by default (#15434)
2025-08-30  Johannes Gäßler  CUDA: use FP32 arithmetic for conv2d (#15683)
2025-08-30  Jeff Bolz  vulkan: Skip syncing for prealloc_y when it is reused...
2025-08-30  Chenguang Li  CANN: FIx compiler warnings (#15661)
2025-08-29  Sergey Alirzaev  server : removed obsolete doc (#15670)
2025-08-29  Johannes Gäßler  scripts: strip "AMD Instinct" from GPU name (#15668)
2025-08-29  ExtReMLapin  server : add documentation for `parallel_tool_calls...
2025-08-29  Aman Gupta  CUDA: fix bug in rms_norm fusion (#15660)
2025-08-29  Piotr Wilkin...  chat : Seed OSS thinking + tool call support (#15552)
2025-08-29  Aman Gupta  CUDA: fuse adds, fuse add with rms norm (#15631)
2025-08-29  Gabe Goodhart  nvidia nemotron nano v2 (nemotronh) (#15507)
2025-08-28  Gabe Goodhart  fix: Compute the full sum in llama-eval-callback, not...
2025-08-28  mnehete32  CUDA: add conv2d (#15635)
2025-08-28  Aaron Teo  ggml-cpu: fix invalid hsum build in debug s390x (#15634)
2025-08-28  compilade  ggml : fix SSM_SCAN for n_groups > 1 (#15625)
2025-08-28  Georgi Gerganov  kv-cache : fix find_slot to not search for continuous...
2025-08-28  Sigbjørn Skjæret  model : jina-embeddings-v3 support (#13693)
2025-08-28  Aman Gupta  scripts: add sqlite3 check for compare-commits.sh ...
2025-08-28  Georgi Gerganov  kv-cache : remove LLAMA_SET_ROWS checks (#15505)
2025-08-28  Aleksei Nikiforov  gguf-py: byteswapping improvements (#12851)
2025-08-28  Joshua Cogliati  cli : change log to warning to explain reason for stopp...
2025-08-28  Daniel Bevenius  model-conversion : add mmproj conversion target (#15628)
2025-08-28  matiaslin  cuda: Add cublasLt_static linking when GGML_STATIC...
2025-08-27  Johannes Gäßler  server: higher timeout for tests (#15621)
2025-08-27  Georgi Gerganov  presets : add qwen3-30B-a3b FIM (#15616)
2025-08-27  uvos  HIP: Enable support for ggml_backend_cuda_register_host...
2025-08-27  Georgi Gerganov  kv-cache : better estimate of n_kv for multi-sequence...
2025-08-27  Chenguang Li  CANN: refactor mask handling and improve performance...
2025-08-27  xctan  ggml-cpu : add basic RVV support for vector f32 ops...
2025-08-27  Daniel Bevenius  common : add -m to bash completion for --model [no...
2025-08-27  rmatif  OpenCL: add fused group_norm/norm, mul, add (#15314)
2025-08-26  Diego Devesa  tests : fix test-opt with GGML_BACKEND_DL (#15599)
2025-08-26  Akarshan Biswas  SYCL: fix rms_norm_mul_add for tensor dim not a multipl...
2025-08-26  fidoriel  mtmd : fix mtmd ios build (#15579)
2025-08-26  Eve  tests: add performance test for mul mat id (#15543)