]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2025-09-03 Ruben Ortlamvulkan: fix mmv subgroup16 selection (#15775)
2025-09-03 Jeff Bolzvulkan: don't use std::string in load_shaders, to impro...
2025-09-03 Daniel Beveniusvulkan : update ggml_vk_instance_validation_ext_availab...
2025-09-03 Shin-myoung... ggml vulkan: add hardsigmoid and hardswish operations...
2025-09-03 Oliver SimonsCUDA: Optimize `rms_norm_f32` kernel and its fused...
2025-09-03 Daniel Beveniusmodel-conversion : fix pyright errors (#15770)
2025-09-03 Georgi Gerganovsampling : optimize dist sampler (#15704)
2025-09-03 Daniel Beveniusllama : fix incorrect model type for Gemma 270M (...
2025-09-03 Daniel Beveniusmodel-conversion : remove hardcoded /bin/bash shebangs...
2025-09-03 hipuddingCANN: Add RoPE contiguous check for 310I DUP device...
2025-09-03 xctanggml-cpu : optimize RVV kernels (#15720)
2025-09-03 Daniel Beveniusmodel-conversion : add missing curl script [no ci]...
2025-09-03 hipuddingCANN: Mask unsupported TRANSPOSE_1D operator (#15733)
2025-09-03 Chenguang LiCANN: Fix type float_t to float (#15736)
2025-09-02 SnA1lGofix: resolve unsigned int initialization warning for...
2025-09-02 Oliver Simonschore: Update `.clang-format` to use `BinPackArguments...
2025-09-02 Johannes Gäßlerllama: -fa 1/0/-1 aliases for -fa on/off/auto (#15746)
2025-09-02 Ruben Ortlamvulkan: fix shaders gen when no integer dot is availabl...
2025-09-02 hipuddingCANN: Resolve soft_max precision issue (#15730)
2025-09-02 Jeff Bolzvulkan: Fix macro parameter order for f32 matmul shader...
2025-09-02 rmatifopencl: add attn sinks support for FA kernels (#15706)
2025-09-02 Chenguang LiCANN: Support eager execution mode under ACL graph...
2025-09-02 hipuddingCANN: Support ext_factor in rope (#15710)
2025-09-01 Johannes Gäßlerggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722)
2025-09-01 Gilad S.vulkan: use memory budget extension to read memory...
2025-09-01 Jeff Bolzvulkan: add missing clamps in new mul_mat_id paths...
2025-09-01 Ruben Ortlamvulkan: disable large mmv subgroups on older Nvidia...
2025-09-01 s-goto-11ggml: SVE support for exponential functions (#15145)
2025-09-01 Prashant Vithuleggml: aarch64: Implement SVE F16 kernels for vector...
2025-09-01 Jie Fu (傅杰)convert : remove redundant code (#15708)
2025-09-01 Ruben OrtlamVulkan: Add Integer Dot Product mul_mat_vec shader...
2025-09-01 Daniel Beveniusggml : WebGPU add TRANSPOSE and RESHAPE to supported...
2025-09-01 Jie Fu (傅杰)docs : add Hunyuan to models section (#15707)
2025-09-01 Akarshan BiswasCUDA: fix build error from ambiguous __half conversions...
2025-09-01 hipuddingCANN: Optimize MUL_MAT_ID (#15658)
2025-09-01 hipuddingCANN: fix RoPE cache issue on multi-device (#15629)
2025-08-31 Georgi Gerganovsampling : optimize samplers by reusing bucket sort...
2025-08-31 Georgi Gerganovserver : enable /slots by default and make it secure...
2025-08-31 Georgi Gerganovmetal : fix checks for available FA kernels (#15700)
2025-08-31 Diego Devesallama : fix fattn reserve call n_seqs parameter (#15699)
2025-08-31 Diego Devesallama : separate compute buffer reserve from fattn...
2025-08-31 Sigbjørn Skjæretci : explicitly set fa off or on (#15692)
2025-08-31 Jeff Bolzvulkan: handle large sizes for get_rows (#15686)
2025-08-31 Jeff Bolzvulkan: mul_mat_id coopmat2 optimizations (#15546)
2025-08-31 Daniel Beveniusvulkan : remove unused portability_enumeration_ext...
2025-08-31 Jeff Bolzvulkan: Allow fallback to sysmem memory when vidmem...
2025-08-31 Jeff Bolzvulkan: clamp matmul and FA results to the max finite...
2025-08-30 Charles Xuggml: update kleidiai to v1.13.0 (#15663)
2025-08-30 Diego DevesaUpdate build.md to remove MSVC arm64 notes (#15684)
2025-08-30 Johannes Gäßlerllama: use FA + max. GPU layers by default (#15434)
2025-08-30 Johannes GäßlerCUDA: use FP32 arithmetic for conv2d (#15683)
2025-08-30 Jeff Bolzvulkan: Skip syncing for prealloc_y when it is reused...
2025-08-30 Chenguang LiCANN: FIx compiler warnings (#15661)
2025-08-29 Sergey Alirzaevserver : removed obsolete doc (#15670)
2025-08-29 Johannes Gäßlerscripts: strip "AMD Instinct" from GPU name (#15668)
2025-08-29 ExtReMLapinserver : add documentation for `parallel_tool_calls...
2025-08-29 Aman GuptaCUDA: fix bug in rms_norm fusion (#15660)
2025-08-29 Piotr Wilkin... chat : Seed OSS thinking + tool call support (#15552)
2025-08-29 Aman GuptaCUDA: fuse adds, fuse add with rms norm (#15631)
2025-08-29 Gabe Goodhartnvidia nemotron nano v2 (nemotronh) (#15507)
2025-08-28 Gabe Goodhartfix: Compute the full sum in llama-eval-callback, not...
2025-08-28 mnehete32CUDA: add conv2d (#15635)
2025-08-28 Aaron Teoggml-cpu: fix invalid hsum build in debug s390x (#15634)
2025-08-28 compiladeggml : fix SSM_SCAN for n_groups > 1 (#15625)
2025-08-28 Georgi Gerganovkv-cache : fix find_slot to not search for continuous...
2025-08-28 Sigbjørn Skjæretmodel : jina-embeddings-v3 support (#13693)
2025-08-28 Aman Guptascripts: add sqlite3 check for compare-commits.sh ...
2025-08-28 Georgi Gerganovkv-cache : remove LLAMA_SET_ROWS checks (#15505)
2025-08-28 Aleksei Nikiforovgguf-py: byteswapping improvements (#12851)
2025-08-28 Joshua Cogliaticli : change log to warning to explain reason for stopp...
2025-08-28 Daniel Beveniusmodel-conversion : add mmproj conversion target (#15628)
2025-08-28 matiaslincuda: Add cublasLt_static linking when GGML_STATIC...
2025-08-27 Johannes Gäßlerserver: higher timeout for tests (#15621)
2025-08-27 Georgi Gerganovpresets : add qwen3-30B-a3b FIM (#15616)
2025-08-27 uvosHIP: Enable support for ggml_backend_cuda_register_host...
2025-08-27 Georgi Gerganovkv-cache : better estimate of n_kv for multi-sequence...
2025-08-27 Chenguang LiCANN: refactor mask handling and improve performance...
2025-08-27 xctanggml-cpu : add basic RVV support for vector f32 ops...
2025-08-27 Daniel Beveniuscommon : add -m to bash completion for --model [no...
2025-08-27 rmatifOpenCL: add fused group_norm/norm, mul, add (#15314)
2025-08-26 Diego Devesatests : fix test-opt with GGML_BACKEND_DL (#15599)
2025-08-26 Akarshan BiswasSYCL: fix rms_norm_mul_add for tensor dim not a multipl...
2025-08-26 fidorielmtmd : fix mtmd ios build (#15579)
2025-08-26 Evetests: add performance test for mul mat id (#15543)
2025-08-26 shalinib-ibmllamafile: PowerPC Sgemm Optimization (#15558)
2025-08-26 Georgi Gerganovgraph : fix assert in memory-less build_attn (#15590)
2025-08-26 Daniel Beveniusmodel-conversion : add qat-q4 quantization targets...
2025-08-26 Johannes GäßlerCUDA: return -1 for nonexistent compiled arch (#15587)
2025-08-26 Georgi Gerganovmetal : optimize FA vec for large sequences and BS...
2025-08-26 Xuan-Son Nguyenmtmd : support Kimi VL model (#15458)
2025-08-26 Georgi Gerganovcontext : print graph stats for memory-less contexts...
2025-08-26 Georgi Gerganovmetal : improve `MUL_MAT_ID` (#15541)
2025-08-26 tc-mbmodel : support MiniCPM-V 4.5 (#15575)
2025-08-26 Sigbjørn Skjæretgguf-py : remove erroneous FFN_GATE entry (#15583)
2025-08-26 Sigbjørn Skjæretmetal : remove contiguous assertion for src0 in IM2COL...
2025-08-26 Yoshi_likes_e4Add a warning for special devices (#15563)
2025-08-26 Jeff Bolzvulkan: Remove splitting for mul_mat_id (#15568)
2025-08-25 QeeweewCUDA: Accelerate MXFP4 table lookup using `__byte_perm...
2025-08-25 lhezopencl: fix support ops condition for `rms_norm` (...
2025-08-25 Ruben Ortlamvulkan: fix min subgroup 16 condition for mmid subgroup...
next