]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog
pkg/ggml/sources/whisper.cpp
2025-09-20 Chenguang LiCANN: Support eager execution mode under ACL graph...
2025-09-20 hipuddingCANN: Support ext_factor in rope (llama/15710)
2025-09-20 Johannes Gäßlerggml-backend: raise GGML_MAX_SPLIT_INPUTS (llama/15722)
2025-09-20 Gilad Svulkan: use memory budget extension to read memory...
2025-09-20 Jeff Bolzvulkan: add missing clamps in new mul_mat_id paths...
2025-09-20 Ruben Ortlamvulkan: disable large mmv subgroups on older Nvidia...
2025-09-20 s-goto-11ggml: SVE support for exponential functions (llama...
2025-09-20 Prashant Vithuleggml: aarch64: Implement SVE F16 kernels for vector...
2025-09-20 Ruben OrtlamVulkan: Add Integer Dot Product mul_mat_vec shader...
2025-09-20 Daniel Beveniusggml : WebGPU add TRANSPOSE and RESHAPE to supported...
2025-09-20 Akarshan BiswasCUDA: fix build error from ambiguous __half conversions...
2025-09-20 hipuddingCANN: Optimize MUL_MAT_ID (llama/15658)
2025-09-20 hipuddingCANN: fix RoPE cache issue on multi-device (llama/15629)
2025-09-20 Georgi Gerganovmetal : fix checks for available FA kernels (llama...
2025-09-20 Diego Devesallama : separate compute buffer reserve from fattn...
2025-09-20 Jeff Bolzvulkan: handle large sizes for get_rows (llama/15686)
2025-09-20 Jeff Bolzvulkan: mul_mat_id coopmat2 optimizations (llama/15546)
2025-09-20 Daniel Beveniusvulkan : remove unused portability_enumeration_ext...
2025-09-20 Jeff Bolzvulkan: Allow fallback to sysmem memory when vidmem...
2025-09-20 Jeff Bolzvulkan: clamp matmul and FA results to the max finite...
2025-09-20 Charles Xuggml: update kleidiai to v1.13.0 (llama/15663)
2025-09-20 Johannes Gäßlerllama: use FA + max. GPU layers by default (llama/15434)
2025-09-20 Johannes GäßlerCUDA: use FP32 arithmetic for conv2d (llama/15683)
2025-09-20 Jeff Bolzvulkan: Skip syncing for prealloc_y when it is reused...
2025-09-20 Chenguang LiCANN: FIx compiler warnings (llama/15661)
2025-09-20 Aman GuptaCUDA: fix bug in rms_norm fusion (llama/15660)
2025-09-20 Aman GuptaCUDA: fuse adds, fuse add with rms norm (llama/15631)
2025-09-20 mnehete32CUDA: add conv2d (llama/15635)
2025-09-20 Aaron Teoggml-cpu: fix invalid hsum build in debug s390x (llama...
2025-09-20 compiladeggml : fix SSM_SCAN for n_groups > 1 (llama/15625)
2025-09-20 Georgi Gerganovkv-cache : remove LLAMA_SET_ROWS checks (llama/15505)
2025-09-20 matiaslincuda: Add cublasLt_static linking when GGML_STATIC...
2025-09-20 uvosHIP: Enable support for ggml_backend_cuda_register_host...
2025-09-20 Chenguang LiCANN: refactor mask handling and improve performance...
2025-09-20 xctanggml-cpu : add basic RVV support for vector f32 ops...
2025-09-20 rmatifOpenCL: add fused group_norm/norm, mul, add (llama...
2025-09-20 Akarshan BiswasSYCL: fix rms_norm_mul_add for tensor dim not a multipl...
2025-09-20 shalinib-ibmllamafile: PowerPC Sgemm Optimization (llama/15558)
2025-09-20 Johannes GäßlerCUDA: return -1 for nonexistent compiled arch (llama...
2025-09-20 Georgi Gerganovmetal : optimize FA vec for large sequences and BS...
2025-09-20 Georgi Gerganovmetal : improve `MUL_MAT_ID` (llama/15541)
2025-09-20 Sigbjørn Skjæretmetal : remove contiguous assertion for src0 in IM2COL...
2025-09-20 Yoshi_likes_e4Add a warning for special devices (llama/15563)
2025-09-20 Jeff Bolzvulkan: Remove splitting for mul_mat_id (llama/15568)
2025-09-20 QeeweewCUDA: Accelerate MXFP4 table lookup using `__byte_perm...
2025-09-20 lhezopencl: fix support ops condition for `rms_norm` (llama...
2025-09-20 Ruben Ortlamvulkan: fix min subgroup 16 condition for mmid subgroup...
2025-09-20 Ihar Hrachyshkametal: fix regression when no metal devices are present...
2025-09-20 Johannes GäßlerCUDA: MoE helper in device code, better tile sizes...
2025-09-20 Georgi Gerganovmetal : add FA kernels for HS=40 (llama/15559)
2025-09-20 Chenguang LiCANN: ROPE cache sin/cos repeat (llama/15501)
2025-09-20 Ruben Ortlamvulkan: apply MUL_MAT_ID subgroup optimization to non...
2025-09-20 Jeff Bolzvulkan: Support FA with any multiple of 8 head sizes...
2025-09-20 Ruben Ortlamvulkan: enable Conv2D for Apple after MoltenVK fixed...
2025-09-20 Jeff Bolzvulkan: workaround MoltenVK compile failure in multi_ad...
2025-09-20 Johannes GäßlerCUDA: fix half2 -> half conversion for HIP (llama/15529)
2025-09-20 Jeff Bolzvulkan: optimize rms_norm, and allow the work to spread...
2025-09-20 Jeff Bolzvulkan: Rewrite synchronization to allow some overlap...
2025-09-20 Aclyvulkan : support ggml_mean (llama/15393)
2025-09-20 Jeff Bolzvulkan: optimize mul_mat_id loading row ids into shared...
2025-09-20 Reese Levineggml WebGPU: add support for quantization types (llama...
2025-09-20 rmatifggml: add `conv3d` op (llama/15182)
2025-09-20 Yavor Ivanovcuda : add Pad Reflect 1D support (llama/14659)
2025-09-20 Aaron Teoggml-cpu: Support Q5_0 and Q5_1 on s390x (llama/15486)
2025-09-20 Chenguang LiCANN: Optimize RMS_NORM using cache (llama/15419)
2025-09-20 Diego Devesasched : fix possible use of wrong ids tensor when offlo...
2025-09-20 Aclyvulkan : support conv_2d_dw with f16 weights (llama...
2025-09-20 Dong Won Kimvulkan: add exp operation (llama/15456)
2025-09-20 Jeff Bolzvulkan: Reuse conversion results in prealloc_y (llama...
2025-09-20 Xuan-Son Nguyenggml : fix condition of im2col on Metal backend (llama...
2025-09-20 R0CKSTARmusa: add GGML_UNUSED_VARS (llama/15446)
2025-09-20 Diego Devesasched : copy only the used experts when offloading...
2025-09-20 Johannes GäßlerCUDA: refactor FA support/selection code (llama/15454)
2025-09-20 Johannes GäßlerCUDA: replace GGML_CUDA_F16 with CUDA arch checks ...
2025-09-20 Jeff Bolzvulkan: shorten pipeline name strings (llama/15431)
2025-09-20 R0CKSTARmusa: fix build warnings (llama/15258)
2025-09-20 lhezopencl: mark `argsort` unsupported if cols exceed workg...
2025-09-20 SHUAI YANGCANN: optimize rope operator (llama/15335)
2025-09-20 R0CKSTARmusa: handle __hgt2_mask, available starting from MUSA...
2025-09-20 Marvin Gießingggml-cpu: add mxfp4 VSX intrinsics for Power9+ (ppc64le...
2025-09-20 Georgi Gerganovcuda : remove obsolete sources (ggml/1332)
2025-09-19 Carlos Zoidoggml : Fix MKL detection by quoting BLAS_INCLUDE_DIRS...
2025-09-08 Siva Mahadevanwhisper : prefer curl over wget in download scripts...
2025-09-05 Daniel Beveniusci : remove brew installation of cmake for macos-latest...
2025-09-04 Daniel Beveniustests : use CMake definitions for model/sample paths...
2025-08-24 TrebokoHandle negative value in padding (#3389)
2025-08-24 Thea Mukhimodels : update`./models/download-ggml-model.cmd`...
2025-08-18 Georgi Gerganovtalk-llama : sync llama.cpp
2025-08-18 Georgi Gerganovsync : ggml
2025-08-18 Reese Levineggml: Add initial WebGPU backend (llama/14521)
2025-08-18 Aaron Teoggml : initial zDNN backend (llama/14975)
2025-08-18 Georgi Gerganovcommon : handle mxfp4 enum
2025-08-18 compiladeggml-quants : fix make_qp_quants NANs and IQ1 assertion...
2025-08-18 Jeff Bolzvulkan: disable spirv-opt for bfloat16 shaders (llama...
2025-08-18 Jeff Bolzvulkan: Use larger workgroups for mul_mat_vec when...
2025-08-18 Dong Won Kimvulkan: support sqrt (llama/15370)
2025-08-18 Jeff Bolzvulkan: Optimize argsort (llama/15354)
2025-08-18 Jeff Bolzvulkan: fuse adds (llama/15252)
2025-08-18 Jeff Bolzvulkan: Support mul_mat_id with f32 accumulators (llama...
2025-08-18 Jeff Bolzvulkan: Add missing bounds checking to scalar/coopmat1...
next