]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog
pkg/ggml/sources/whisper.cpp
2025-11-17 Piotr Wilkin... ggml : add ops SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM...
2025-11-17 Ruben Ortlamvulkan: remove shell call from vulkan-shaders-gen tool...
2025-11-17 Diego Devesasched : fix reserve ignoring user tensor assignments...
2025-11-17 ixgbeggml-cpu : add RISC-V vector intrinsic support for...
2025-11-17 bagheerametal: accelerated conv2d (llama/17175)
2025-11-17 Georgi GerganovRevert "ggml-cpu: handle 3d tensors in repack mat_mul...
2025-11-17 Diego Devesaggml-cpu : use template for argsort (llama/17222)
2025-11-17 TecJeshCANN: Add cross_entropy_loss op support (llama/16886)
2025-11-17 Aman GuptaCUDA: fuse rope + set_rows (llama/16884)
2025-11-17 Johannes GäßlerCUDA: static assert to prevent misuse of memcpy_1 ...
2025-11-17 Georgi Gerganovggml : use std::sort in ggml_argsort CPU implementation...
2025-11-17 Alberto Cabrera... ggml-cpu: handle 3d tensors in repack mat_mul (llama...
2025-11-17 TecJeshCANN: Add L2_NORM op support (llama/16856)
2025-11-17 Neo Zhang Jianyufix ci crash about SSM_CONV (llama/17169)
2025-11-17 Max Krasnyanskyhexagon: various Op fixes (llama/17135)
2025-11-17 Evedisable rms norm mul rope for chips with no fp16 rte...
2025-11-17 ixgbeggml-cpu : add RISC-V RVV (Zvfh) optimization for FP16...
2025-11-17 dudutaggml-cpu: templateify ggml_compute_forward_rope_f32...
2025-11-17 Charles Xukleidiai: add optimized per-channel kernels for Q8_0...
2025-11-17 Mike Abbottcmake : add version to all shared object files (llama...
2025-11-17 lhezopencl: add fastdiv and use it in set_rows, ported...
2025-11-17 Max Krasnyanskycpu: skip NOPs to avoid barriers (llama/17133)
2025-11-17 Georgi Gerganovmetal : cap threadgroups size of set_rows (llama/17146)
2025-11-17 Adrien Gallouëtggml-cpu : inspect -march and -mcpu to found the CPU...
2025-11-17 Ruben Ortlamvulkan: check glslc executable string (llama/17144)
2025-11-17 Ruben Ortlamvulkan: fix validation issue introduced by #16868 ...
2025-11-17 Georgi Gerganovmetal : enable tensor API for A19 (llama/17087)
2025-11-17 fj-y-saitoarm64: add i8mm route with SVE ggml_vec_dot_q4_K_q8_K...
2025-11-17 Aclycuda/vulkan : bicubic interpolation (llama/17022)
2025-11-17 Ruben Ortlamvulkan: fix memory allocations (llama/17122)
2025-11-17 KITAITI Makotovad : Silero VAD v6.2.0 (#3524)
2025-11-13 KITAITI Makotoruby : VAD separately from ASR (#3518)
2025-11-09 Georgi Gerganovsync : llama.cpp
2025-11-09 Georgi Gerganovsync : ggml
2025-11-09 Ruben Ortlamvulkan: iGPU memory reporting fix (llama/17110)
2025-11-09 Ruben Ortlamvulkan: fix mmq out of bounds reads (llama/17108)
2025-11-09 Jeff Bolzvulkan: fuse mul_mat_id + mul (llama/17095)
2025-11-09 Georgi Gerganovmetal : retain src and dst buffers during async ops...
2025-11-09 Jeff Bolzvulkan: Use spec constants for conv2d s/d/p and kernel...
2025-11-09 Aman GuptaRevert "CUDA: add expert reduce kernel (ggml/16857...
2025-11-09 Aman GuptaCUDA: skip fusion for repeating adds in bias (llama...
2025-11-09 SavicStefanvulkan: Increase BK to 32; use BK/4 for non-CM mul_mm...
2025-11-09 Aleksei Nikiforovggml: disable vxe for cross-compilation by default...
2025-11-09 Jeff Bolzvulkan: fuse rms_norm + mul + rope (+ view + set_rows...
2025-11-09 Jeff Bolzvulkan: Fix test-thread-safety crashes (llama/17024)
2025-11-09 Johannes GäßlerCUDA: fix MMQ stream-k fixup ne1 indices (llama/17089)
2025-11-09 Reese Levineggml webgpu: faster matrix multiplication/matrix-vector...
2025-11-09 bssrdfCUDA: properly handle nb00=nb02 case for cpy (llama...
2025-11-09 Aclyvulkan : refactor buffer handling in vk_op_f32 (llama...
2025-11-09 Johannes GäßlerCUDA: fix should_use_mmvf for ne11 == 1 (llama/17085)
2025-11-09 Adrien GallouëtRevert "ggml-cpu: detect correct cpu flags for arm64...
2025-11-09 ironggml-cpu: detect correct cpu flags for arm64 (ggml...
2025-11-09 xctanggml-cpu : optimize RVV q2_k and q3_k kernels (llama...
2025-11-09 Johannes GäßlerCUDA: fix crash on uneven context without FA (llama...
2025-11-09 Georgi Gerganovmetal : initial Metal4 tensor API support (llama/16634)
2025-11-09 YehuditEsycl: add CONCAT operator support (llama/16047)
2025-11-09 l3utterflyggml-hexagon: graceful fallback for older socs where...
2025-11-09 bssrdfimprove CUDA cpy memory bandwidth when copying transpos...
2025-11-09 Jeff Bolzvulkan: Fix GGML_VULKAN_CHECK_RESULTS to better handle...
2025-11-09 Reese Levineggml webgpu: minor set rows optimization (llama/16810)
2025-11-09 nullnamerefactor: replace sprintf with snprintf for safer strin...
2025-11-09 Jeff Bolzvulkan: remove the need for the dryrun (llama/16826)
2025-11-09 Aclyggml-cpu : bicubic interpolation (llama/16891)
2025-11-09 NoahFix garbled output with REPACK at high thread counts...
2025-11-09 Aman GuptaCUDA: avoid mul + bias fusion when doing fusion (llama...
2025-11-09 lhezopencl: support imrope (llama/16914)
2025-11-09 theo77186ggml: CUDA: add head size 72 for flash-attn (llama...
2025-11-09 Jinyang Heggml : LoongArch fixes (llama/16958)
2025-11-09 shani-fSYCL: optimized repeat_back kernel (3× fewer asm instru...
2025-11-09 Georgi Gerganovclip : use FA (llama/16837)
2025-11-09 mnehete32CUDA: add FLOOR, CEIL, ROUND, TRUNC unary ops (llama...
2025-11-09 Aaron Teoggml: add s390x cpu-feats (llama/16774)
2025-11-09 Jeff Bolzvulkan: Fix multi_add invalid descriptor usage (llama...
2025-11-09 Jeff Bolzvulkan: fuse mul_mat+add and mul_mat_id+add_id (llama...
2025-11-09 Oliver SimonsCUDA: Remove unneded bias/gate dims in fused mmvq ...
2025-11-09 Johannes GäßlerCUDA: Volta tensor core support for MMF (llama/16843)
2025-11-09 Georgi Gerganovggml : fix conv2d_dw SVE path (ggml/1380)
2025-11-09 Aman GuptaCUDA: add expert reduce kernel (llama/16857)
2025-11-09 Jeff Bolzvulkan: disable spirv-opt for rope shaders (llama/16872)
2025-11-09 Masato Nakasakavulkan: Fix crash when FP16 mul_mat accumulation is...
2025-11-09 Ruben Ortlamvulkan: fix shmem overrun in mmq id shader (llama/16873)
2025-11-09 l3utterflyggml-hexagon: respect input size when getting/setting...
2025-11-09 lhezopencl: fix boundary handling for mul_mm (llama/16875)
2025-11-09 Max Krasnyanskycpu: introduce chunking for repack matmuls and enable...
2025-11-09 JJJYmmmmodel: add support for qwen3vl series (llama/16780)
2025-11-09 Max Krasnyanskycpu: introduce chunking for flash attention (llama...
2025-11-09 Sigbjørn Skjæretcuda : fix argsort with 64k+ rows (llama/16849)
2025-11-09 Jeff Bolzvulkan: Handle argsort with a large number of rows...
2025-11-09 Oliver SimonsHide latency of bias and gate-loading (llama/16847)
2025-11-09 Jeff Bolzvulkan: Fuse rope+set_rows (llama/16769)
2025-11-09 Jeff Bolzvulkan: Update topk_moe fusion to handle gpt's late...
2025-11-09 Ruben OrtlamVulkan MMQ Integer Dot Refactor and K-Quant support...
2025-11-09 Max KrasnyanskyHexagon Op queue & dispatch optimizations (llama/16820)
2025-11-09 Aman GuptaCUDA: use fastdiv in set-rows (llama/16834)
2025-11-09 Jeff Bolzvulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffe...
2025-11-09 Aman GuptaCUDA: Fix bug in topk-moe for gpt-oss (llama/16821)
2025-11-09 YaelLogicsycl: add RMS_NORM_BACK operation support (llama/16808)
2025-11-09 YaelGitAccountcuda: add SET operation support (llama/16804)
2025-11-09 l3utterflyinitialise buffer.device in ggml_hexagon_session (llama...
2025-11-09 Chenguang LiCANN: Improve device ID handling and aclnnArange checks...
next