]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog
pkg/ggml/sources/whisper.cpp
2025-12-12 Phylliida Devggml : add circular tiling support to pad, for Vulkan...
2025-12-12 Johannes GäßlerHIP: fix RDNA3 FP16/BF16 matrix multiplication (llama...
2025-12-12 Skyggml : improve error handling for search path existence...
2025-12-12 Jeff Bolzvulkan: Use one row per workgroup for f32 mmv (llama...
2025-12-12 Jeff Bolzvulkan: support solve_tri with larger N/K values (llama...
2025-12-12 Georgi Gerganovmetal : fix build(#17799)
2025-12-12 Masato Nakasakavulkan: Replace deprecated VK_EXT_validation_features...
2025-12-12 Masato Nakasakavulkan: Fix mismatch in TOPK_MOE unit test (llama/17541)
2025-12-12 Jeff Bolzvulkan: add more num_blocks instantiations in rms_norm...
2025-12-12 Jeff Bolzvulkan: fix top_k bug when there are ties in the input...
2025-12-12 Aclyvulkan : support conv-2d with large output size (llama...
2025-12-12 Reese Levineggml webgpu: unary op suppport, code refactoring, ops...
2025-12-12 Jeff Bolzvulkan: enable mmvq for q2_k on NVIDIA (llama/17675)
2025-12-12 Jeff Bolzvulkan: set all memory allocations to high priority...
2025-12-12 Georgi Gerganovrpc : fix alloc size logic (llama/17116)
2025-12-12 Georgi Gerganovmetal : add residency sets keep-alive heartbeat (llama...
2025-12-12 Johannes GäßlerHIP : fix RDNA4 build (llama/17792)
2025-12-12 shalinib-ibmQ4/Q8 Tiled Gemm Optimization. (llama/16999)
2025-12-12 Johannes GäßlerCUDA: fix FA VKQ accumulator overflow (llama/17746)
2025-12-12 Jiacheng (Jason... HIP: enable WMMA-MMQ INT kernels for RDNA 3 (llama...
2025-12-12 Piotr Wilkin... Add support for CUMSUM and TRI for CUDA. (llama/17584)
2025-12-12 Gabe Goodhartmetal: TRI, FILL, EXPM1, SOFTPLUS (llama/16623)
2025-12-12 Alberto Cabrera... ggml-cpu : remove asserts always evaluating to false...
2025-12-12 Georgi Gerganovmetal : use params per pipeline instance (llama/17739)
2025-12-12 Adrien Gallouëtbuild : move _WIN32_WINNT definition to headers (llama...
2025-12-12 Herman Semenoffggml-cpu: remove duplicate conditional check 'iid'...
2025-12-12 Johannes GäßlerCUDA: generalized (mma) FA, add Volta support (llama...
2025-12-12 Georgi Gerganovmetal : fix data race in pipeline library (llama/17731)
2025-12-12 Reese Levineggml webgpu: add support for emscripten builds (llama...
2025-12-12 Jeff Bolzvulkan: Reduce temporary memory usage for TOP_K (llama...
2025-12-12 xiaobing318cmake : add utf8 compilation options for msvc (llama...
2025-12-12 Adrien Gallouëtggml : use svcntb() for SVE vector length detection...
2025-12-12 TianHao324CANN: Disable Ger operator of OUT_PROD on 310p device...
2025-12-12 Daniel Beveniusggml : remove redundant n_copies check when setting...
2025-12-12 Adrien Gallouëtggml : add fallback definition for HWCAP2_SVE2 (llama...
2025-12-12 Aman Guptaggml-cuda: reorder only relevant nodes (llama/17639)
2025-12-12 Neo Zhang Jianyuenhance argsort for UT (llama/17573)
2025-12-12 Georgi Gerganovmetal : add FA head size 48 (llama/17619)
2025-12-12 Georgi Gerganovggml : extend the GGML_SCHED_NO_REALLOC debug logic...
2025-12-12 Aman Guptallama-graph: avoid expand_forward for fusion (llama...
2025-12-12 Tarek Dakhranmodel: LFM2-VL fixes (llama/17577)
2025-12-12 Gilad S.ggml: fix: macOS build with `-DGGML_BACKEND_DL=ON`...
2025-12-12 Aman GuptaCUDA: add stream-based concurrency (llama/16991)
2025-12-12 Mahekk Shaikhcuda : add error checking for cudaMemcpyAsync in argsor...
2025-12-12 Aclyvulkan : fix FA mask load with bounds check (coopmat2...
2025-12-12 Neo Zhangsycl : support to malloc memory on device more than...
2025-12-12 ixgbeggml: replace hwcap with riscv_hwprobe for RVV detectio...
2025-12-12 Ruben OrtlamVulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support...
2025-12-12 Jeff Bolzvulkan: improve topk perf for large k, fix overflow...
2025-12-12 Diego Devesaggml : add GGML_SCHED_NO_REALLOC option to disable...
2025-12-12 R0CKSTARenable fp16/fast_fp16/bf16_mma on PH1 (llama/17551)
2025-12-12 Aman Guptaggml-cuda: add stricter checking for fusion (llama...
2025-12-12 Piotr Wilkin... model : Qwen3 Next (llama/16095)
2025-12-12 Johannes GäßlerCUDA: no FP16 arithmetic for vector FA kernel (llama...
2025-12-12 Jeff Bolzvulkan: Implement GGML_OP_TRI (llama/17503)
2025-12-12 Radoslav Gerganovrpc : cache and reuse compute graphs (llama/15405)
2025-12-12 yuloHIP: enable mul_mat_f for RDNA4 (llama/17437)
2025-12-12 Piotr Wilkin... SOLVE_TRI CUDA kernel for small matrices (llama/17457)
2025-12-12 Neo Zhang Jianyurefactor pad_reflect_1d to make the UT case pass (llama...
2025-12-12 Jeff Bolzvulkan: Implement SOLVE_TRI (llama/17486)
2025-12-12 matt23654cuda : fix UMA detection on discrete GPUs. (llama/17537)
2025-12-12 Alberto Cabrera... ggml-cpu: aarm64: q4_K repack gemm and gemv implementat...
2025-12-12 Aclyvulkan : move contiguous checks to device_supports_op...
2025-12-12 Jeff Bolzvulkan: use a fixed 1KB buffer for the add_rms_fusion...
2025-12-12 lhezopencl: add sqr, sqrt, mean and ssm_conv (llama/17476)
2025-12-12 Alberto Cabrera... Fix chunks being too small with small matrix sizes...
2025-12-12 Jeff Bolzvulkan: allow graph_optimize for prompt processing...
2025-12-12 Jeff Bolzvulkan: Implement top-k (llama/17418)
2025-12-12 xctanggml-cpu : add RISC-V Zvfh impl for ggml_vec_mad_f16...
2025-12-12 Adrien Gallouëtggml : fix ARM feature verification (llama/17519)
2025-12-12 Jiacheng (Jason... HIP: Patch failed testcase in WMMA-MMQ kernels for...
2025-12-12 hipuddingCANN: Add MROPE and IMROPE support (llama/17401)
2025-12-12 Jeff Bolzvulkan: Implement GGML_OP_CUMSUM (llama/17479)
2025-12-12 Georgi Gerganovggml : add ggml_top_k (llama/17365)
2025-12-12 TianHao324CANN: supports out_prod operator for F32 and F16 (llama...
2025-12-12 Jeff Bolzvulkan: Use fewer rows for scalar FA when HS is not...
2025-12-12 Jeff Bolzvulkan: more FA details in vk_perf_logger (llama/17443)
2025-12-12 Jiacheng (Jason... HIP: WMMA-MMQ kernels for RDNA 4 (llama/17156)
2025-12-12 Alberto Cabrera... ggml-cpu: arm64: q4_K repack gemm and gemv implementati...
2025-12-12 ixgbeggml: add RISC-V cpu-feats (llama/17461)
2025-12-12 Max Krasnyanskyhexagon: add support for ROPE_NEOX (llama/17458)
2025-12-12 Raul TorresCANN: Define `cann_graph_update_required` before macro...
2025-12-12 M. Mediouniggml-hexagon: Initial Hexagon v68/v69 support (llama...
2025-12-12 nullnameggml-hexagon: add `hex_supported_buffer` for better...
2025-12-12 Sigbjørn Skjæretcuda : support non-contiguous i32 to i32 copy (llama...
2025-12-12 Jeff Bolzvulkan: remove a couple unnecessary switches (llama...
2025-12-12 yuloHIP: RDNA4 tensor core support for MMF (llama/17077)
2025-12-12 lhezopencl: refine condition for kqv mm (llama/17392)
2025-12-12 Jeff Bolzvulkan: disable async for older Intel devices (llama...
2025-12-12 Raul TorresCANN: Refactor `evaluate_and_capture_cann_graph` (llama...
2025-12-12 nullnameggml-hexagon: fix swiglu failure at `test-backend-ops...
2025-12-12 Piotr Wilkin... ggml : Fix transposed SOLVE_TRI result (llama/17323)
2025-12-12 Scott FudallyDGX Spark: UMA support (llama/17368)
2025-12-12 Adrien Gallouëtggml : remove useless and error-prone variadic macros...
2025-12-12 sudhiarmkleidiai: fix zero-size array declaration (llama/17240)
2025-12-12 ixgbeggml-cpu:add RISC-V RVV (Zvfh) optimization for FP16...
2025-12-12 Giuseppe Scrivanovulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP...
2025-12-12 Jeff Bolzvulkan: support larger argsort (llama/17313)
2025-12-12 Jeff Bolzvulkan: Add copy_transpose shader (llama/17371)
2025-12-12 Aman Guptacuda: fix rope fusion for gemma3 (llama/17378)
next