]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog
pkg/ggml/sources/whisper.cpp
2026-02-16 Mathieu BaudierUpdate upstream debian/latest
2026-02-16 Mathieu BaudierMerge tag 'upstream/1.8.3+155' into debian/latest
2026-02-15 Georgi Gerganovtalk-llama : sync llama.cpp upstream/1.8.3+155
2026-02-15 Georgi Gerganovsync : ggml
2026-02-15 Georgi Gerganovmodels : optimize qwen3next graph (llama/19375)
2026-02-15 Adrien Gallouëtggml : fix GGML_DEBUG with OpenMP (llama/19599)
2026-02-15 Georgi Gerganovmetal : fix ACC op (llama/19427)
2026-02-15 Jeff Bolzvulkan: support L2_NORM with contiguous rows (llama...
2026-02-15 Jeff Bolzvulkan: support GGML_OP_SET (llama/19584)
2026-02-15 Sophonvulkan: Add vendor id for Qualcomm drivers (llama/19569)
2026-02-15 Max Krasnyanskyhexagon: further optimizations and refactoring for...
2026-02-15 Jeff Bolzvulkan: restore -inf check in FA shaders (llama/19582)
2026-02-15 Alberto Cabrera... Fix wrong memcpy length for block_interleave == 4 ...
2026-02-15 ymckifix vulkan ggml_acc only works in 3d but not 4d (llama...
2026-02-15 Aman GuptaCUDA: loop over ne2*ne3 in case it overflows (llama...
2026-02-15 Oliver SimonsCUDA: Do not mutate cgraph for fused ADDs (llama/19566)
2026-02-15 Georgi Gerganovmetal : improve concurrency (llama/19555)
2026-02-15 Georgi Gerganovmetal : support GGML_OP_SET (llama/19548)
2026-02-15 Shupei Fanhexagon: fix typo in vtcm_needs_release (llama/19545)
2026-02-15 lhezopencl: add basic support for q4_1 (llama/19534)
2026-02-15 Georgi Gerganovmetal : update sum_rows kernel to support float4 (llama...
2026-02-15 Mario LimoncielloAdd a workaround for compilation with ROCWMMA_FATTN...
2026-02-15 Max Krasnyanskyhexagon: further optimization and tuning of matmul...
2026-02-15 lhezopencl: add general Q6_K mm and Q4_K mv (llama/19347)
2026-02-15 Georgi Gerganovggml : unary ops support non-cont src0 + metal F16...
2026-02-15 Georgi Gerganovmetal : extend l2_norm support for non-cont src0 (llama...
2026-02-15 Max Krasnyanskyhexagon: Add ARGSORT, DIV, SQR, SQRT, SUM_ROWS, GEGLU...
2026-02-15 Georgi Gerganovggml : extend bin bcast for permuted src1 (llama/19484)
2026-02-15 Georgi Gerganovmetal : consolidate unary ops (llama/19490)
2026-02-15 Oliver SimonsCUDA : Update CCCL-tag for 3.2 to final release from...
2026-02-15 Nikhil JainPlug memory leaks and free resources on shutdown (llama...
2026-02-15 Alberto Cabrera... ggml-cpu: arm64: q6_K repack gemm and gemv (and generic...
2026-02-15 k4ss4nggml : use noexcept overload for is_regular_file in...
2026-02-15 Raul TorresCANN: Remove unnecessary wrapper for `gml_backend_buft_...
2026-02-15 hipuddingCANN: implement quantized MUL_MAT_ID for MoE models...
2026-02-15 Georgi Gerganovcuda : extend GGML_OP_PAD to work with non-cont src0...
2026-02-15 Oliver SimonsCUDA: Fix non-contig rope (llama/19338)
2026-02-09 Nunoci: add vulkan docker image (#3644)
2026-02-09 Pádraic Slatterychore: Update outdated GitHub Actions versions (#3646)
2026-02-09 Christian Kastnercmake: Drop obsolete build-time configuration of backen...
2026-02-09 Sid Mohanserver : fix hardcoded /inference path in default HTML...
2026-02-09 Georgi Gerganovci : try fix mirrors (#3655)
2026-02-08 Georgi Gerganovtalk-llama : sync llama.cpp
2026-02-08 Georgi Gerganovsync : ggml
2026-02-08 Georgi Gerganovmetal : consolidate bin kernels (llama/19390)
2026-02-08 Georgi Gerganovmetal : fix event synchronization in cpy_tensor_async...
2026-02-08 Abhijit Rameshggml-webgpu: JIT compile binary operators and handle...
2026-02-08 Nechama Krashinskisycl: add F16 support for GGML_OP_CEIL (llama/19306)
2026-02-08 Jeff Bolzvulkan: For coopmat2 FA, use fp16 accumulators for...
2026-02-08 Jeff Bolzvulkan: make FA mask/softcap enables spec constants...
2026-02-08 Georgi Gerganovmetal : skip loading all-zero mask (llama/19337)
2026-02-08 Georgi Gerganovcuda : cuda graphs now compare all node params (llama...
2026-02-08 Georgi Gerganovmetal : adaptive CPU/GPU interleave based on number...
2026-02-08 Jeff Bolzvulkan: Preprocess FA mask to detect all-neg-inf and...
2026-02-08 Georgi Gerganovmetal : add diag (llama/19330)
2026-02-08 Oleksandr Kuvshynovvulkan: fix GPU deduplication logic. (llama/19222)
2026-02-08 Jeff Bolzvulkan: Set k_load_shmem to false when K is too large...
2026-02-08 Jeff Bolzvulkan: fix non-contig rope (llama/19299)
2026-02-08 will-lmsmetal : add missing includes (llama/19348)
2026-02-08 Kevin Pougetggml-virtgpu: make the code thread safe (llama/19204)
2026-02-08 Aman Guptaggml-cpu: use LUT for converting e8->f32 scales on...
2026-02-08 Georgi Gerganovmetal : add solve_tri (llama/19302)
2026-02-08 Ruben Ortlamvulkan: disable coopmat1 fa on Nvidia Turing (llama...
2026-02-08 Aman GuptaCUDA: use mmvq for mul-mat-id for small batch sizes...
2026-02-08 Georgi Gerganovmetal : minor cleanup (llama/19251)
2026-02-08 Oliver SimonsCUDA: Fix loop unrolling for BW in mul_mat_q_stream_k_f...
2026-02-08 Georgeggml: added cleanups in ggml_quantize_free (llama/19278)
2026-02-08 Gaurav Gargcuda : revert CUDA_SCALE_LAUNCH_QUEUES override until...
2026-02-08 lhezopencl: refactor some ops, concat, repeat, tanh and...
2026-02-08 Aman Guptaggml-cpu: FA split across kv for faster TG (llama/19209)
2026-02-08 Neo ZhangRemove support for Nvidia & AMD GPU, because the oneAPI...
2026-02-08 Tamarsycl: implement GGML_OP_TOP_K (llama/19242)
2026-02-08 Georgi Gerganovmetal : support virtual devices (llama/18919)
2026-02-08 Johannes Gäßlerggml-backend: fix async set/get fallback sync (llama...
2026-02-08 Christian Kastnerdocs : Minor cleanups (llama/19252)
2026-02-08 Nikhil JainRemove pipeline cache mutexes (llama/19195)
2026-02-08 Max KrasnyanskyBump cmake max version (needed for Windows on Snapdrago...
2026-02-08 nullnameggml-hexagon: flash-attention and reduce-sum optimizati...
2026-02-08 shaofeiqiopencl: add optimized q8_0 mm kernel for adreno (llama...
2026-02-08 Simon RedmanCorrectly fetch q8_1 quantize pipeline in test as neede...
2026-02-08 Georgi Gerganovggml : bump version to 0.9.6 (ggml/1423)
2026-02-08 Georgi Gerganovcmake : remove unused file (ggml/1419)
2026-02-04 KITAITI Makotoruby : add `Whisper::Context::Params`, fix token memory...
2026-02-03 Mathieu BaudierAdd patch removing obsolete build-time configuration...
2026-01-30 KITAITI Makotoruby : add `VAD::Context#segments_from_samples`, allow...
2026-01-30 Frieder Bluemlescripts : Fix dSYMs path case for macOS xcframework...
2026-01-30 Georgi Gerganovcuda : fix compile warnings (#0)
2026-01-30 Georgi Gerganovtalk-llama : sync llama.cpp
2026-01-30 Georgi Gerganovsync : ggml
2026-01-30 bssrdfadd tensor type checking as part of cuda graph properti...
2026-01-30 s8322sycl: implement GGML_UNARY_OP_SOFTPLUS (llama/19114)
2026-01-30 RachelMantelsycl: implement GGML_OP_TRI (llama/19089)
2026-01-30 Zheyuan Chenggml-webgpu: improve flastAttention performance by...
2026-01-30 Todor Boinovskihexagon: enable offloading to Hexagon on Windows on...
2026-01-30 Georgi Gerganovcuda : fix nkvo, offload and cuda graph node properties...
2026-01-30 yuloHIP: add mmf for CDNA (llama/18896)
2026-01-30 Vishal Singhggml-zendnn : resolve ZenDNN backend cross-module symbo...
2026-01-30 Aman GuptaCUDA: refactor topk-moe to enable more models (GLM...
2026-01-30 Neo Zhangsycl: fix norm kernels: l2_norm, group_norm, rms_norm...
2026-01-30 Ruben OrtlamVulkan Flash Attention Coopmat1 Refactor (llama/19075)
next