]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog
pkg/ggml/sources/whisper.cpp
2026-01-30 Reese Levineggml webgpu: support for backend sampling (llama/18880)
2026-01-30 Thore Koritziusggml : extend ggml_pool_1d + metal (llama/16429)
2026-01-30 Perry Naseckggml-blas: hide warnings from included BLAS headers...
2026-01-30 Raul TorresCANN: Remove unused `ggml_cann_get_device` function...
2026-01-30 Chenguang LiCANN: fix an issue where get_env was not fully renamed...
2026-01-30 hipuddingCANN: support gated linear attn (llama/18653)
2026-01-30 shaofeiqiOpenCL: add SOLVE_TRI op support (llama/18846)
2026-01-30 Georgi Gerganovcuda : print less debug logs when disabling cuda graphs...
2026-01-30 Johannes GäßlerCUDA: fix allignment on register spill for FA (llama...
2026-01-30 shalinib-ibmggml-cpu: optimize ggml_vec_dot_bf16 for Power9 (llama...
2026-01-30 Max Krasnyanskyhexagon: support for OP_CPY, host buffers now optional...
2026-01-30 Oliver SimonsCUDA: Factor out and re-use `block_reduce` function...
2026-01-30 Jeff Bolzvulkan: Check maxStorageBufferRange in supports_op...
2026-01-30 Daniel BeveniusCUDA : fix typo in clang pragma comment [no ci] (llama...
2026-01-30 Ruben Ortlamvulkan: work around Intel fp16 bug in mmq (llama/18814)
2026-01-30 Perry Naseckggml-metal: do not copy headers for embedded, use curre...
2026-01-30 yuloHIP: add fattn-mma-f16 for RDNA4 (llama/18481)
2026-01-21 Bráulio Oliveiraexamples : use -dev/--device and WHISPER_ARG_DEVICE...
2026-01-16 Yshtolawhisper : Fix UTF-8 character boundary issue in segment...
2026-01-15 Georgi Gerganovrelease : v1.8.3 upstream/1.8.3
2026-01-15 Georgi Gerganovbenches : update
2026-01-14 Georgi Gerganovsync : ggml
2026-01-14 Georgi GerganovCUDA : fix unused argument when USE_CUDA_GRAPH=OFF...
2026-01-14 Jeff Bolzvulkan: change memory_logger to be controlled by an...
2026-01-14 Jeff Bolzvulkan: Use VK_EXT_shader_64bit_indexing to handle...
2026-01-14 Ruben Ortlamvulkan: Disable large coopmat matmul configuration...
2026-01-14 Ruben OrtlamVulkan: Optimize Matmul parameters for AMD GPUs with...
2026-01-14 Georgi Gerganovtalk-llama : sync llama.cpp
2026-01-14 Georgi Gerganovsync : ggml
2026-01-14 shaofeiqiopencl: add SOFTPLUS op support (llama/18726)
2026-01-14 Johannes GäßlerHIP: adjust RDNA3.5 MMQ kernel selction logic (llama...
2026-01-14 Perry Naseckcmake : update blas logic (llama/18205)
2026-01-14 Michael WandCorrected: changed s13 = src1->nb[3] instead of nb...
2026-01-14 shaofeiqiopencl: add EXPM1 op (llama/18704)
2026-01-14 Reese LevineUpdates to webgpu get_memory (llama/18707)
2026-01-14 Aaron Teollama: use host memory if device reports 0 memory ...
2026-01-14 Masashi Yoshimuraggml-webgpu: Fix GGML_MEM_ALIGN to 8 for emscripten...
2026-01-14 Reese Levineggml webgpu: initial flashattention implementation...
2026-01-14 Jeff Bolzvulkan: fix push constant size for quantize_q8_1 (llama...
2026-01-14 Jeff Bolzvulkan: optimize ssm_scan (llama/18630)
2026-01-14 도로로도로또metal : add MoE kernel specialization for ne20=5 (llama...
2026-01-14 Doctor Shotgunggml: add env var GGML_OP_OFFLOAD_MIN_BATCH (llama...
2026-01-14 shaofeiqiopencl: add FILL op support (llama/18682)
2026-01-14 Oliver Walshcuda : fix build on cuda 12.8 (llama/18672)
2026-01-14 Jeff Bolzvulkan: reject ops when a tensor is too large to alloca...
2026-01-14 virajwadvulkan: Warptile tuning for Intel Xe2/Xe3 (llama/18178)
2026-01-14 Evevulkan: more mul mat optimizations (llama/18533)
2026-01-14 hipuddingCANN: Fix rename for get_env (llama/18652)
2026-01-14 Raul TorresCANN: Rename `get_env` to `get_env_as_lowercase` (llama...
2026-01-14 Max KrasnyanskyHexagon add support for f16/f32 flash attention, scale...
2026-01-14 Aadeshveer... ggml : optimize cuda ssm_scan using warp-level reductio...
2026-01-14 Jeff Bolzvulkan: support buffer_from_host_ptr (llama/18467)
2026-01-14 Aman Guptaggml-cuda: refactor cuda graph usage (llama/18637)
2026-01-14 Beinseziimmq.cu: tune mmq/rocblas switching for RDNA (llama...
2026-01-14 Adrien Gallouëtggml : fix avx512bf16 build (llama/18623)
2026-01-14 Raul TorresCANN: Make `valid_values` variable `static const` ...
2026-01-14 nwyinggml webgpu: add CEIL operation support (llama/18605)
2026-01-14 Johannes GäßlerCUDA: fix FA FP16 accumulator overflow for Granite...
2026-01-14 Aman Guptaggml-cuda: check for srcs outside the cgraph (llama...
2026-01-14 Jeff Bolzvulkan: fix topk_moe_sigmoid_norm_bias failures in...
2026-01-14 Jeff Bolzvulkan: handle quantize_q8_1 overflowing the max workgr...
2026-01-14 Chenguang LiCANN: add operator fusion support for ADD + RMS_NORM...
2026-01-14 Daniel Beveniussampling : add support for backend sampling (llama...
2026-01-14 Aman GuptaCUDA: disable cuda graph when using n-cpu-moe (llama...
2026-01-14 Aman Guptaggml-cuda: remove unused params in ggml_cuda_graph...
2026-01-14 Aman Guptaggml-cuda: fixes for concurrent streams (llama/18496)
2026-01-14 Johannes GäßlerCUDA: only allocate FA tmp buffer if needed (llama...
2026-01-14 pl752(Bugfix, ggml-cuda) Pool alloc count fix + small size...
2026-01-14 Shouyuggml-hexagon: optimize activation function (llama/18393)
2026-01-14 Jeff Bolzvulkan: Optimize GGML_OP_CUMSUM (llama/18417)
2026-01-14 Jeff Bolzvulkan: Implement mmvq for iq1_s/iq1_m (llama/18450)
2026-01-14 Georgi Gerganovmetal : adjust extra size for FA buffer to avoid reallo...
2026-01-14 Chris Rohlfrpc : use unordered_map::reserve and emplace (llama...
2026-01-14 MeeMincuda : fix copy of large tensors (ggml_nbytes <= INT_MA...
2026-01-14 Aman Guptaggml-cuda: remove unneccesary prints on ggml_cuda_init...
2026-01-14 Jeff Bolzvulkan: extend topk_moe to handle sigmoid w/exp_probs_b...
2026-01-13 Peter A.examples : fix executable example targets (#3600)
2026-01-05 KITAITI Makotoruby : fix segmentation fault (#3591)
2025-12-31 Georgi Gerganovsync : ggml
2025-12-31 Georgi Gerganovggml : bump version to 0.9.5 (ggml/1410)
2025-12-31 Georgi Gerganovtalk-llama : sync llama.cpp
2025-12-31 Georgi Gerganovsync : ggml
2025-12-31 gatbontonpcmetal : add count_equal op (llama/18314)
2025-12-31 Johannes GäßlerCUDA: fix KQ max calculation (llama/18487)
2025-12-31 Georgi Gerganovmetal : remove BF16 x F16 kernels (llama/18456)
2025-12-31 Aman Guptasycl: add newline at the end of CMakeLists.txt (llama...
2025-12-31 Rahul SatheWork around broken IntelSYCLConfig.cmake in Intel oneAP...
2025-12-31 Charles Xukleidiai: add and integrate SVE 256-bit vector-length...
2025-12-31 Aman GuptaCUDA: add log line when mxfp4 acceleration is used...
2025-12-31 Johannes GäßlerCUDA: fix replacment of bad archs in CMake (llama/18457)
2025-12-31 Johannes GäßlerCUDA: Blackwell features for non-native builds (llama...
2025-12-31 Aman Guptacuda: fix race condition in cumsum (llama/18448)
2025-12-31 uvosHIP: Use mmq on MFMA devices for MUL_MAT_ID in cases...
2025-12-31 Aman GuptaRevert "ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if...
2025-12-31 o7sirpc: fix segfault on invalid endpoint format (llama...
2025-12-31 Boian Berberovcmake: Added more x86_64 CPU backends when building...
2025-12-31 QDeltaggml-cuda: use CMAKE_CUDA_ARCHITECTURES if set when...
2025-12-31 lhezopencl: allow resizing transpose buffers (llama/18384)
2025-12-31 Aman Guptaggml-cuda: Use same regex for GGML_NATIVE=OFF (llama...
2025-12-31 Jeff Bolzvulkan: preprocess mul_mat_id experts and discard workg...
next