]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
pkg/ggml/sources/ggml
2025-12-31 Ruben Ortlamvulkan: use fewer FA rows for small cache runs (llama...
2025-12-31 TianHao324CANN: Uses yarn_ramp cache in ROPE (llama/17725)
2025-12-31 Chris Rohlfrpc : add check for rpc buffer type (llama/18242)
2025-12-31 nullnameggml-hexagon: create generalized functions for cpu...
2025-12-31 Shouyuggml-hexagon: gelu optimization (llama/18151)
2025-12-31 Taimur Ahmadllamafile: add rvv support for sgemm kernels (llama...
2025-12-31 lhezopencl: unpack q4_0 for adreno in get_tensor (llama...
2025-12-31 Jeff Bolzvulkan: Extend rope fusions to allow mrope (llama/18264)
2025-12-31 Jeff Bolzvulkan: Implement set_tensor_async and the event interf...
2025-12-31 Johannes Gäßlerllama: fix RPC for -fit on (llama/18233)
2025-12-31 Jeff Bolzvulkan: fix im2col overflowing maxworkgroupcount (llama...
2025-12-31 Jeff Bolzvulkan/cuda: fix topk_moe with exp_probs_b (llama/18071)
2025-12-31 Jeff Bolzvulkan: support GGML_UNARY_OP_XIELU (llama/18062)
2025-12-31 Jeff Bolzvulkan: in graph_optimize, try to group ADD operations...
2025-12-31 lovedheartVulkan: some improvement on mul_mat_iq2_xs (llama/18031)
2025-12-31 Jeff Bolztests: Avoid floating point precision false positives...
2025-12-31 Jeff Bolztest-backend-ops: improve msvc build time (llama/18209)
2025-12-31 Aadeshveer... Added comments explaining thread block size selection...
2025-12-31 Alfredggml-hexagon: Implement true Q8_0 quantization on Hexag...
2025-12-31 Jeff Bolzvulkan: Add perf logger mode with concurrency (llama...
2025-12-31 Xuan-Son Nguyenmodel : add ASR support for LFM2-Audio-1.5B (conformer...
2025-12-31 Taimur Ahmadggml-cpu: extend support for RVV floating-point kernels...
2025-12-31 yuloremove i_major_dual (llama/18157)
2025-12-31 Shouyuggml-hexagon: swiglu_oai operation (llama/18114)
2025-12-31 Shouyuggml-hexagon: gelu operation (llama/17921)
2025-12-31 Alberto Cabrera... ggml-cpu: ARM64: repack version of q8_0 (dotprod and...
2025-12-31 yuloHIP: Refactor mma for RDNA and CDNA (llama/17990)
2025-12-17 Georgi Gerganovsync : llama.cpp upstream/0.9.4.395
2025-12-17 Naco Sirenllama.android : Rewrite Android binding (w/o cpu_featur...
2025-12-17 Aadeshveer... ggml : use WARP_SIZE/2 for argmax reduction offset...
2025-12-17 Shouyuggml-hexagon: mm for mtmd (llama/17894)
2025-12-17 Jeremy Demeulemetal: use shared buffers on eGPU (llama/17866)
2025-12-17 Johannes Gäßlerllama: automatically set parameters not set by the...
2025-12-17 Neo Zhang JianyuSupport gpt-oss by OPs add-id, mul_mat for mxfp4, swigl...
2025-12-17 Ruben Ortlamvulkan: fix mul_mat_vec_iq1_s formatting (llama/18026)
2025-12-14 Georgi Gerganovsync : llama.cpp
2025-12-14 Jeff Bolzvulkan: Fix data race/hang in scalar/cm1 flash attentio...
2025-12-14 lovedheartvulkan: improve mul_mat_vec_iq1_s speed (llama/17874)
2025-12-14 Evevulkan: faster q6_k matmul (llama/17813)
2025-12-14 Georgi Gerganovggml : arm repack fix build (llama/0)
2025-12-14 Jeff Bolzvulkan: support get_rows for i32 (llama/17941)
2025-12-14 Jeff Bolzvulkan: support GGML_OP_DIAG (llama/17893)
2025-12-14 Jeff Bolzvulkan: Multi-pass softmax for large number of cols...
2025-12-14 Jeff Bolzvulkan: Allow non-pow2 n_experts in topk_moe (llama...
2025-12-14 Johannes GäßlerCUDA: fix overflow in MMA kernel without stream-k ...
2025-12-14 Sigbjørn Skjæretcann : fix ops broken by circular padding guard (llama...
2025-12-14 ixgbeggml-cpu : fix RISC-V Q4_0 repack select and RVV featur...
2025-12-14 yuloHIP: enable mmf for RDNA3 (llama/17879)
2025-12-14 Piotr Wilkin... SOLVE_TRI extension to more dimensions (llama/17793)
2025-12-13 Georgi Gerganovsync : whisper.cpp
2025-12-13 Georgi Gerganovggml : arm repack fix build (whisper/0)
2025-12-12 Congcong Caicmake : set `CMAKE_RUNTIME_OUTPUT_DIRECTORY` for non...
2025-12-12 Copilotci : remove retired macOS-13 runners from CI (#1397)
2025-12-11 Georgi Gerganovsync : llama.cpp
2025-12-11 Georgi Gerganovggml-alloc : fix reuse-parent logic for misaligned...
2025-12-11 nullnameggml-hexagon: fix `rope` failure at `test-backend-ops...
2025-12-11 Max KrasnyanskyFix race conditions in threadpool when dealing with...
2025-12-11 Georgi Gerganovggml : remove GGML_KQ_MASK_PAD constant (llama/17910)
2025-12-11 Sigbjørn Skjæretcuda : add missing support check for xielu (llama/17895)
2025-12-11 Johannes GäßlerCUDA: fix unpadded strides in MMA FA kernel (llama...
2025-12-11 Neo Zhang Jianyufix softmax for iGPU (llama/17838)
2025-12-11 Gabe Goodhartmetal: SSM kernel improvements (llama/17876)
2025-12-11 Piotr Wilkin... Add DIAG for CUDA (llama/17873)
2025-12-11 Gabe Goodhartggml : Provide macos-specific backtrace printing to...
2025-12-11 Georgi Gerganovmetal : print node names for debugging (llama/17882)
2025-12-11 Sigbjørn Skjæretggml : allow fill node alloc inplace (llama/17870)
2025-12-11 Chenguang LiCANN: add support for partial RoPE and Vision mode...
2025-12-11 Johannes GäßlerCUDA: fix FP16 overflow in tile FA kernel (llama/17875)
2025-12-11 Jay Zenithcuda : add FILL op support (llama/17851)
2025-12-11 wsbagnsv1cuda: optimize SOLVE_TRI using registers and FMAF ...
2025-12-11 ixgbeggml-cpu: add ggml_thread_cpu_relax with Zihintpause...
2025-12-11 lovedheartVulkan: improve mul_mat_vec_iq1_m (llama/16907)
2025-12-11 Law Po Yingsycl: add missing BF16 conversion support for Intel...
2025-12-11 Jeff Bolzvulkan: perf_logger improvements (llama/17672)
2025-12-11 Vishal Singhggml-zendnn : add ZenDNN backend for AMD CPUs (llama...
2025-12-11 Phylliida Devggml : add circular tiling support to pad, for Vulkan...
2025-12-11 Johannes GäßlerHIP: fix RDNA3 FP16/BF16 matrix multiplication (llama...
2025-12-11 Skyggml : improve error handling for search path existence...
2025-12-11 Jeff Bolzvulkan: Use one row per workgroup for f32 mmv (llama...
2025-12-11 Jeff Bolzvulkan: support solve_tri with larger N/K values (llama...
2025-12-11 Georgi Gerganovmetal : fix build(#17799)
2025-12-11 Masato Nakasakavulkan: Replace deprecated VK_EXT_validation_features...
2025-12-11 Masato Nakasakavulkan: Fix mismatch in TOPK_MOE unit test (llama/17541)
2025-12-11 Jeff Bolzvulkan: add more num_blocks instantiations in rms_norm...
2025-12-11 Jeff Bolzvulkan: fix top_k bug when there are ties in the input...
2025-12-11 Aclyvulkan : support conv-2d with large output size (llama...
2025-12-11 Reese Levineggml webgpu: unary op suppport, code refactoring, ops...
2025-12-11 Jeff Bolzvulkan: enable mmvq for q2_k on NVIDIA (llama/17675)
2025-12-11 Jeff Bolzvulkan: set all memory allocations to high priority...
2025-12-11 Georgi Gerganovrpc : fix alloc size logic (llama/17116)
2025-12-11 Georgi Gerganovmetal : add residency sets keep-alive heartbeat (llama...
2025-12-11 Johannes GäßlerHIP : fix RDNA4 build (llama/17792)
2025-12-11 shalinib-ibmQ4/Q8 Tiled Gemm Optimization. (llama/16999)
2025-12-11 Johannes GäßlerCUDA: fix FA VKQ accumulator overflow (llama/17746)
2025-12-11 Jiacheng (Jason... HIP: enable WMMA-MMQ INT kernels for RDNA 3 (llama...
2025-12-11 Piotr Wilkin... Add support for CUMSUM and TRI for CUDA. (llama/17584)
2025-12-11 Gabe Goodhartmetal: TRI, FILL, EXPM1, SOFTPLUS (llama/16623)
2025-12-11 Alberto Cabrera... ggml-cpu : remove asserts always evaluating to false...
2025-12-11 Georgi Gerganovmetal : use params per pipeline instance (llama/17739)
2025-12-11 Adrien Gallouëtbuild : move _WIN32_WINNT definition to headers (llama...
next