]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
pkg/ggml/sources/ggml
2025-09-05 Johannes GäßlerCUDA: replace GGML_CUDA_F16 with CUDA arch checks ...
2025-09-05 Jeff Bolzvulkan: shorten pipeline name strings (llama/15431)
2025-09-05 R0CKSTARmusa: fix build warnings (llama/15258)
2025-09-05 lhezopencl: mark `argsort` unsupported if cols exceed workg...
2025-09-05 SHUAI YANGCANN: optimize rope operator (llama/15335)
2025-09-05 R0CKSTARmusa: handle __hgt2_mask, available starting from MUSA...
2025-09-05 Marvin Gießingggml-cpu: add mxfp4 VSX intrinsics for Power9+ (ppc64le...
2025-08-28 Daniel Beveniusci : add github release job (#1334)
2025-08-18 Georgi Gerganovcuda : remove obsolete sources (#1332) upstream/0.0.2471
2025-08-18 Georgi Gerganovsync : whisper.cpp
2025-08-18 Georgi Gerganovscripts : update sync scripts
2025-08-18 Reese Levineggml: Add initial WebGPU backend (llama/14521)
2025-08-18 Aaron Teoggml : initial zDNN backend (llama/14975)
2025-08-18 Georgi Gerganovscripts : update sync scripts
2025-08-18 Georgi Gerganovcommon : handle mxfp4 enum
2025-08-18 Georgi Gerganovsync : llama.cpp
2025-08-18 compiladeggml-quants : fix make_qp_quants NANs and IQ1 assertion...
2025-08-18 Jeff Bolzvulkan: disable spirv-opt for bfloat16 shaders (llama...
2025-08-18 Jeff Bolzvulkan: Use larger workgroups for mul_mat_vec when...
2025-08-18 Dong Won Kimvulkan: support sqrt (llama/15370)
2025-08-18 Jeff Bolzvulkan: Optimize argsort (llama/15354)
2025-08-18 Jeff Bolzvulkan: fuse adds (llama/15252)
2025-08-18 Jeff Bolzvulkan: Support mul_mat_id with f32 accumulators (llama...
2025-08-18 Jeff Bolzvulkan: Add missing bounds checking to scalar/coopmat1...
2025-08-18 rmatifOpenCL: add initial FA support (llama/14987)
2025-08-18 lhezopencl: add initial mxfp4 support via mv (llama/15270)
2025-08-18 Georgi Gerganovvulkan : fix out-of-bounds access in argmax kernel...
2025-08-18 Georgi Gerganovvulkan : fix compile warnings on macos (llama/15340)
2025-08-18 Aaron Teoggml: initial IBM zDNN backend (llama/14975)
2025-08-18 Johannes Gäßlertest-opt: fix backend support check (llama/15317)
2025-08-18 Johannes GäßlerCUDA: fix negative KV_max values in FA (llama/15321)
2025-08-18 uvosHIP: Cleanup hipification header (llama/15285)
2025-08-18 Jeff Bolzvulkan: perf_logger improvements (llama/15246)
2025-08-14 Jason Niggml: fix ggml_conv_1d_dw bug (#1323) upstream/0.0.2446
2025-08-14 Georgi Gerganovmnist : adapt to opt changes
2025-08-14 Georgi Gerganovtests : remove unused includes (#0)
2025-08-14 Georgi Gerganovsync : llama.cpp
2025-08-14 Sigbjørn Skjæretcuda : fix GGML_CUDA_GRAPHS=OFF (llama/15300)
2025-08-14 Jonathan Graehlfinetune: SGD optimizer, more CLI args (llama/13873)
2025-08-14 uvosHIP: bump requirement to rocm 6.1 (llama/15296)
2025-08-14 Georgi Gerganovsync : llama.cpp
2025-08-14 Juddggml : update `ggml_rope_multi` (llama/12665)
2025-08-14 Georgi Gerganovggml : repack block_iq4_nlx8 (llama/14904)
2025-08-14 Oliver SimonsCUDA: Optimize `reduce_rows_f32` kernel, leading up...
2025-08-14 Tak-RSggml-rpc: chunk send()/recv() to avoid EINVAL for very...
2025-08-14 uvosHIP: disable sync warp shuffel operators from clr amd_w...
2025-08-14 Romain Biessysycl: Fix and disable more configurations of mul_mat...
2025-08-14 rmatifopencl: allow mixed f16/f32 `add` (llama/15140)
2025-08-14 Aman GuptaCUDA cmake: add `-lineinfo` for easier debug (llama...
2025-08-14 Chenguang LiCANN: GGML_OP_CPY optimization (llama/15070)
2025-08-14 R0CKSTARmusa: fix failures in test-backend-ops for mul_mat_id...
2025-08-14 hipuddingCANN: Add broadcast for softmax and FA (llama/15208)
2025-08-14 Charles Xukleidiai: fix unsigned overflow bug (llama/15150)
2025-08-14 David Zhaocuda: refactored ssm_scan and use CUB (llama/13291)
2025-08-14 Aman GuptaCUDA: add attention sinks for tile and wmma (llama...
2025-08-14 compiladegguf-py : add Numpy MXFP4 de/quantization support ...
2025-08-14 AN Longggml : fix field name when new ggml_backend (llama...
2025-08-14 Johannes GäßlerCUDA: attention sinks for mma FlashAttention (llama...
2025-08-14 lhezopencl: support sink in `soft_max` (attn sinks) (llama...
2025-08-14 Jeff Bolzvulkan: support fattn sinks (llama/15126)
2025-08-14 Jeff Bolzvulkan: Add env var to disable host visible vidmem...
2025-08-14 uvosHIP: add cmake option to enable compiler output of...
2025-08-14 Christian Kastnerggml: Skip backend library linking code when GGML_BACKE...
2025-08-14 Johannes GäßlerCUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (llama...
2025-08-14 rmatiffix profiling crash (llama/15072)
2025-08-14 lhezopencl: add `swiglu_oai` and `add_id` (llama/15121)
2025-08-14 Diego Devesaggml : fix fallback to CPU for ununsupported ops (llama...
2025-08-14 Chenguang LiCANN: add support for ACL Graph (llama/15065)
2025-08-14 Georgi Gerganovllama : add gpt-oss (llama/15091)
2025-08-14 Romain Biessysycl: fix mul_mat selection (llama/15092)
2025-08-14 Christian Kastnercmake: Add GGML_BACKEND_DIR option (llama/15074)
2025-08-14 Jeff Bolzvulkan: fix build when using glslang that does not...
2025-08-14 Jeff Bolzvulkan: Use coopmat2 for conv2d (llama/14982)
2025-08-14 lhezopencl: fix adreno compiler detection logic (llama...
2025-08-14 Johannes GäßlerCUDA: use mma FA kernel for gqa > 4 on RTX 4000 (llama...
2025-08-02 Georgi Gerganovsync : llama.cpp upstream/0.0.2404
2025-08-02 leejetcuda: make im2col a little faster (llama/15025)
2025-08-02 Georgi Gerganovcuda, sycl : fix batched gemm when ne02 == 1 && ne03...
2025-08-02 Jeff Bolzvulkan: coopmat2 mul_mat optimizations (llama/14934)
2025-08-02 Jeff Bolzvulkan: Support ne[3]>1 in noncontig matrix-vector...
2025-08-02 Jeff Bolzvulkan: optimizations for direct convolution (llama...
2025-08-02 Johannes GäßlerCUDA: fix MMQ nwarps for AMD with warp_size==32 (llama...
2025-08-02 lhezopencl: add f16 for `add`, `sub`, `mul`, `div` (llama...
2025-08-02 Srihari-mcwggml : Q2k interleaving implementation - x86/x64 SIMD...
2025-08-02 diannaodocker : add cann build pipline (llama/14591)
2025-08-02 Ruben OrtlamVulkan: Fix minor debug mode issues (llama/14899)
2025-08-02 hipuddingCANN: Improve loading efficiency after converting weigh...
2025-08-02 lhezopencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f3...
2025-08-02 uvosHIP: enable mfma mmq on gfx908 and gfx90a for select...
2025-08-02 Johannes GäßlerCUDA: skip masked KV slices for all FA kernels (llama...
2025-08-02 uvosHIP: remove the use of __HIP_PLATFORM_AMD__, explicitly...
2025-08-02 uvosHIP: add GGML_HIP_MMQ_MFMA option to allow disableing...
2025-08-02 uvosHIP: Ignore unsupported unroll transformation in fattn...
2025-08-02 hipuddingCANN: Add ggml_set_rows (llama/14943)
2025-08-02 Sigbjørn Skjæretcuda : add softcap fusion (llama/14907)
2025-08-02 Aman GuptaCUDA: add roll (llama/14919)
2025-08-02 Leonard Mosescutest-backend-ops : extend test case filtering (llama...
2025-08-02 xctanggml-cpu : deduplicate scalar implementations (llama...
2025-08-02 Akarshan BiswasSYCL: Add set_rows support for quantized types (llama...
2025-08-02 Johannes GäßlerCUDA: fix pointer incrementation in FA (llama/14916)
next