]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
pkg/ggml/sources/ggml
2025-12-11 Alberto Cabrera... ggml-cpu : remove asserts always evaluating to false...
2025-12-11 Georgi Gerganovmetal : use params per pipeline instance (llama/17739)
2025-12-11 Adrien Gallouëtbuild : move _WIN32_WINNT definition to headers (llama...
2025-12-11 Herman Semenoffggml-cpu: remove duplicate conditional check 'iid'...
2025-12-11 Johannes GäßlerCUDA: generalized (mma) FA, add Volta support (llama...
2025-12-11 Georgi Gerganovmetal : fix data race in pipeline library (llama/17731)
2025-12-11 Reese Levineggml webgpu: add support for emscripten builds (llama...
2025-12-11 Jeff Bolzvulkan: Reduce temporary memory usage for TOP_K (llama...
2025-12-11 xiaobing318cmake : add utf8 compilation options for msvc (llama...
2025-12-11 Adrien Gallouëtggml : use svcntb() for SVE vector length detection...
2025-12-11 TianHao324CANN: Disable Ger operator of OUT_PROD on 310p device...
2025-12-11 Daniel Beveniusggml : remove redundant n_copies check when setting...
2025-12-11 Adrien Gallouëtggml : add fallback definition for HWCAP2_SVE2 (llama...
2025-12-11 Aman Guptaggml-cuda: reorder only relevant nodes (llama/17639)
2025-12-11 Neo Zhang Jianyuenhance argsort for UT (llama/17573)
2025-12-11 Georgi Gerganovmetal : add FA head size 48 (llama/17619)
2025-12-11 Georgi Gerganovggml : extend the GGML_SCHED_NO_REALLOC debug logic...
2025-12-11 Aman Guptallama-graph: avoid expand_forward for fusion (llama...
2025-12-11 Tarek Dakhranmodel: LFM2-VL fixes (llama/17577)
2025-12-11 Gilad S.ggml: fix: macOS build with `-DGGML_BACKEND_DL=ON`...
2025-12-11 Aman GuptaCUDA: add stream-based concurrency (llama/16991)
2025-12-11 Mahekk Shaikhcuda : add error checking for cudaMemcpyAsync in argsor...
2025-12-11 Aclyvulkan : fix FA mask load with bounds check (coopmat2...
2025-12-11 Neo Zhangsycl : support to malloc memory on device more than...
2025-12-11 ixgbeggml: replace hwcap with riscv_hwprobe for RVV detectio...
2025-12-11 Ruben OrtlamVulkan: MMVQ Integer Dot K-Quant and MUL_MAT_ID support...
2025-12-11 Jeff Bolzvulkan: improve topk perf for large k, fix overflow...
2025-12-11 Diego Devesaggml : add GGML_SCHED_NO_REALLOC option to disable...
2025-12-11 R0CKSTARenable fp16/fast_fp16/bf16_mma on PH1 (llama/17551)
2025-12-11 Aman Guptaggml-cuda: add stricter checking for fusion (llama...
2025-12-11 Piotr Wilkin... model : Qwen3 Next (llama/16095)
2025-12-11 Johannes GäßlerCUDA: no FP16 arithmetic for vector FA kernel (llama...
2025-12-11 Jeff Bolzvulkan: Implement GGML_OP_TRI (llama/17503)
2025-12-11 Radoslav Gerganovrpc : cache and reuse compute graphs (llama/15405)
2025-12-11 yuloHIP: enable mul_mat_f for RDNA4 (llama/17437)
2025-12-11 Piotr Wilkin... SOLVE_TRI CUDA kernel for small matrices (llama/17457)
2025-12-11 Neo Zhang Jianyurefactor pad_reflect_1d to make the UT case pass (llama...
2025-12-11 Jeff Bolzvulkan: Implement SOLVE_TRI (llama/17486)
2025-12-11 matt23654cuda : fix UMA detection on discrete GPUs. (llama/17537)
2025-12-11 Alberto Cabrera... ggml-cpu: aarm64: q4_K repack gemm and gemv implementat...
2025-12-11 Aclyvulkan : move contiguous checks to device_supports_op...
2025-12-11 Jeff Bolzvulkan: use a fixed 1KB buffer for the add_rms_fusion...
2025-12-11 lhezopencl: add sqr, sqrt, mean and ssm_conv (llama/17476)
2025-12-11 Alberto Cabrera... Fix chunks being too small with small matrix sizes...
2025-12-11 Jeff Bolzvulkan: allow graph_optimize for prompt processing...
2025-12-11 Jeff Bolzvulkan: Implement top-k (llama/17418)
2025-12-11 xctanggml-cpu : add RISC-V Zvfh impl for ggml_vec_mad_f16...
2025-12-11 Adrien Gallouëtggml : fix ARM feature verification (llama/17519)
2025-12-11 Jiacheng (Jason... HIP: Patch failed testcase in WMMA-MMQ kernels for...
2025-12-11 hipuddingCANN: Add MROPE and IMROPE support (llama/17401)
2025-12-11 Jeff Bolzvulkan: Implement GGML_OP_CUMSUM (llama/17479)
2025-12-11 Georgi Gerganovggml : add ggml_top_k (llama/17365)
2025-12-11 TianHao324CANN: supports out_prod operator for F32 and F16 (llama...
2025-12-11 Jeff Bolzvulkan: Use fewer rows for scalar FA when HS is not...
2025-12-11 Jeff Bolzvulkan: more FA details in vk_perf_logger (llama/17443)
2025-12-11 Jiacheng (Jason... HIP: WMMA-MMQ kernels for RDNA 4 (llama/17156)
2025-12-11 Alberto Cabrera... ggml-cpu: arm64: q4_K repack gemm and gemv implementati...
2025-12-11 ixgbeggml: add RISC-V cpu-feats (llama/17461)
2025-12-11 Max Krasnyanskyhexagon: add support for ROPE_NEOX (llama/17458)
2025-12-11 Raul TorresCANN: Define `cann_graph_update_required` before macro...
2025-12-11 M. Mediouniggml-hexagon: Initial Hexagon v68/v69 support (llama...
2025-12-11 nullnameggml-hexagon: add `hex_supported_buffer` for better...
2025-12-11 Sigbjørn Skjæretcuda : support non-contiguous i32 to i32 copy (llama...
2025-12-11 Jeff Bolzvulkan: remove a couple unnecessary switches (llama...
2025-12-11 Masato NakasakaRevive MUL_MAT_ID to perf testing (llama/17397)
2025-12-11 yuloHIP: RDNA4 tensor core support for MMF (llama/17077)
2025-12-11 lhezopencl: refine condition for kqv mm (llama/17392)
2025-12-11 Jeff Bolzvulkan: disable async for older Intel devices (llama...
2025-12-11 Raul TorresCANN: Refactor `evaluate_and_capture_cann_graph` (llama...
2025-12-11 nullnameggml-hexagon: fix swiglu failure at `test-backend-ops...
2025-12-11 Piotr Wilkin... ggml : Fix transposed SOLVE_TRI result (llama/17323)
2025-12-11 Scott FudallyDGX Spark: UMA support (llama/17368)
2025-12-11 Adrien Gallouëtggml : remove useless and error-prone variadic macros...
2025-12-11 sudhiarmkleidiai: fix zero-size array declaration (llama/17240)
2025-12-11 ixgbeggml-cpu:add RISC-V RVV (Zvfh) optimization for FP16...
2025-12-11 Giuseppe Scrivanovulkan: implement ADD1, ARANGE, FILL, SOFTPLUS, STEP...
2025-12-11 Jeff Bolzvulkan: support larger argsort (llama/17313)
2025-12-11 Jeff Bolzvulkan: Add copy_transpose shader (llama/17371)
2025-12-11 Aman Guptacuda: fix rope fusion for gemma3 (llama/17378)
2025-12-11 Piotr Wilkin... Fix too relaxed check on CUDA "fast copy" (can_be_trans...
2025-12-11 Ruben Ortlamvulkan: force full subgroups for flash attention to...
2025-12-11 Jeremy Randggml-cpu: Don't pass -mpowerpc64 when -mcpu already...
2025-12-11 Chenguang LiCANN: fix acl_tensor_ptr usage in ASCEND_310P ROPE...
2025-12-11 Jeff Bolzvulkan: support noncontig i32 copy (llama/17328)
2025-12-11 Ruben Ortlamvulkan: add log RTE support to fix Nvidia CI (llama...
2025-12-11 Adrien Gallouëtcmake : fix ARM feature verification (llama/17170)
2025-12-11 Adrien Gallouëtggml : add missing AVX512 feature checks (llama/17270)
2025-11-24 Daniel Beveniusggml : remove dirty flag from version string (#1391)
2025-11-20 Georgi Gerganovsync : whisper.cpp
2025-11-20 YangLemetal : fix compile on macos 11 (whisper/3533)
2025-11-17 Georgi Gerganovsync : llama.cpp
2025-11-17 Georgi Gerganovmetal : support I32 -> I32 copy (llama/17317)
2025-11-17 Georgi Gerganovmetal : faster argsort (llama/17315)
2025-11-17 Georgi Gerganovmetal : add cumsum (llama/17305)
2025-11-17 hipuddingCANN: Use smart pointers to manage ACL objects (llama...
2025-11-17 Pavels Zaicenkovsvulkan: add LOG operation support for F32 and F16 ...
2025-11-17 Ruben Ortlamvulkan: fix MMQ quantize_y condition (llama/17301)
2025-11-17 Georgi Gerganovmetal : remove obosolete asserts (llama/17295)
2025-11-17 lhezopencl: fix rms_norm_mul (llama/17250)
2025-11-17 shaofeiqiopencl: add kernel to handle mat mul in attention to...
next