]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
pkg/ggml/sources/ggml
2025-03-04 mgroeber9110ggml : portability fixes for VS 2017 (llama/12150)
2025-03-04 David HuangHIP: implement FlashAttention via rocWMMA for CDNA...
2025-03-04 Diego Devesatest-backend-ops : add option -p to filter by op params...
2025-03-04 ag2s20150909ggml : fix kleidiai build (llama/12159)
2025-03-04 Akarshan BiswasSYCL: Move CPY kernels to a separate file and add few...
2025-03-04 Diego Devesaggml-backend : keep paths in native string type when...
2025-03-04 Erik ScholzCUDA: compress mode option and default to size (llama...
2025-03-04 William Tambelliniggml : upgrade init_tensor API to return a ggml_status...
2025-03-04 Rémy Ovulkan: add specific MMV kernels for IQ2 and IQ3 quants...
2025-03-04 Johannes GäßlerCUDA: fix logic for V100 + GGML_CUDA_FORCE_MMQ (llama...
2025-03-04 Prashant Vithuleggml: aarch64: implement SVE kernels for q2_k_q8_k...
2025-03-04 hipuddingCANN: Fix build error with GCC 13 (llama/11990)
2025-03-04 Evevulkan: matmul dequantization improvements (llama/12015)
2025-03-04 Danielevulkan: improve im2col (llama/11826)
2025-03-04 Vladimir Vuksanoviccmake: Fix ggml backend dependencies and installation...
2025-03-04 Jeff Bolzvulkan: fix assertion when qy_needs_dequant (llama...
2025-03-04 Molly Sophiaggml-cpu: Fix build with sve (llama/12059)
2025-03-03 cmdr2cuda: unary ops as float + de-duplicate (#1130)
2025-02-28 cmdr2cuda/vulkan: specify fp32-only support for some operati...
2025-02-28 cmdr2cuda/cpu: Increase support for fp16 unary operations...
2025-02-27 Georgi Gerganovsync : whisper.cpp
2025-02-27 Diego Devesawhisper : support GGML_BACKEND_DL (whisper/2843)
2025-02-27 Georgi Gerganovci : fix workflow name
2025-02-27 Georgi Gerganovexamples : remove dr_wab.h (#1127)
2025-02-27 Georgi Gerganovsync : whisper.cpp
2025-02-27 Georgi Gerganovcommon : separate whisper sources (whisper/2846)
2025-02-27 Georgi Gerganovcommon : fix build min/max (whisper/2845)
2025-02-27 Dmitry Atamanovexamples : use miniaudio for direct decoding flac,...
2025-02-27 midnightcmake : fix compile assumptions for power9/etc (whisper...
2025-02-26 petterreinholdtsenTold cmake to install ggml-cpp.h as a public header...
2025-02-25 cmdr2Support pure float16 add/sub/mul/div operations in...
2025-02-25 Georgi Gerganovsync : llama.cpp
2025-02-25 Gian-Carlo... metal : copy kernels for quant to F32/F16 conversions...
2025-02-25 lhezopencl: fix for small models (llama/11950)
2025-02-25 Neo Zhang JianyuOptimize mul_mat for Q4_0 on Intel GPU (llama/12035)
2025-02-25 Akarshan BiswasSYCL: Fix GGML_SYCL_DEBUG macro (llama/11995)
2025-02-25 Aaron Teoggml-cpu: Support s390x SIMD Instruction Set (llama...
2025-02-25 Johannes GäßlerCUDA: app option to compile without FlashAttention...
2025-02-25 Johannes GäßlerCUDA: optimize FA for GQA + large batches (llama/12014)
2025-02-25 Gian-Carlo... cuda: Add Q5_1, Q5_0, Q4_1 and Q4_0 to F32 conversion...
2025-02-25 PureJourneyCUDA: correct the lowest Maxwell supported by CUDA...
2025-02-25 BodhiMUSA: support ARM64 and enable dp4a .etc (llama/11843)
2025-02-25 Charles Xuggml-cpu: Add CPU backend support for KleidiAI library...
2025-02-25 Prashant Vithuleggml: aarch64: implement SVE kernels for q3_K_q8_K...
2025-02-25 Johannes GäßlerCUDA: use async data loading for FlashAttention (llama...
2025-02-25 Rémy Ovulkan: implement several ops relevant for ggml_opt...
2025-02-25 Jeff Bolzvulkan: support multi/vision rope, and noncontiguous...
2025-02-25 Hale Chanmetal : fix the crash caused by the lack of residency...
2025-02-25 Adrian Kretzmetal : optimize dequant q6_K kernel (llama/11892)
2025-02-25 Georgi Gerganovrepo : update links to new url (llama/11886)
2025-02-25 Rémy Ovulkan: initial support for IQ1_S and IQ1_M quantizatio...
2025-02-25 lhezopencl: Fix rope and softmax (llama/11833)
2025-02-25 Diego Devesacuda : add ampere to the list of default architectures...
2025-02-25 Jinyang Heggml: optimize some vec dot functions for LoongArch...
2025-02-25 Evevulkan: linux builds + small subgroup size fixes (llama...
2025-02-25 Jeffrey Morganllamafile: use member variable instead of constant...
2025-02-25 R0CKSTARmusa: bump MUSA SDK version to rc3.1.1 (llama/11822)
2025-02-25 Diego Devesaggml-cpu : add chunking support to mul_mat_id (llama...
2025-02-25 Xuan-Son Nguyenggml : x2 speed for WASM by optimizing SIMD (llama...
2025-02-25 uvosHIP: Remove GCN from list of devices that avoid MMQ...
2025-02-12 Georgi Gerganovsync : llama.cpp upstream/0.0.1722
2025-02-12 uvosHIP: Switch to std::vector in rocblas version check...
2025-02-12 bandoticleanup: fix compile warnings associated with gnu_print...
2025-02-12 Richardggml : fix multi-threaded clamp_f32 (llama/11824)
2025-02-12 Weizhao Ouyangggml-cpu: Fix duplicate MATMUL_INT8 (llama/11817)
2025-02-12 Johannes GäßlerCUDA: fix CUDART_VERSION checks (llama/11821)
2025-02-12 Sheldon RobinsonFix #11802: Compile bug - RegQueryValueExA changed...
2025-02-12 Johannes GäßlerCUDA: use arch list for compatibility check (llama...
2025-02-12 Maxim Evtushfix: typos in documentation files (llama/11791)
2025-02-12 Danny Milosavljevicvulkan: Make Vulkan optional at runtime (#11493). ...
2025-02-12 Wagner Brunavulkan: add environment variable GGML_VK_PREFER_HOST_ME...
2025-02-12 Jeff Bolzvulkan: account for lookup tables when checking shared...
2025-02-12 Karol Kontnyggml: Fix data race in ggml threadpool (llama/11736)
2025-02-12 Johannes GäßlerCUDA: fix min. version for movmatrix (llama/11751)
2025-02-12 Jeff Bolzvulkan: print shared memory size (llama/11719)
2025-02-12 Akarshan BiswasSYCL: remove XMX info from print devices (llama/11712)
2025-02-12 Jinyang Heggml : optimize and build warning fix for LoongArch...
2025-02-12 Akarshan BiswasSYCL: Adjust support condition for norm operators ...
2025-02-12 junchao-zhaoggml : fix LoongArch compile error with 128-bit SIMD...
2025-02-12 Jeff Bolzvulkan: optimize coopmat2 iq2/iq3 callbacks (llama...
2025-02-12 Rémy Ovulkan: initial support for IQ4_XS quantization (llama...
2025-02-12 Jeff Bolzvulkan: use smaller combined allocations to avoid fragm...
2025-02-12 Charles Duffymetal : avoid breaking build when metal API predates...
2025-02-12 Georgi Gerganovmetal : adjust support conditions for norm operators...
2025-02-12 Johannes GäßlerCUDA: support for mat. mul. with ne03 != ne13 (llama...
2025-02-12 Johannes GäßlerCUDA: non-contiguous (RMS) norm support (llama/11659)
2025-02-12 fxzjshmHIP: force max threads per block to be 1024 (llama...
2025-02-12 Jhen-Jie Hongmetal : use residency set for other platforms (llama...
2025-02-09 Shang Yuanchuncmake : fix CPU detection on loongarch64 in tests ...
2025-02-08 Georgi Gerganovreadme : remove transfer notice (#1107)
2025-02-07 Shawn yangfix a bug in examples/simple/simple-backend (#1078)
2025-02-06 Patrick Pengrpc: fix known RCE in rpc-server (#1103)
2025-02-04 Georgi Gerganovreadme : add info about repository transfer
2025-02-04 Georgi Gerganovauthors : update upstream/0.0.1689
2025-02-04 Georgi Gerganovsync : whisper.cpp
2025-02-03 Christian Kastnercmake: Add ability to pass in GGML_BUILD_NUMBER (#1096)
2025-02-03 Georgi Gerganovsync : llama.cpp
2025-02-03 Johannes GäßlerCUDA: fix Volta FlashAttention logic (llama/11615)
2025-02-03 Georgi Gerganovsync : llama.cpp
2025-02-03 Johannes GäßlerHIP: fix flash_attn_stream_k_fixup warning (llama/11604)
next