]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
pkg/ggml/sources/ggml
2024-12-10 Jeff Bolzvulkan: Add VK_NV_cooperative_matrix2 support for mul_m...
2024-12-08 Georgi Gerganovsync : whisper.cpp
2024-12-08 Georgi Gerganovscripts : update whisper sync
2024-12-05 Georgi Gerganovsync : llama.cpp
2024-12-05 Diego Devesaggml : add predefined list of CPU backend variants...
2024-12-05 Diego Devesaggml-cpu : fix HWCAP2_I8MM value (llama/10646)
2024-12-05 Jeff Bolzvulkan: Implement "fast divide" (mul+shift) for unary...
2024-12-05 Nicolò ScipioneSYCL : Move to compile time oneMKL interface backend...
2024-12-05 Frankie RobertsonAvoid using __fp16 on ARM with old nvcc (llama/10616)
2024-12-05 Jeff Bolzvulkan: optimize and reenable split_k (llama/10637)
2024-12-04 PABggml: add `GGML_SET` Metal kernel + i32 CPU kernel...
2024-12-03 PABggml : add `GGML_PAD_REFLECT_1D` operation (#1034)
2024-12-03 Georgi Gerganovci : update requirements.txt
2024-12-03 Georgi Gerganovci : remove opencl workflow
2024-12-03 Georgi Gerganovci : fix pip env
2024-12-03 Georgi Gerganovfiles : remove make artifacts
2024-12-03 Georgi Gerganovcommon : fix compile warning
2024-12-03 Georgi Gerganovauthors : update
2024-12-03 Georgi Gerganovsync : llama.cpp
2024-12-03 Diego Devesaggml : move AMX to the CPU backend (llama/10570)
2024-12-03 Georgi Gerganovscripts : remove amx from sync
2024-12-03 Georgi Gerganovmetal : small-batch mat-mul kernels (llama/10581)
2024-12-03 Akarshan BiswasSYCL: Fix and switch to GGML_LOG system instead of...
2024-12-03 Adrien Gallouëtggml-cpu: replace AArch64 NEON assembly with intrinsics...
2024-12-03 Evevulkan: Dynamic subgroup size support for Q6_K mat_vec...
2024-12-03 Georgi Gerganovggml : fix I8MM Q4_1 scaling factor conversion (llama...
2024-12-03 Shupei Fanggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (llama/10580)
2024-12-03 Alberto Cabrera... sycl : offload of get_rows set to 0 (llama/10432)
2024-12-03 Alberto Cabrera... sycl : Reroute permuted mul_mats through oneMKL (llama...
2024-12-03 Chenguang LiCANN: RoPE operator optimization (llama/10563)
2024-12-03 Jeff Bolzvulkan: get the first command buffer submitted sooner...
2024-12-03 Georgi Gerganovggml : remove redundant copyright notice + update authors
2024-12-03 Georgi Gerganovggml : fix row condition for i8mm kernels (llama/10561)
2024-12-03 Georgi Gerganovcmake : fix ARM feature detection (llama/10543)
2024-12-03 Shupei Fanggml-cpu: support IQ4_NL_4_4 by runtime repack (llama...
2024-12-03 Sergio Lópezkompute : improve backend to pass test_backend_ops...
2024-12-03 leo-ponyCANN: Fix SOC_TYPE compile bug (llama/10519)
2024-12-03 Chenguang LiCANN: ROPE operator optimization (llama/10540)
2024-12-03 uvosAdd some minimal optimizations for CDNA (llama/10498)
2024-12-03 Georgi Gerganovmetal : fix group_norm support condition (llama/0)
2024-12-03 Jeff Bolzvulkan: define all quant data structures in types.comp...
2024-12-03 Jeff Bolzvulkan: Handle GPUs with less shared memory (llama...
2024-12-03 Jeff Bolzvulkan: further optimize q5_k mul_mat_vec (llama/10479)
2024-12-03 Jeff Bolzvulkan: skip integer div/mod in get_offsets for batch_i...
2024-12-03 Jeff Bolzvulkan: optimize Q2_K and Q3_K mul_mat_vec (llama/10459)
2024-12-03 R0CKSTARmtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update...
2024-12-03 Jeff Bolzvulkan: fix group_norm (llama/10496)
2024-12-03 Georgi Gerganovcmake : enable warnings in llama (llama/10474)
2024-12-03 Charles Xuggml-cpu: cmake add arm64 cpu feature check for macos...
2024-12-03 Shanshan ShenCANN: Improve the Inferencing Performance for Ascend...
2024-12-03 Chenguang LiCANN: RoPE and CANCAT operator optimization (llama...
2024-12-03 Junil Kimvulkan: Fix a vulkan-shaders-gen arugment parsing error...
2024-12-03 Georgi Gerganovmetal : enable mat-vec kernels for bs <= 4 (llama/10491)
2024-12-03 Diego Devesallama : accept a list of devices to use to offload...
2024-12-03 Diego Devesaggml : add support for dynamic loading of backends...
2024-12-03 Georgi Gerganovtests : fix compile warning
2024-12-03 Georgi Gerganovmetal : minor code formatting
2024-12-03 Diego Devesaggml : do not use ARM features not included in the...
2024-12-03 leo-ponyCANN: Support Ascend310P to accelerate F32 and F16...
2024-12-03 Diego Devesacuda : optimize argmax (llama/10441)
2024-12-03 Jeff Bolzvulkan: predicate max operation in soft_max shaders...
2024-12-03 Jeff Bolzvulkan: copy iq4_nl LUT into shared memory (llama/10409)
2024-12-03 Jeff Bolzvulkan: further optimize mul_mat_vec using larger loads...
2024-12-03 haopengadd cmake rvv support (llama/10411)
2024-12-03 mahorozteCUDA: remove unnecessary warp reduce in FA (#1032)
2024-12-02 PABfeat: add `GGML_UNARY_OP_ARGMAX` Metal kernel (#1019)
2024-11-28 PABmetal : add `GGML_OP_CONV_TRANSPOSE_1D` kernels (#1026)
2024-11-27 Johannes Gäßlerexamples: link to HuggingFace mirror of MNIST data...
2024-11-26 Tristan DruyenFix build docs for hip (#1029)
2024-11-26 Frankie RobertsonDo not include arm_neon.h when compiling CUDA code...
2024-11-20 M Refi D.ACreate .gitmodules for the kompute backend (#1024)
2024-11-20 Georgi Gerganovsync : whisper.cpp
2024-11-20 slarenggml/sched : do not skip views in pre-assignments
2024-11-20 Johannes Gäßlerggml-opt: fix data corruption (#1022)
2024-11-19 Georgi Gerganovsync : llama.cpp
2024-11-19 bandotiAdd required ggml-base and backend libs to cmake pkg...
2024-11-19 Georgi Gerganovsync : llama.cpp
2024-11-19 Diego Devesacuda : fix CUDA_FLAGS not being applied (llama/10403)
2024-11-19 Georgi Gerganovsync : llama.cpp
2024-11-19 Romain Biessysycl : Add option to set the SYCL architecture for...
2024-11-19 Jeff Bolzvulkan: Optimize soft_max (llama/10301)
2024-11-19 Alberto Cabrera... sycl: Revert MUL_MAT_OP support changes (llama/10385)
2024-11-19 Diego Devesacuda : only use native when supported by cmake (llama...
2024-11-19 Jeff Bolzvulkan: remove use of null initializer (llama/10372)
2024-11-18 Plamen Minevmetal : fox offset integer overflows in im2col (#1015)
2024-11-18 Georgi Gerganovsync : llama.cpp
2024-11-18 0cc4mVulkan: Fix device info output format specifiers (llama...
2024-11-18 PABmetal : add `GGML_UNARY_OP_ELU` kernel (#1018)
2024-11-18 Georgi Gerganovsync : llama.cpp
2024-11-18 Johannes GäßlerCUDA: fix MMV kernel being used for FP16 src1 (llama...
2024-11-18 Georgi Gerganovsync : llama.cpp
2024-11-18 Johannes GäßlerCMake: fix typo in comment [no ci] (llama/10360)
2024-11-18 Diego Devesallama : only use default buffer types for the KV cache...
2024-11-18 Georgi Gerganovmetal : refactor kernel args into structs (llama/10238)
2024-11-18 FirstTimeEZggml : fix undefined reference to 'getcpu' (llama/10354)
2024-11-18 Johannes GäßlerCUDA: remove DMMV, consolidate F16 mult mat vec (llama...
2024-11-18 Johannes GäßlerCMake: default to -arch=native for CUDA build (llama...
2024-11-18 Diego Devesaggml : fix possible buffer use after free in sched...
2024-11-18 Georgi Gerganovggml : inttypes.h -> cinttypes (llama/0)
2024-11-18 Georgi Gerganovggml : adapt AMX to tensor->grad removal (llama/0)
next