git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-11-25  Georgi Gerganov  metal : enable mat-vec kernels for bs <= 4 (#10491)
2024-11-25  Shane A  Rename Olmo1124 to Olmo2 (#10500)
2024-11-25  Diego Devesa  llama : accept a list of devices to use to offload...
2024-11-25  Johannes Gäßler  Github: update issue templates [no ci] (#10489)
2024-11-25  brucepro  Add download chat feature to server chat (#10481)
2024-11-25  Georgi Gerganov  server : add speculative decoding support (#10455)
2024-11-25  Diego Devesa  ggml : add support for dynamic loading of backends...
2024-11-25  Georgi Gerganov  tests : fix compile warning
2024-11-25  Georgi Gerganov  metal : minor code formatting
2024-11-25  Neo Zhang Jianyu  [SYCL] Fix building Win package for oneAPI 2025.0 updat...
2024-11-25  Georgi Gerganov  speculative : refactor and add a simpler example (...
2024-11-24  Georgi Gerganov  flake.lock: Update (#10470)
2024-11-24  Diego Devesa  llama : fix op mul check with command-r-plus (#10476)
2024-11-24  Gabe Goodhart  convert : XLMRoberta Type Vocab Size (#10458)
2024-11-24  momonga  fix gguf-py: Conversion error when multiple licenses...
2024-11-23  Diego Devesa  ggml : do not use ARM features not included in the...
2024-11-22  蕭澧邦  ci: Update oneAPI runtime dll packaging (#10428)
2024-11-22  Johannes Gäßler  GitHub: ask for more info in issue templates (#10426)
2024-11-22  leo-pony  CANN: Support Ascend310P to accelerate F32 and F16...
2024-11-21  Diego Devesa  cuda : optimize argmax (#10441)
2024-11-21  Georgi Gerganov  llama : handle KV shift for recurrent models (#10402)
2024-11-21  Georgi Gerganov  sync : ggml
2024-11-21  slaren  ggml/sched : do not skip views in pre-assignments
2024-11-21  Johannes Gäßler  ggml-opt: fix data corruption (ggml/1022)
2024-11-20  Jeff Bolz  vulkan: predicate max operation in soft_max shaders...
2024-11-20  bandoti  cmake: add link dependencies to cmake find pkg (#10433)
2024-11-20  Diego Devesa  llama : add .clang-format file (#10415)
2024-11-20  Jeff Bolz  vulkan: copy iq4_nl LUT into shared memory (#10409)
2024-11-20  Jeff Bolz  vulkan: further optimize mul_mat_vec using larger loads...
2024-11-20  Neo Zhang Jianyu  update rel to 4040 (#10395)
2024-11-19  Anthony Van...  Fix missing file renames in Makefile due to changes...
2024-11-19  haopeng  add cmake rvv support (#10411)
2024-11-19  Georgi Gerganov  sync : ggml
2024-11-19  Plamen Minev  metal : fox offset integer overflows in im2col (ggml...
2024-11-19  PAB  metal : add `GGML_UNARY_OP_ELU` kernel (ggml/1018)
2024-11-19  蕭澧邦  cmake: force MSVC compiler charset to utf-8 (#9989)
2024-11-19  bandoti  Add required ggml-base and backend libs to cmake pkg...
2024-11-19  Diego Devesa  cuda : fix CUDA_FLAGS not being applied (#10403)
2024-11-19  Georgi Gerganov  llama : add check for KV cache shifts (#10401)
2024-11-19  Shane A  llama : add OLMo November 2024 support (#10394)
2024-11-19  Romain Biessy  sycl : Add option to set the SYCL architecture for...
2024-11-19  Jeff Bolz  vulkan: Optimize soft_max (#10301)
2024-11-19  Alberto Cabrera...  sycl: Revert MUL_MAT_OP support changes (#10385)
2024-11-18  Diego Devesa  cuda : only use native when supported by cmake (#10389)
2024-11-18  bandoti  Skip searching root path for cross-compile builds ...
2024-11-18  Jeff Bolz  vulkan: remove use of null initializer (#10372)
2024-11-18  Georgi Gerganov  flake.lock: Update (#10346)
2024-11-18  0cc4m  Vulkan: Fix device info output format specifiers (...
2024-11-17  Johannes Gäßler  docker: use GGML_NATIVE=OFF (#10368)
2024-11-17  Johannes Gäßler  CUDA: fix MMV kernel being used for FP16 src1 (#10357)
2024-11-17  Johannes Gäßler  CMake: fix typo in comment [no ci] (#10360)
2024-11-17  Diego Devesa  llama : only use default buffer types for the KV cache...
2024-11-17  Georgi Gerganov  gitignore : ignore local run scripts [no ci]
2024-11-17  Georgi Gerganov  metal : refactor kernel args into structs (#10238)
2024-11-17  FirstTimeEZ  ggml : fix undefined reference to 'getcpu' (#10354)
2024-11-17  Johannes Gäßler  CUDA: remove DMMV, consolidate F16 mult mat vec (#10318)
2024-11-17  Johannes Gäßler  CMake: default to -arch=native for CUDA build (#10320)
2024-11-17  Diego Devesa  ggml : fix possible buffer use after free in sched...
2024-11-17  Georgi Gerganov  ggml : inttypes.h -> cinttypes (#0)
2024-11-17  Georgi Gerganov  ggml : adapt AMX to tensor->grad removal (#0)
2024-11-17  Georgi Gerganov  make : add ggml-opt (#0)
2024-11-17  Georgi Gerganov  tests : remove test-grad0
2024-11-17  Georgi Gerganov  ggml : fix compile warnings (#0)
2024-11-17  Johannes Gäßler  ggml: new optimization interface (ggml/988)
2024-11-17  Georgi Gerganov  scripts : update sync
2024-11-16  FirstTimeEZ  docs : vulkan build instructions to use git bash mingw6...
2024-11-16  Johannes Gäßler  llama/ex: remove --logdir argument (#10339)
2024-11-16  Georgi Gerganov  llamafile : fix include path (#0)
2024-11-16  Georgi Gerganov  make : auto-determine dependencies (#0)
2024-11-16  MaggotHATE  server: (web UI) Add samplers sequence customization...
2024-11-16  Georgi Gerganov  scripts : fix missing key in compare-llama-bench.py...
2024-11-16  Jeff Bolz  vulkan: Optimize some mat-vec mul quant shaders (#10296)
2024-11-16  FirstTimeEZ  vulkan : add cmake preset debug/release (#10306)
2024-11-16  Dan Johansson  ggml : optimize Q4_0 into Q4_0_X_Y repack (#10324)
2024-11-16  FirstTimeEZ  llama : save number of parameters and the size in llama...
2024-11-15  Srihari-mcw  Make updates to fix issues with clang-cl builds while...
2024-11-15  Johannes Gäßler  scripts: update compare-llama-bench.py (#10319)
2024-11-15  slaren  ggml : fix some build issues
2024-11-15  Georgi Gerganov  cmake : fix ppc64 check (whisper/0)
2024-11-15  thewh1teagle  ggml : vulkan logs (whisper/2547)
2024-11-15  Georgi Gerganov  sync : ggml
2024-11-15  Eve  AVX BF16 and single scale quant optimizations (#10212)
2024-11-15  R0CKSTAR  ci: build test musa with cmake (#10298)
2024-11-15  Romain Biessy  sycl: Update Intel docker images to use DPC++ 2025...
2024-11-15  Xuan Son Nguyen  server : (web UI) add copy button for code block, fix...
2024-11-15  Chenguang Li  cann: dockerfile and doc adjustment (#10302)
2024-11-15  Georgi Gerganov  scripts : fix regex in sync [no ci]
2024-11-15  Romain Biessy  sycl: Use syclcompat::dp4a (#10267)
2024-11-15  Charles Xu  backend cpu: add online flow for aarch64 Q4_0 GEMV...
2024-11-14  Diego Devesa  ggml : build backends as libraries (#10256)
2024-11-14  Johannes Gäßler  CUDA: no -sm row for very small matrices (#10185)
2024-11-14  Georgi Gerganov  speculative : fix out-of-bounds access (#10289)
2024-11-14  Jeff Bolz  vulkan: Optimize binary ops (#10270)
2024-11-13  Jeff Bolz  vulkan: Use macros to make the mat mul pipeline creatio...
2024-11-13  Michael Podvitskiy  llama : propagate the results of `graph_compute` (...
2024-11-13  Georgi Gerganov  sync : ggml
2024-11-13  Small Grass...  docs : update bindings list (#10261)
2024-11-13  Alexey Parfenov  server : add missing docs (#10269)
2024-11-13  Jhen-Jie Hong  server : fix incorrect res in validate_model_chat_templ...
2024-11-13  Brian  metadata: Detailed Dataset Authorship Metadata (#8875)