]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-11-17 Georgi Gerganovmetal : refactor kernel args into structs (#10238)
2024-11-17 FirstTimeEZggml : fix undefined reference to 'getcpu' (#10354)
2024-11-17 Johannes GäßlerCUDA: remove DMMV, consolidate F16 mult mat vec (#10318)
2024-11-17 Johannes GäßlerCMake: default to -arch=native for CUDA build (#10320)
2024-11-17 Diego Devesaggml : fix possible buffer use after free in sched...
2024-11-17 Georgi Gerganovggml : inttypes.h -> cinttypes (#0)
2024-11-17 Georgi Gerganovggml : adapt AMX to tensor->grad removal (#0)
2024-11-17 Georgi Gerganovmake : add ggml-opt (#0)
2024-11-17 Georgi Gerganovtests : remove test-grad0
2024-11-17 Georgi Gerganovggml : fix compile warnings (#0)
2024-11-17 Johannes Gäßlerggml: new optimization interface (ggml/988)
2024-11-17 Georgi Gerganovscripts : update sync
2024-11-16 FirstTimeEZdocs : vulkan build instructions to use git bash mingw6...
2024-11-16 Johannes Gäßlerllama/ex: remove --logdir argument (#10339)
2024-11-16 Georgi Gerganovllamafile : fix include path (#0)
2024-11-16 Georgi Gerganovmake : auto-determine dependencies (#0)
2024-11-16 MaggotHATEserver: (web UI) Add samplers sequence customization...
2024-11-16 Georgi Gerganovscripts : fix missing key in compare-llama-bench.py...
2024-11-16 Jeff Bolzvulkan: Optimize some mat-vec mul quant shaders (#10296)
2024-11-16 FirstTimeEZvulkan : add cmake preset debug/release (#10306)
2024-11-16 Dan Johanssonggml : optimize Q4_0 into Q4_0_X_Y repack (#10324)
2024-11-16 FirstTimeEZllama : save number of parameters and the size in llama...
2024-11-15 Srihari-mcwMake updates to fix issues with clang-cl builds while...
2024-11-15 Johannes Gäßlerscripts: update compare-llama-bench.py (#10319)
2024-11-15 slarenggml : fix some build issues
2024-11-15 Georgi Gerganovcmake : fix ppc64 check (whisper/0)
2024-11-15 thewh1teagleggml : vulkan logs (whisper/2547)
2024-11-15 Georgi Gerganovsync : ggml
2024-11-15 EveAVX BF16 and single scale quant optimizations (#10212)
2024-11-15 R0CKSTARci: build test musa with cmake (#10298)
2024-11-15 Romain Biessysycl: Update Intel docker images to use DPC++ 2025...
2024-11-15 Xuan Son Nguyenserver : (web UI) add copy button for code block, fix...
2024-11-15 Chenguang Licann: dockerfile and doc adjustment (#10302)
2024-11-15 Georgi Gerganovscripts : fix regex in sync [no ci]
2024-11-15 Romain Biessysycl: Use syclcompat::dp4a (#10267)
2024-11-15 Charles Xubackend cpu: add online flow for aarch64 Q4_0 GEMV...
2024-11-14 Diego Devesaggml : build backends as libraries (#10256)
2024-11-14 Johannes GäßlerCUDA: no -sm row for very small matrices (#10185)
2024-11-14 Georgi Gerganovspeculative : fix out-of-bounds access (#10289)
2024-11-14 Jeff Bolzvulkan: Optimize binary ops (#10270)
2024-11-13 Jeff Bolzvulkan: Use macros to make the mat mul pipeline creatio...
2024-11-13 Michael Podvitskiyllama : propagate the results of `graph_compute` (...
2024-11-13 Georgi Gerganovsync : ggml
2024-11-13 Small Grass... docs : update bindings list (#10261)
2024-11-13 Alexey Parfenovserver : add missing docs (#10269)
2024-11-13 Jhen-Jie Hongserver : fix incorrect res in validate_model_chat_templ...
2024-11-13 Brianmetadata: Detailed Dataset Authorship Metadata (#8875)
2024-11-13 Alberto Cabrera... sycl : Fixes to broken builds and test-backend-ops...
2024-11-13 Jeff Bolzvulkan: Optimize contiguous copies (#10254)
2024-11-11 Jeff Bolzvulkan: Throttle the number of shader compiles during...
2024-11-11 Georgi Gerganovmetal : more precise Q*K in FA vec kernel (#10247)
2024-11-11 Georgi Gerganovserver : enable KV cache defrag by default (#10233)
2024-11-10 Georgi Gerganovflake.lock: Update (#10243)
2024-11-10 MaggotHATEserver : (web UI) Add back sampler settings (#10239)
2024-11-10 Jeff Bolzvulkan: Fix newly added tests for permuted mul_mat...
2024-11-09 Georgi Gerganovmetal : reorder write loop in mul mat kernel + style...
2024-11-09 Georgi Gerganovmetal : fix build and some more comments (#10229)
2024-11-09 Georgi Gerganovmetal : fix F32 accumulation in FA vec kernel (#10232)
2024-11-09 Georgi Gerganovllama : fix Qwen model type strings
2024-11-09 Georgi Gerganovmetal : hide debug messages from normal log
2024-11-09 SXXggml: fix zero division in ‘dne’ calculation in CUDA...
2024-11-09 amritahs-ibmggml : optimize llamafile cpu matrix multiplication...
2024-11-09 haopengscripts : fix pattern and get n_tokens in one go (...
2024-11-08 Georgi Gerganovmetal : opt-in compile flag for BF16 (#10218)
2024-11-08 Georgi Gerganovmetal : improve clarity (minor) (#10171)
2024-11-08 Georgi Gerganovmetal : optimize FA kernels (#10171)
2024-11-08 Jhen-Jie Hongswift : exclude ggml-metal-embed.metal (#10211)
2024-11-07 Xuan Son Nguyenserver : minor UI fix (#10207)
2024-11-07 Xuan Son Nguyenserver : revamp chat UI with vuejs and daisyui (#10175)
2024-11-07 Georgi Gerganovscripts : add amx to sync-ggml.sh [no ci]
2024-11-07 Georgi Gerganovsync : ggml
2024-11-07 Georgi Gerganovscripts : sync update
2024-11-07 Diego Devesaggml : add ggml-cpu.h to the public headers (#10204)
2024-11-07 Faisal ZaghloulRemove identical wte/etw logic for jais (#10203)
2024-11-07 wwoodsTMDRY: Fixes clone functionality (#10192)
2024-11-07 snadampalfix q4_0_8_8 format for corrupted tokens issue (#10198)
2024-11-07 Zhiyuan LiOptimize RWKV6 Operator Naming and Implement Multi...
2024-11-06 Georgi Gerganovmetal : add BF16 support (#8439)
2024-11-06 Georgi Gerganovserver : remove hack for extra parallel slot (#10187)
2024-11-06 Diego Devesametal : fix from ptr buffer name (#10189)
2024-11-06 Georgi Gerganovggml : adjust is_first_call init value (#10193)
2024-11-06 Georgi Gerganovmetal : add quantized FA support (#10149)
2024-11-05 Gabe Goodhartllama : add <|tool_call|> formatting to Granite templat...
2024-11-04 Diego Devesaggml : fix arch check in bf16_to_fp32 (#10164)
2024-11-04 EveQ6_K AVX improvements (#10118)
2024-11-04 Diego Devesaggml : fix gelu tables initialization (#10172)
2024-11-04 Diego Devesaggml : fix q4xx mat mul, increase ggml_aligned_malloc...
2024-11-04 Xuan Son Nguyenserver : clarify /slots endpoint, add is_processing...
2024-11-04 snadampalfix build break on arm64 linux (#10166)
2024-11-04 Diego Devesacuda : clear error after changing peer access (#10153)
2024-11-04 Georgi Gerganovmetal : simplify f16 and f32 dequant kernels (#0)
2024-11-04 Georgi Gerganovmetal : move dequantize templates to beginning of MSL...
2024-11-04 leo-ponyCANN: adjust backend registry refactor. (#10158)
2024-11-04 Georgi Gerganovsync : ggml
2024-11-04 Yuri Khrustalevcmake : make it possible linking ggml as external lib...
2024-11-04 Plamen Minevmetal : fix minor string leaks (ggml/1004)
2024-11-03 Diego Devesaggml : move CPU backend to a separate file (#10144)
2024-11-03 Georgi Gerganovmetal : minor fixup in FA kernel (#10143)
2024-11-03 Georgi Gerganovflake.lock: Update (#10146)
2024-11-02 Christian KöhnenkampAdd apple arm to presets (#10134)
next