]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-02-25 Radosław Grytaggml-quants : provide ggml_vqtbl1q_u8 for 64bit compati...
2024-02-25 kwin1412make : fix nvcc version is empty (#5713)
2024-02-25 Ashok Gelalreadme : add Msty to UI list (#5618)
2024-02-25 Pierrick Hymbertserver: logs - unified format and --log-format option...
2024-02-25 Pierrick Hymbertserver: concurrency fix + monitoring - add /metrics...
2024-02-25 Radosław Grytacmake : fix compilation for Android armeabi-v7a (#5702)
2024-02-25 Georgi Gerganovcode : normalize enum names (#5697)
2024-02-25 Anas Ahouzipy : fix StableLM conversion after config.json changes...
2024-02-24 Pierrick Hymbertserver: continue to update other slots on embedding...
2024-02-24 KawrakowIQ3_S: a much better alternative to Q3_K (#5676)
2024-02-24 Pierrick Hymbertserver: init functional tests (#5566)
2024-02-23 AlpinDaleserver : add KV cache quantization options (#5684)
2024-02-23 Jared Van Bortelconvert : fix missing ftype for gemma (#5690)
2024-02-22 Jared Van Bortelmpt : do not duplicate token_embd.weight on disk (...
2024-02-22 Georgi Gerganovgemma : use more bits for the token_embd.weight tensor...
2024-02-22 Georgi Gerganovpy : add Gemma conversion from HF models (#5647)
2024-02-22 Georgi Gerganovggml : always define ggml_fp16_t as uint16_t (#5666)
2024-02-22 Georgi Gerganovsync : ggml
2024-02-22 Georgi Gerganovggml : 32-bit arm compat (whisper/1891)
2024-02-22 Someonenix: init singularity and docker images (#5056)
2024-02-22 Georgi Gerganovpy : minor fixes (#5668)
2024-02-22 Xuan Son NguyenAdd Gemma chat template (#5665)
2024-02-22 Someoneworkflows: nix: hardcode cachix ids, build unconditiona...
2024-02-22 Georgi Gerganovminor : fix trailing whitespace (#5638)
2024-02-22 Georgi Gerganovreadme : update hot topics
2024-02-22 Xuan Son Nguyenserver : fallback to chatml, add AlphaMonarch chat...
2024-02-22 Alexey Parfenovserver : clarify some params in the docs (#5640)
2024-02-22 Dat Quoc Nguyenmpt : add optional bias tensors (#5638)
2024-02-21 slarenllama : fix loading models with shared tok_embd and...
2024-02-21 Xuan Son NguyenAdd docs for llama_chat_apply_template (#5645)
2024-02-21 slarenllama : fix session save/load with quantized KV (#5649)
2024-02-21 slarengemma : allow offloading the output tensor (#5646)
2024-02-21 Jared Van Bortelexamples : do not assume BOS when shifting context...
2024-02-21 Georgi Gerganovsync : ggml
2024-02-21 Pierrick Hymbertserver: health: fix race condition on slots data using...
2024-02-21 Ettore Di Giacintoreadme : add LocalAI to the availables UI (#5629)
2024-02-21 Georgi Gerganovsync : ggml (#5633)
2024-02-21 Georgi Gerganovreadme : update hot topics
2024-02-21 Daniel Beveniusllava : add --skip-unknown to 1.6 convert.py (#5632)
2024-02-21 postmastersllama : add `gemma` model (#5631)
2024-02-21 Meng, Hengyu[SYCL] conext add name (#5624)
2024-02-21 KawrakowIQ4_NL: 4-bit non-linear quants with blocks of 32 ...
2024-02-20 CJ Paisserver : support llava 1.6 (#5553)
2024-02-20 slarenmake : fix debug build with CUDA (#5616)
2024-02-20 Daniel Beveniusllava : add explicit instructions for llava-1.6 (#5611)
2024-02-20 Xuan Son NguyenServer: use llama_chat_apply_template (#5593)
2024-02-20 Dane Madsenreadme : update UI list (#5605)
2024-02-20 Haoxiang Feimetal : add build system support for embedded metal...
2024-02-20 Pierrick Hymbertserver : health endpoint configurable failure on no...
2024-02-20 AidanBeltonSUpdate ggml_sycl_op_mul_mat_vec_q (#5502)
2024-02-19 Mathijs de... nix: now that we can do so, allow MacOS to build Vulkan...
2024-02-19 0cc4mEnable Vulkan MacOS CI
2024-02-19 0cc4mRefactor validation and enumeration platform checks...
2024-02-19 0cc4mAdd check for VK_KHR_portability_enumeration for Molten...
2024-02-19 Mathijs de... Add preprocessor checks for Apple devices.
2024-02-19 Mathijs de... Resolve ErrorIncompatibleDriver with Vulkan on MacOS.
2024-02-19 Mathijs de... Allow for Vulkan build with Accelerate.
2024-02-19 slarencuda : ignore peer access already enabled errors (...
2024-02-19 Jared Van Bortelmake : pass CPPFLAGS directly to nvcc, not via -Xcompil...
2024-02-19 nopperlexamples : support minItems/maxItems in JSON grammar...
2024-02-19 Georgi Gerganovllava : remove extra cont (#5587)
2024-02-19 slarenllava : replace ggml_cpy with ggml_cont
2024-02-19 Georgi Gerganovsync : ggml
2024-02-19 Georgi Gerganovggml-alloc : apply ggml/731
2024-02-19 Didzis Goskometal : option to embed MSL source into compiled binary...
2024-02-19 Georgi Gerganovci : enable -Werror for CUDA builds (#5579)
2024-02-19 Georgi Gerganovmake : fix CUDA build (#5580)
2024-02-19 valirayreadme : fix typo in README-sycl.md (#5353)
2024-02-19 Abhilash Majumdercmake : remove obsolete sycl compile flags (#5581)
2024-02-19 Georgi Gerganovminor : fix trailing whitespace (#5538)
2024-02-19 Daniel Beveniusllava : avoid changing the original BakLLaVA model...
2024-02-19 NawafAlansaribaby-llama : allocate graphs in ggml_context (#5573)
2024-02-19 Xuan Son Nguyenllama : add llama_chat_apply_template() (#5538)
2024-02-19 slarencuda, metal : fix nans in soft_max (#5574)
2024-02-19 Mirko185readme : update (#5572)
2024-02-19 bmwlggml : android and old glibc NUMA incompatibility bugfi...
2024-02-18 Jared Van Bortelbuild : pass all warning flags to nvcc via -Xcompiler...
2024-02-18 Georgi Gerganovggml : restore vec dot stride arg names (#5453)
2024-02-18 Georgi Gerganovci : fix wikitext url + compile warnings (#5569)
2024-02-18 Georgi Gerganovmetal : fix unused warnings (#0)
2024-02-18 Robey Holderithcommon, server : surface min_keep as its own parameter...
2024-02-18 Pierrick Hymbertserver : slots monitoring endpoint (#5550)
2024-02-18 Georgi Gerganovsampling : do not set min_keep to n_probs (#5564)
2024-02-18 Georgi Gerganovcmake : fix GGML_USE_SYCL typo (#5555)
2024-02-18 Pierrick Hymbertserver : enhanced health endpoint (#5548)
2024-02-18 Pierrick Hymbertserver : --n-predict option document and cap to max...
2024-02-18 Daniel Hiltgenserver : graceful server shutdown (#5244)
2024-02-18 Georgi Gerganovcommon : fix ub (#5530)
2024-02-18 Herman Semenovggml, common, examples, tests : fixed type arguments...
2024-02-18 Daniel Beveniusllava : update surgery script to not remove tensors...
2024-02-18 Kawrakow1.5 bit quantization (#5453)
2024-02-18 github-actions... flake.lock: Update
2024-02-17 Georgi Gerganovggml : add ALiBi support for ggml_soft_max_ext (#5488)
2024-02-17 Ananta Bastolaci : add an option to fail on compile warning (#3952)
2024-02-17 clibdevgitignore : update for CLion IDE (#5544)
2024-02-16 Georgi Gerganovcmake : fix VULKAN and ROCm builds (#5525)
2024-02-16 Georgi Gerganovscripts : add helpers script for bench comparing commit...
2024-02-16 Herman Semenovllava : removed excess free(NULL) operation (#5531)
2024-02-16 Herman Semenovllama : minor fixed return int value (#5529)
2024-02-16 Alexey Parfenovserver : add "samplers" param to control the samplers...
next