]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-07-27 R0CKSTARfeat: Support Moore Threads GPU (#8383)
2024-07-27 Georgi Gerganovscripts : sync vulkan-shaders (#0)
2024-07-27 Georgi Gerganovscripts : sync ggml-aarch64 sources
2024-07-27 Georgi Gerganovggml : add missing semicolon (#0)
2024-07-27 Georgi Gerganovsync : ggml
2024-07-27 Mahesh Madhavggml : loop tiling optimizations for scalar path (ggml...
2024-07-27 Ivan Filipovggml: add support for float16 input tensors in pooling...
2024-07-27 Tony Wasserkavulkan : initialize vk_buffer_struct members to VK_NULL...
2024-07-27 Borislav Stanimirovcmake : only enable GGML_NATIVE and x86 flags if not...
2024-07-27 Daniel Beveniusggml : remove unnecessary UNUSED macro call (ggml/880)
2024-07-27 Jeffrey Morganllama : add support for llama 3.1 rope scaling factors...
2024-07-27 Georgi Gerganovllama : add function for model-based max number of...
2024-07-27 Daniel Beveniuscommon : add --no-warmup option for main/llama-cli...
2024-07-27 wangshuai09cann: Fix Multi-NPU execution error (#8710)
2024-07-27 slarenggml : reduce hash table reset cost (#8698)
2024-07-26 Juddllama : fix order of parameters (#8706)
2024-07-25 Yaikoserver : add Speech Recognition & Synthesis to UI ...
2024-07-25 Xuan Son Nguyenexamples : export-lora : fix issue with quantized base...
2024-07-25 DavidKorczynskiggml: handle ggml_init failure to fix NULL pointer...
2024-07-25 Georgi Gerganovllama : fix build + fix fabs compile warnings (#8683)
2024-07-25 Andreas (Andi... ggml : fix build on Windows with Snapdragon X (#8531)
2024-07-25 Georgi Gerganovtests : fix printfs (#8068)
2024-07-25 Chen Xi[SYCL] fix multi-gpu issue on sycl (#8554)
2024-07-25 Georgi Gerganovggml : add and use ggml_cpu_has_llamafile() (#8664)
2024-07-25 Xuan Son Nguyenexamples : remove `finetune` and `train-text-from-scrat...
2024-07-25 Ujjawal Panchaldocs : Quantum -> Quantized (#8666)
2024-07-25 Fan Shupeillama: use sliding window for phi3 (#8627)
2024-07-24 MorganRO8readme : update games list (#8673)
2024-07-24 Joe ToddBuild Llama SYCL Intel with static libs (#8668)
2024-07-24 Thorsten Sommerreadme : update UI list [no ci] (#8505)
2024-07-24 Xuan Son Nguyenllama : fix `llama_chat_format_single` for mistral...
2024-07-24 Joe ToddRe-add erroneously removed -fsycl from GGML_EXTRA_LIBS...
2024-07-24 Xuan Son Nguyenadd llama_lora_adapter_clear (#8653)
2024-07-23 Xuan Son Nguyenexamples : Fix `llama-export-lora` example (#8607)
2024-07-23 Vali Malinoiuserver : fix URL.parse in the UI (#8646)
2024-07-23 Joe Toddsycl : Add support for non-release DPC++ & oneMKL ...
2024-07-23 Georgi Gerganovllama : move vocab, grammar and sampling into separate...
2024-07-23 0cc4mVulkan IQ4_NL Support (#8613)
2024-07-23 Jeroen MostertAllow all RDNA2 archs to use sdot4 intrinsic (#8629)
2024-07-23 Georgi Gerganovcontrib : clarify PR squashing + module names (#8630)
2024-07-23 luoyu-intel[SYCL] fix scratch size of softmax (#8642)
2024-07-22 Keke Hanllama : fix codeshell support (#8599)
2024-07-22 Jason Stillermanllama : add support for SmolLm pre-tokenizer (#8609)
2024-07-22 Jiří Podivín*.py: Stylistic adjustments for python (#8233)
2024-07-22 Georgi Gerganovllama : allow overrides for tokenizer flags (#8614)
2024-07-22 Georgi Gerganovtests : re-enable tokenizer tests (#8611)
2024-07-22 Douglas Hanleyllama : add Mistral Nemo inference support (#8604)
2024-07-22 Jan Boonserver : update doc to clarify n_keep when there is...
2024-07-22 Mark Zhuangggml: fix compile error for RISC-V (#8623)
2024-07-22 devojonyexamples: fix android example cannot be generated conti...
2024-07-21 Georgi Gerganovflake.lock: Update (#8610)
2024-07-21 M-Aexamples : Rewrite pydantic_models_to_grammar_examples...
2024-07-21 compiladegguf-py : fix some metadata name extraction edge cases...
2024-07-21 compiladeconvert_hf : fix Gemma v1 conversion (#8597)
2024-07-20 Johannes GäßlerCUDA: MMQ code deduplication + iquant support (#8495)
2024-07-20 Georgi Gerganovgguf : handle null name during init (#8587)
2024-07-20 Michael Coppolallama : add support for Tekken pre-tokenizer (#8579)
2024-07-20 Huifeng Oullama.swiftui: fix end of generation bug (#8268)
2024-07-20 Briangguf_dump.py: fix markddown kv array print (#8588)
2024-07-19 slarenggml : fix quant dot product with odd number of blocks...
2024-07-19 Brianconvert-*.py: remove add_name from ChatGLMModel class...
2024-07-19 Georgi Gerganovllama : bump max layers from 256 to 512 (#8530)
2024-07-19 Georgi Gerganovreadme : fix server badge
2024-07-19 Clint Herronggml : add friendlier error message to fopen errors...
2024-07-19 Frank Maifix: typo of chatglm4 chat tmpl (#8586)
2024-07-19 Brianconvert-*.py: add general.name kv override (#8571)
2024-07-18 Johannes GäßlerCUDA: fix partial offloading for ne0 % 256 != 0 (#8572)
2024-07-18 65acmake : install all ggml public headers (#8480)
2024-07-18 Eric Zhangserver: use relative routes for static files in new...
2024-07-18 Brianconvert-*.py: GGUF Naming Convention Refactor and Metad...
2024-07-18 RunningLeonserver : respect `--special` cli arg (#8553)
2024-07-17 Johannes Gäßlerlookup: fibonacci hashing, fix crashes (#8548)
2024-07-17 Al Mochkinbuild : Fix docker build warnings (#8535) (#8537)
2024-07-17 BrianCONTRIBUTING.md : remove mention of noci (#8541)
2024-07-17 hipudding[CANN] Add Ascend NPU backend (#6035)
2024-07-17 Masaya, Katobatched: fix n_predict parameter (#8527)
2024-07-17 Georgi Gerganovllama : disable context-shift for DeepSeek v2 (#8501)
2024-07-16 Johannes Gäßlermake/cmake: add missing force MMQ/cuBLAS for HIP (...
2024-07-16 Briangguf-hash : update clib.json to point to original xxhas...
2024-07-16 Steve Bondsexport-lora : handle help argument (#8497)
2024-07-16 Georgi Gerganovllama : valign + remove unused ftype (#8502)
2024-07-16 compiladeconvert_hf : faster lazy safetensors (#8482)
2024-07-15 Xuan Son NguyenRefactor lora adapter support (#8332)
2024-07-15 Xuan Son Nguyenfix ci (#8494)
2024-07-15 Daniel Beveniusggml : suppress unknown pragma 'GCC' on windows (#8460)
2024-07-15 M-Aserver: update README.md with llama-server --help outpu...
2024-07-15 Georgi Gerganovcommon : add --no-cont-batching arg (#6358)
2024-07-15 NikolaiLyssogordocs: fix links in development docs [no ci] (#8481)
2024-07-15 Meng, Hengyu[SYCL] add concat through dim 1/2 (#8483)
2024-07-15 Georgi Gerganovllama : de-duplicate deepseek2 norm
2024-07-15 0cc4mVulkan MMQ Fix (#8479)
2024-07-14 compiladepydantic : replace uses of __annotations__ with get_typ...
2024-07-14 Georgi Gerganovflake.lock: Update (#8475)
2024-07-14 Georgi Gerganovllama : fix Gemma-2 Query scaling factors (#8473)
2024-07-14 Briangguf_hash.py: Add sha256 (#8470)
2024-07-14 compiladellama : fix pre-tokenization of non-special added token...
2024-07-13 bandotivulkan : cmake integration (#8119)
2024-07-13 Georgi Gerganovmetal : template-ify some of the kernels (#8447)
2024-07-12 Georgi Gerganovserver : handle content array in chat API (#8449)
2024-07-12 Georgi Gerganovmain : print error on empty input (#8456)
next