git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2024-04-25 Georgi Gerganov cmake : remove obsolete ANDROID check
2024-04-25 slaren llama : synchronize before get/set session data (#6911)
2024-04-25 Georgi Gerganov ci : tmp disable slow tests
2024-04-25 BarfingLemurs readme : update model list (#6908)
2024-04-25 slaren llama : check that all the tensor data is in the model...
2024-04-25 Georgi Gerganov ggml : fix redefinition of vaddvq_f32 for 32-bit ARM...
2024-04-25 Daniel Bevenius clip : rename lerp function to avoid conflict (#6894)
2024-04-25 Georgi Gerganov ggml : fix MIN / MAX macros (#6904)
2024-04-25 Georgi Gerganov tests : minor bash stuff (#6902)
2024-04-25 jiez quantize : add '--keep-split' to quantize model into...
2024-04-24 Johannes Gäßler README: add graphic for matrix multiplication (#6881)
2024-04-24 Douglas Hanley llama : add llama_get_pooling_type function (#6862)
2024-04-24 mgroeber9110 server : do not apply Markdown formatting in code secti...
2024-04-24 Kyle Mistele common : revert showing control tokens by default for...
2024-04-24 Johannes Gäßler Server: fix seed for multiple slots (#6835)
2024-04-24 Georgi Gerganov ggml : move 32-bit arm compat in ggml-impl.h (#6865)
2024-04-24 Tristan Druyen llama : add phi 3 chat template (#6857)
2024-04-24 Junyang Lin convert : add support of codeqwen due to tokenizer...
2024-04-24 liuwei-git llama : add phi3 support (#6852)
2024-04-23 Anas Ahouzi [SYCL] Windows default build instructions without ...
2024-04-22 Justine Tunney llamafile : improve sgemm.cpp (#6796)
2024-04-22 Dave Airlie ggml : fix calloc argument ordering. (#6820)
2024-04-22 Georgi Gerganov llama : fix typo in <|im_end|> token text (#6745)
2024-04-22 Pierrick Hymbert ci: fix job are cancelling each other (#6781)
2024-04-22 github-actions... flake.lock: Update
2024-04-21 Olivier Chafik `build`: generate hex dump of server assets during...
2024-04-21 Georgi Gerganov llama : add option to render special/control tokens...
2024-04-21 Georgi Gerganov ggml : fix ggml_backend_cpu_supports_op() for CPY (#0)
2024-04-21 Wouter llama : add llama-3 chat template (#6751)
2024-04-21 pmysl gguf-py : add IQ1_M to GGML_QUANT_SIZES (#6761)
2024-04-21 Jan Boon doc : add link to falcon (#6789)
2024-04-21 Mohammadreza... readme : add Fedora instructions (#6783)
2024-04-21 Justine Tunney llava : use logger in llava-cli (#6797)
2024-04-21 Pedro Cuenca llama : support Llama 3 HF conversion (#6745)
2024-04-20 Jan Boon doc : server tests require llama to be built with curl...
2024-04-20 Georgi Gerganov common : try to fix Android CI (#6780)
2024-04-19 loonerin ci: add ubuntu latest release and fix missing build...
2024-04-19 Pierrick Hymbert server: static: upstream upgrade (#6765)
2024-04-19 nopperl Implement the OLMo architecture (#6741)
2024-04-19 Austin train : add general name (#6752)
2024-04-19 Neo Zhang fix wrong parameter in cmd in readme-sycl.md (#6755)
2024-04-18 slaren ggml : group all experts in a single ggml_mul_mat_id...
2024-04-18 Sigbjørn Skjæret convert : support models with multiple chat templates...
2024-04-18 Ren Xuancheng Qwen2 : assume tied weights if lm_head/output weights...
2024-04-18 slaren llama : fix compatibility with old 2 expert models...
2024-04-17 Georgi Gerganov llamafile : tmp disable + build sgemm.o when needed...
2024-04-17 Yaroslav readme : add UI (#6724)
2024-04-16 Zheng.Deng convert : fix autoawq gemma (#6704)
2024-04-16 Georgi Gerganov llama : make general.name optional (#6709)
2024-04-16 Georgi Gerganov ggml : fix llamafile sgemm wdata offsets (#6710)
2024-04-16 Justine Tunney ggml : add llamafile sgemm (#6414)
2024-04-16 Ashish llama : add StableLM2 12B (#6635)
2024-04-16 Shijie llama : add qwen2moe (#6074)
2024-04-16 Daniel Bevenius gritlm : add --outdir option to hf.sh script (#6699)
2024-04-16 Georgi Gerganov perplexity : require positive --ctx-size arg (#6695)
2024-04-16 Daniel Bevenius gguf : add special tokens metadata for FIM/Infill ...
2024-04-15 Olivier Chafik `main`: add --json-schema / -j flag (#6659)
2024-04-15 compilade llama : fix restoring the number of outputs from state...
2024-04-15 Pierrick Hymbert server : revert "minor layout improvements" (#6684)
2024-04-15 Steven Prichard swift : linux support (#6590)
2024-04-15 Neo Zhang Jianyu fix mul_mat_id() for new input, make the ut pass (...
2024-04-14 David Renshaw llama : add missing kv clear in llama_beam_search ...
2024-04-14 Chao Jiang Add Command R chat template (#6650)
2024-04-14 Georgi Gerganov flake.lock: Update (#6669)
2024-04-14 Dave Added support for GGML_OP_CLAMP in Metal (#6662)
2024-04-14 Sigbjørn Skjæret Fix --split-max-size (#6655)
2024-04-14 Jaemin Son [bug fix] convert github repository_owner to lowercase...
2024-04-14 James A Capozzoli convert : enable the `--use-temp-file` cli flag (#6645)
2024-04-14 Neo Zhang Jianyu fix memcpy() crash, add missed cmd in guide, fix softma...
2024-04-13 Johannes Gäßler CUDA: fix matrix multiplication logic for tests (#6667)
2024-04-13 Pierrick Hymbert model: support arch `DbrxForCausalLM` (#6515)
2024-04-12 Olivier Chafik JSON schema conversion: ⚡️ faster repetitions, min...
2024-04-12 slaren metal : unify mul_mv_id kernels (#6556)
2024-04-12 Daniel Bevenius infill : add download instructions for model (#6626)
2024-04-12 Pierrick Hymbert server : coherent log output for KV cache full (#6637)
2024-04-12 jiez llama : add gguf_remove_key + remove split meta during...
2024-04-12 Rene Leonhardt chore: Fix markdown warnings (#6625)
2024-04-12 Georgi Gerganov imatrix : remove invalid assert (#6632)
2024-04-12 MasterYi1024 Correct free memory and total memory. (#6630)
2024-04-12 Pierrick Hymbert eval-callback: use ggml_op_desc to pretty print unary...
2024-04-12 Georgi Gerganov ci : disable Metal for macOS-latest-cmake-x64 (#6628)
2024-04-12 Clint Herron Optimization: eliminate addition of redundant stacks...
2024-04-11 Clint Herron As suggested by @slaren, disabling Metal for test to...
2024-04-11 Nikolas Refactor Error Handling for CUDA (#6575)
2024-04-11 Olivier Chafik grammars: 1.5x faster inference w/ complex grammars...
2024-04-11 Hugo Roussel ci: download artifacts to release directory (#6612)
2024-04-11 Daniel Bevenius scripts : add --outdir option to hf.sh (#6600)
2024-04-11 Pierrick Hymbert eval-callback: Example how to use eval callback for...
2024-04-10 Daniel Bevenius gguf : add option to not check tensor data (#6582)
2024-04-10 Ralph Soika minor layout improvements (#6572)
2024-04-10 slaren llama : add model types for mixtral (#6589)
2024-04-10 slaren convert.py : add consolidated.safetensors for mixtral...
2024-04-10 Pierrick Hymbert docs : how to add a model (#6565)
2024-04-10 Artem Zinnatullin readme : fix ROCm link (#6579)
2024-04-10 sjxx readme : update UI list (#6560)
2024-04-09 Jiří Sejkora readme: fix typo in amdgpu target name (#6573)
2024-04-09 Jared Van Bortel BERT tokenizer fixes (#6498)
2024-04-09 Georgi Gerganov sync : ggml
2024-04-09 Ed Lee server : detect search query to start webchat (#6554)
2024-04-09 Carolinabanana llama : add Command R Plus support (#6491)