git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2024-04-19 loonerin  ci: add ubuntu latest release and fix missing build...
2024-04-19 Pierrick Hymbert  server: static: upstream upgrade (#6765)
2024-04-19 nopperl  Implement the OLMo architecture (#6741)
2024-04-19 Austin  train : add general name (#6752)
2024-04-19 Neo Zhang  fix wrong parameter in cmd in readme-sycl.md (#6755)
2024-04-18 slaren  ggml : group all experts in a single ggml_mul_mat_id...
2024-04-18 Sigbjørn Skjæret  convert : support models with multiple chat templates...
2024-04-18 Ren Xuancheng  Qwen2 : assume tied weights if lm_head/output weights...
2024-04-18 slaren  llama : fix compatibility with old 2 expert models...
2024-04-17 Georgi Gerganov  llamafile : tmp disable + build sgemm.o when needed...
2024-04-17 Yaroslav  readme : add UI (#6724)
2024-04-16 Zheng.Deng  convert : fix autoawq gemma (#6704)
2024-04-16 Georgi Gerganov  llama : make general.name optional (#6709)
2024-04-16 Georgi Gerganov  ggml : fix llamafile sgemm wdata offsets (#6710)
2024-04-16 Justine Tunney  ggml : add llamafile sgemm (#6414)
2024-04-16 Ashish  llama : add StableLM2 12B (#6635)
2024-04-16 Shijie  llama : add qwen2moe (#6074)
2024-04-16 Daniel Bevenius  gritlm : add --outdir option to hf.sh script (#6699)
2024-04-16 Georgi Gerganov  perplexity : require positive --ctx-size arg (#6695)
2024-04-16 Daniel Bevenius  gguf : add special tokens metadata for FIM/Infill ...
2024-04-15 Olivier Chafik  `main`: add --json-schema / -j flag (#6659)
2024-04-15 compilade  llama : fix restoring the number of outputs from state...
2024-04-15 Pierrick Hymbert  server : revert "minor layout improvements" (#6684)
2024-04-15 Steven Prichard  swift : linux support (#6590)
2024-04-15 Neo Zhang Jianyu  fix mul_mat_id() for new input, make the ut pass (...
2024-04-14 David Renshaw  llama : add missing kv clear in llama_beam_search ...
2024-04-14 Chao Jiang  Add Command R chat template (#6650)
2024-04-14 Georgi Gerganov  flake.lock: Update (#6669)
2024-04-14 Dave  Added support for GGML_OP_CLAMP in Metal (#6662)
2024-04-14 Sigbjørn Skjæret  Fix --split-max-size (#6655)
2024-04-14 Jaemin Son  [bug fix] convert github repository_owner to lowercase...
2024-04-14 James A Capozzoli  convert : enable the `--use-temp-file` cli flag (#6645)
2024-04-14 Neo Zhang Jianyu  fix memcpy() crash, add missed cmd in guide, fix softma...
2024-04-13 Johannes Gäßler  CUDA: fix matrix multiplication logic for tests (#6667)
2024-04-13 Pierrick Hymbert  model: support arch `DbrxForCausalLM` (#6515)
2024-04-12 Olivier Chafik  JSON schema conversion: ⚡️ faster repetitions, min...
2024-04-12 slaren  metal : unify mul_mv_id kernels (#6556)
2024-04-12 Daniel Bevenius  infill : add download instructions for model (#6626)
2024-04-12 Pierrick Hymbert  server : coherent log output for KV cache full (#6637)
2024-04-12 jiez  llama : add gguf_remove_key + remove split meta during...
2024-04-12 Rene Leonhardt  chore: Fix markdown warnings (#6625)
2024-04-12 Georgi Gerganov  imatrix : remove invalid assert (#6632)
2024-04-12 MasterYi1024  Correct free memory and total memory. (#6630)
2024-04-12 Pierrick Hymbert  eval-callback: use ggml_op_desc to pretty print unary...
2024-04-12 Georgi Gerganov  ci : disable Metal for macOS-latest-cmake-x64 (#6628)
2024-04-12 Clint Herron  Optimization: eliminate addition of redundant stacks...
2024-04-11 Clint Herron  As suggested by @slaren, disabling Metal for test to...
2024-04-11 Nikolas  Refactor Error Handling for CUDA (#6575)
2024-04-11 Olivier Chafik  grammars: 1.5x faster inference w/ complex grammars...
2024-04-11 Hugo Roussel  ci: download artifacts to release directory (#6612)
2024-04-11 Daniel Bevenius  scripts : add --outdir option to hf.sh (#6600)
2024-04-11 Pierrick Hymbert  eval-callback: Example how to use eval callback for...
2024-04-10 Daniel Bevenius  gguf : add option to not check tensor data (#6582)
2024-04-10 Ralph Soika  minor layout improvements (#6572)
2024-04-10 slaren  llama : add model types for mixtral (#6589)
2024-04-10 slaren  convert.py : add consolidated.safetensors for mixtral...
2024-04-10 Pierrick Hymbert  docs : how to add a model (#6565)
2024-04-10 Artem Zinnatullin  readme : fix ROCm link (#6579)
2024-04-10 sjxx  readme : update UI list (#6560)
2024-04-09 Jiří Sejkora  readme: fix typo in amdgpu target name (#6573)
2024-04-09 Jared Van Bortel  BERT tokenizer fixes (#6498)
2024-04-09 Georgi Gerganov  sync : ggml
2024-04-09 Ed Lee  server : detect search query to start webchat (#6554)
2024-04-09 Carolinabanana  llama : add Command R Plus support (#6491)
2024-04-09 Georgi Gerganov  license : update copyright notice + add AUTHORS (#6405)
2024-04-08 Georgi Gerganov  llama : fix attention layer count sanity check (#6550)
2024-04-08 kunnis  Comment explaining a decision (#6531)
2024-04-08 Georgi Gerganov  quantize : fix precedence of cli args (#6541)
2024-04-08 Rick G  llama : support negative ith in llama_get_ API (#6519)
2024-04-08 Jan Boon  llama : save and restore kv cache for single seq id...
2024-04-08 Abhilash Majumder  remove row=1 cond (#6532)
2024-04-08 Firat  Adding KodiBot to UI list (#6535)
2024-04-07 Mark Fairbairn  Change Windows AMD example to release build to make...
2024-04-07 Georgi Gerganov  flake.lock: Update (#6517)
2024-04-07 DAN™  Add GritLM as supported models. (#6513)
2024-04-07 Georgi Gerganov  sync : ggml
2024-04-07 Slava Primenko  ggml: bypass code incompatible with CUDA < 11.1 (whispe...
2024-04-07 Georgi Gerganov  scripts : sync ggml-cuda folder
2024-04-07 limitedAtonement  Run make to build the project (#6457)
2024-04-07 Neo Zhang Jianyu  support/fix OPs GGML_TYPE_IQ4_NL, GGML_TYPE_IQ4_XS...
2024-04-06 Georgi Gerganov  sync : ggml
2024-04-06 Daniel Bevenius  backend : fix typo in scheduler documentation (ggml...
2024-04-06 Clint Herron  Tests: Added integration tests for GBNF parser (#6472)
2024-04-06 Pierrick Hymbert  ci: bench: support sse and fix prompt processing time...
2024-04-05 Brian  gguf.py : add licence and version to gguf writer (...
2024-04-05 Hoang Nguyen  readme : update UI list (#6503)
2024-04-05 Ting Sun  bench : make n_batch and n_ubatch configurable in Batch...
2024-04-05 Ouadie EL FAROUKI  [SYCL] Fixed minor bug when enabling FP16 for non intel...
2024-04-04 alexpinel  readme : add Dot to UI list (#6487)
2024-04-04 Jun Jie  readme : fix typo (#6481)
2024-04-04 Ed Lepedus  server: add cURL support to server Dockerfiles (#6474)
2024-04-04 Minsoo Cheong  ci: exempt master branch workflows from getting cancell...
2024-04-04 Ewout ter Hoeven  build CI: Name artifacts (#6482)
2024-04-04 Shakhar Dasgupta  server: allow penalizing repetition of newlines on...
2024-04-04 Pierrick Hymbert  ci: bench fix concurrency for workflow trigger dispatch...
2024-04-04 limitedAtonement  Correct README link (#6458)
2024-04-04 Pierrick Hymbert  ci: bench: add more ftype, fix triggers and bot comment...
2024-04-04 Daniel Bevenius  common: remove duplicate check for curl (#6471)
2024-04-04 Clint Herron  examples : add GBNF validator program (#5948)
2024-04-04 Georgi Gerganov  server : remove obsolete --memory-f32 option