]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-08-12 Nico Bosshardllama : model-based max number of graph nodes calculati...
2024-08-12 Frank Maidocs: introduce gpustack and gguf-parser (#8873)
2024-08-12 DavidKorczynskigrammar-parser : fix possible null-deref (#9004)
2024-08-12 DavidKorczynskiggml: fix div-by-zero (#9003)
2024-08-12 Liu JiaFix a spelling mistake (#9001)
2024-08-12 Georgi Gerganovpy : fix requirements check '==' -> '~=' (#8982)
2024-08-12 Georgi Gerganovserver : handle models with missing EOS token (#8997)
2024-08-11 compiladegguf-py : Numpy dequantization for most types (#8939)
2024-08-11 Georgi Gerganovflake.lock: Update (#8979)
2024-08-11 Neo Zhangupdate guide (#8909)
2024-08-11 fairydreamingllama : check all graph nodes when searching for result...
2024-08-11 Markus TavenrathOptimize Vulkan backend for better CPU performance...
2024-08-10 slarenmetal : fix uninitialized abort_callback (#8968)
2024-08-10 Xuan Son Nguyenllama : default n_swa for phi-3 (#8931)
2024-08-10 fairydreamingAdd support for encoder-only T5 models (#8900)
2024-08-10 Matteo Mortarigguf-py : fix double call to add_architecture() (#8952)
2024-08-09 Georgi GerganovMerge commit from fork
2024-08-09 fairydreamingllama : add support for lora adapters in T5 model ...
2024-08-09 Georgi Gerganovmake : fix llava obj file race (#8946)
2024-08-09 Georgi Gerganovllama : better replace_all (cont) (#8926)
2024-08-09 tc-mbllava : support MiniCPM-V-2.5 (#7599)
2024-08-09 Georgi Gerganovsync : ggml
2024-08-09 Matt Stephensonwhisper : use vulkan as gpu backend when available...
2024-08-09 Daniel Beveniusembedding : add --pooling option to README.md [no ci...
2024-08-09 Daniel Beveniusllama : fix typo in llama_tensor_get_type comment ...
2024-08-09 Mathieu Geliserver : add one level list nesting for embeddings...
2024-08-09 compiladellama : reduce useless copies when saving session ...
2024-08-08 compiladegguf-py : simplify support for quant types (#8838)
2024-08-08 Georgi Gerganovscripts : sync cann files (#0)
2024-08-08 Georgi Gerganovscripts : fix sync filenames (#0)
2024-08-08 Georgi Gerganovsync : ggml
2024-08-08 Borislav Stanimirovggml : ignore more msvc warnings (ggml/906)
2024-08-08 Georgi Gerganovmetal : fix struct name (ggml/912)
2024-08-08 Conrad Kramermetal : add abort callback (ggml/905)
2024-08-08 Pablo Dubouemake : clean llamafile objects (#8923)
2024-08-07 slarenmake : use C compiler to build metal embed object ...
2024-08-07 slarenggml-backend : fix async copy from CPU (#8897)
2024-08-07 Ouadie EL FAROUKI[SYCL] Updated SYCL device filtering (#8901)
2024-08-07 Johannes GäßlerCUDA/HIP: fix tests/test-backend-ops (#8896)
2024-08-07 Zhenwei Jinllama-bench : add support for getting cpu info on Windo...
2024-08-06 Daniel Beveniusquantize : update usage comment in quantize.cpp (#8889)
2024-08-06 Nexes the Oldtypo correction (#8891)
2024-08-06 Xuan Son Nguyenserver : add lora hotswap endpoint (WIP) (#8857)
2024-08-06 Johannes GäßlerCUDA: fix padding logic for FP16/FP32 (#8884)
2024-08-06 Daniel Beveniussimple : update name of executable to llama-simple...
2024-08-06 Jaeden Amerocmake : Link vulkan-shaders-gen with pthreads (#8835)
2024-08-06 MaggotHATE[Vulkan] Fix compilation of `vulkan-shaders-gen` on...
2024-08-06 Georgi Gerganovcontributing : add note about write access
2024-08-06 Molly Sophiaggml : add epsilon as a parameter for group_norm (...
2024-08-06 Douglas Hanleyconvert : add support for XLMRoberta embedding models...
2024-08-06 Mengqing Cao[CANN]: Fix ggml_backend_cann_buffer_get_tensor (#8871)
2024-08-06 Neo Zhang[SYCL] correct cmd name (#8877)
2024-08-05 Liu Jiacommon : Changed tuple to struct (TODO fix) (#8823)
2024-08-05 wangshuai09cann: fix buffer_num and runtime speed slowly error...
2024-08-05 Eric Curtinreadme : add ramalama to the availables UI (#8811)
2024-08-05 Justine Tunneyggml : fix overflows in elu function (#8866)
2024-08-05 Brianpy: Add more authorship metadata from model card (...
2024-08-05 fairydreamingStop the generation when <|eom_id|> token is encountere...
2024-08-05 stduhpfcmake: fix paths for vulkan shaders compilation on...
2024-08-05 BarfingLemursreadme : update model list (#8851)
2024-08-05 Georgi Gerganovllama : better replace_all (#8852)
2024-08-05 0cc4mvulkan : fix Qantized Mat-Vec Mul on AMD GPUs for ncols...
2024-08-05 Georgi Gerganovsync : ggml
2024-08-05 0cc4mvulkan : implement Stable Diffusion operators (ggml...
2024-08-05 Daniel Beveniusggml : move c parameter comment to ggml_rope_ext (ggml...
2024-08-05 wangshuai09cann: support q4_0 model (#8822)
2024-08-04 Brandon SquizzatoInstall curl in runtime layer (#8693)
2024-08-04 ardforkServer: Don't ignore llama.cpp params (#8754)
2024-08-04 Brian Cunniebatched-bench : handle empty `-npl` (#8839)
2024-08-04 Daniel Beveniusbaby-llama : remove duplicate vector include
2024-08-04 Georgi Gerganovflake.lock: Update (#8847)
2024-08-03 jdomkeggml : reading the runtime sve config of the cpu (...
2024-08-02 Sigbjørn SkjæretFix conversion of unnormalized BF16->BF16 weights ...
2024-08-02 Mengqing Caocann: Fix ggml_cann_im2col for 1D im2col (#8819)
2024-08-02 Ouadie EL FAROUKI[SYCL] Fixing wrong VDR iq4nl value (#8812)
2024-08-01 matteoggml-cuda: Adding support for unified memory (#8035)
2024-08-01 Alex O'ConnellBuild: Only include execinfo.h on linux systems that...
2024-08-01 slarencuda : fix dmmv cols requirement to 2*GGML_CUDA_DMMV_X...
2024-08-01 wangshuai09cann: support q8_0 for Ascend\b backend (#8805)
2024-07-31 Igor Okulistserver : update llama-server embedding flag documentati...
2024-07-31 Clint HerronBuild: Fix potential race condition (#8781)
2024-07-31 pcullitonAdding Gemma 2 2B configs (#8784)
2024-07-31 Borislav Stanimirovcmake : fix use of external ggml (#8787)
2024-07-30 Someonenix: cuda: rely on propagatedBuildInputs (#8772)
2024-07-30 Brianpy: add_array() will not add to kv store if value is...
2024-07-30 l3utterflyadded android implementation of ggml_print_backtrace_sy...
2024-07-30 Georgi Gerganovflake.lock: Update (#8729)
2024-07-30 wangshuai09cann: update cmake (#8765)
2024-07-30 zhentaoyu[SYCL] Add `TIMESTEP_EMBEDDING` OP (#8707)
2024-07-29 CarterLi999ggml: bugfix: fix the inactive elements is agnostic...
2024-07-29 R0CKSTARcuda : organize vendor-specific headers into vendors...
2024-07-29 Meng, Hengyu[SYCL] add conv support (#8688)
2024-07-28 Johannes Gäßlercmake: use 1 more thread for non-ggml in CI (#8740)
2024-07-28 Austinchore : Fix vulkan related compiler warnings, add help...
2024-07-28 compiladellama : refactor session file management (#8699)
2024-07-27 R0CKSTARfeat: Support Moore Threads GPU (#8383)
2024-07-27 Georgi Gerganovscripts : sync vulkan-shaders (#0)
2024-07-27 Georgi Gerganovscripts : sync ggml-aarch64 sources
2024-07-27 Georgi Gerganovggml : add missing semicolon (#0)
2024-07-27 Georgi Gerganovsync : ggml
next