]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2024-08-21 fairydreamingllava : zero-initialize clip_ctx structure fields with...
2024-08-21 Daniel Beveniusllama : std::move llm_bigram_bpe from work_queue (...
2024-08-20 Changyeon Kimllava: Add ACC OP for GPU acceleration to the Vulkan...
2024-08-20 Meng, Hengyu[SYCL] fallback mmvq (#9088)
2024-08-20 zhentaoyu[SYCL] Fix SYCL `im2col` and `convert` Overflow with...
2024-08-20 fairydreamingtests : add missing comma in grammar integration tests...
2024-08-19 wangshuai09cann: add doc for cann backend (#8867)
2024-08-19 Radoslav Gerganovrpc : print error message when failed to connect endpoi...
2024-08-19 Radoslav Gerganovrpc : prevent crashes on invalid input (#9040)
2024-08-18 Georgi Gerganovflake.lock: Update (#9068)
2024-08-18 ltoniazzitests : add integration test for lora adapters (#8957)
2024-08-17 Yoshi SuharaFix incorrect use of ctx_split for bias tensors (#9063)
2024-08-16 Xuan Son Nguyenserver : refactor middleware and /health endpoint ...
2024-08-16 tc-mbllava : support MiniCPM-V-2.6 (#8967)
2024-08-16 Farbod Bijarypy : fix wrong input type for raw_dtype in ggml to...
2024-08-16 AisukoFix inference example lacks required parameters (#9035)
2024-08-16 compiladegguf-py : bump version from 0.9.1 to 0.10.0 (#9051)
2024-08-16 Minsoo Cheongllama : add EXAONE model support (#9025)
2024-08-16 Liu Jiacommon : add support for cpu_get_num_physical_cores...
2024-08-16 Yoshi SuharaAdd Nemotron/Minitron GGUF Conversion & Inference Suppo...
2024-08-16 Nico Bosshardggml : dynamic ggml_sched_max_splits based on graph_siz...
2024-08-15 gtygoretrieval : fix memory leak in retrieval query handling...
2024-08-15 Riceball LEEserver : fix duplicated n_predict key in the generation...
2024-08-15 Zhenwei Jincommon : remove duplicate function llama_should_add_bos...
2024-08-15 Esko Toivonenllama : add pre-tokenizer regexes for BLOOM and gpt3...
2024-08-15 Georgi Gerganovci : disable bench workflow (#9010)
2024-08-15 Jiří Podivínserver : init stop and error fields of the result struc...
2024-08-14 0cc4mVulkan Optimizations and Fixes (#8959)
2024-08-14 compiladeserver : fix segfault on long system prompt (#8987)
2024-08-14 Georgi Gerganovcmake : remove unused option GGML_CURL (#9011)
2024-08-13 Daniel Beveniusggml : move rope type enum to ggml.h (#8949)
2024-08-13 Xuan Son Nguyenexport-lora : throw error if lora is quantized (#9002)
2024-08-12 Diogo Teles... ci : fix github workflow vulnerable to script injection...
2024-08-12 Radoslav Gerganovci : enable RPC in all of the released builds (#9006)
2024-08-12 Nico Bosshardllama : model-based max number of graph nodes calculati...
2024-08-12 Frank Maidocs: introduce gpustack and gguf-parser (#8873)
2024-08-12 DavidKorczynskigrammar-parser : fix possible null-deref (#9004)
2024-08-12 DavidKorczynskiggml: fix div-by-zero (#9003)
2024-08-12 Liu JiaFix a spelling mistake (#9001)
2024-08-12 Georgi Gerganovpy : fix requirements check '==' -> '~=' (#8982)
2024-08-12 Georgi Gerganovserver : handle models with missing EOS token (#8997)
2024-08-11 compiladegguf-py : Numpy dequantization for most types (#8939)
2024-08-11 Georgi Gerganovflake.lock: Update (#8979)
2024-08-11 Neo Zhangupdate guide (#8909)
2024-08-11 fairydreamingllama : check all graph nodes when searching for result...
2024-08-11 Markus TavenrathOptimize Vulkan backend for better CPU performance...
2024-08-10 slarenmetal : fix uninitialized abort_callback (#8968)
2024-08-10 Xuan Son Nguyenllama : default n_swa for phi-3 (#8931)
2024-08-10 fairydreamingAdd support for encoder-only T5 models (#8900)
2024-08-10 Matteo Mortarigguf-py : fix double call to add_architecture() (#8952)
2024-08-09 Georgi GerganovMerge commit from fork
2024-08-09 fairydreamingllama : add support for lora adapters in T5 model ...
2024-08-09 Georgi Gerganovmake : fix llava obj file race (#8946)
2024-08-09 Georgi Gerganovllama : better replace_all (cont) (#8926)
2024-08-09 tc-mbllava : support MiniCPM-V-2.5 (#7599)
2024-08-09 Georgi Gerganovsync : ggml
2024-08-09 Matt Stephensonwhisper : use vulkan as gpu backend when available...
2024-08-09 Daniel Beveniusembedding : add --pooling option to README.md [no ci...
2024-08-09 Daniel Beveniusllama : fix typo in llama_tensor_get_type comment ...
2024-08-09 Mathieu Geliserver : add one level list nesting for embeddings...
2024-08-09 compiladellama : reduce useless copies when saving session ...
2024-08-08 compiladegguf-py : simplify support for quant types (#8838)
2024-08-08 Georgi Gerganovscripts : sync cann files (#0)
2024-08-08 Georgi Gerganovscripts : fix sync filenames (#0)
2024-08-08 Georgi Gerganovsync : ggml
2024-08-08 Borislav Stanimirovggml : ignore more msvc warnings (ggml/906)
2024-08-08 Georgi Gerganovmetal : fix struct name (ggml/912)
2024-08-08 Conrad Kramermetal : add abort callback (ggml/905)
2024-08-08 Pablo Dubouemake : clean llamafile objects (#8923)
2024-08-07 slarenmake : use C compiler to build metal embed object ...
2024-08-07 slarenggml-backend : fix async copy from CPU (#8897)
2024-08-07 Ouadie EL FAROUKI[SYCL] Updated SYCL device filtering (#8901)
2024-08-07 Johannes GäßlerCUDA/HIP: fix tests/test-backend-ops (#8896)
2024-08-07 Zhenwei Jinllama-bench : add support for getting cpu info on Windo...
2024-08-06 Daniel Beveniusquantize : update usage comment in quantize.cpp (#8889)
2024-08-06 Nexes the Oldtypo correction (#8891)
2024-08-06 Xuan Son Nguyenserver : add lora hotswap endpoint (WIP) (#8857)
2024-08-06 Johannes GäßlerCUDA: fix padding logic for FP16/FP32 (#8884)
2024-08-06 Daniel Beveniussimple : update name of executable to llama-simple...
2024-08-06 Jaeden Amerocmake : Link vulkan-shaders-gen with pthreads (#8835)
2024-08-06 MaggotHATE[Vulkan] Fix compilation of `vulkan-shaders-gen` on...
2024-08-06 Georgi Gerganovcontributing : add note about write access
2024-08-06 Molly Sophiaggml : add epsilon as a parameter for group_norm (...
2024-08-06 Douglas Hanleyconvert : add support for XLMRoberta embedding models...
2024-08-06 Mengqing Cao[CANN]: Fix ggml_backend_cann_buffer_get_tensor (#8871)
2024-08-06 Neo Zhang[SYCL] correct cmd name (#8877)
2024-08-05 Liu Jiacommon : Changed tuple to struct (TODO fix) (#8823)
2024-08-05 wangshuai09cann: fix buffer_num and runtime speed slowly error...
2024-08-05 Eric Curtinreadme : add ramalama to the availables UI (#8811)
2024-08-05 Justine Tunneyggml : fix overflows in elu function (#8866)
2024-08-05 Brianpy: Add more authorship metadata from model card (...
2024-08-05 fairydreamingStop the generation when <|eom_id|> token is encountere...
2024-08-05 stduhpfcmake: fix paths for vulkan shaders compilation on...
2024-08-05 BarfingLemursreadme : update model list (#8851)
2024-08-05 Georgi Gerganovllama : better replace_all (#8852)
2024-08-05 0cc4mvulkan : fix Qantized Mat-Vec Mul on AMD GPUs for ncols...
2024-08-05 Georgi Gerganovsync : ggml
2024-08-05 0cc4mvulkan : implement Stable Diffusion operators (ggml...
2024-08-05 Daniel Beveniusggml : move c parameter comment to ggml_rope_ext (ggml...
2024-08-05 wangshuai09cann: support q4_0 model (#8822)
next