]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
12 days ago Đinh Trọng Huymodel : add NeoBERT (#14164)
12 days ago uvosHIP: disable rocwmma on gfx12 by default until rocm...
12 days ago Georgi Gerganovllama : rework embeddings logic (#14208)
12 days ago Charles Xuggml: Add Android support for GGML_CPU_ALL_VARIANTS...
12 days ago Bartowskiconvert : remove arcee change in convert_hf_to_gguf_upd...
12 days ago Đinh Trọng Huygguf-py : allow key override when adding value to GGUFW...
12 days ago Jeff Bolzvulkan: mutex around vkQueueSubmit (#14127)
12 days ago xctanggml-cpu : rework weak alias on apple targets (#14146)
12 days ago Bartowskimodel : Add support for Arcee AI's upcoming AFM model...
12 days ago Eric Curtinserver : When listening on a unix domain socket don...
13 days ago Ed Addarioquantize : change int to unsigned int for KV overrides...
13 days ago uvosCUDA/HIP: fix ssm_scan on devices where warp size is...
13 days ago uvosHIP: Replace usage of depricated preprocessor macro...
13 days ago Georgi Gerganovkv-cache : fix use-after-move of defrag info (#14189)
13 days ago Mikko Juolamodel : add dots.llm1 architecture support (#14044...
13 days ago Georgi Gerganovcparams : rename LLAMA_MAX_PARALLEL_SEQUENCES to LLAMA_...
13 days ago Georgi Gerganovbatch : auto-gen positions + verify multi-sequence...
13 days ago Pepijn de Vosdocs : remove WIP since PR has been merged (#13912)
2025-06-14 Piotrllama-chat : Do not throw when tool parsing fails ...
2025-06-14 Aman Guptacompare-llama-bench: add option to plot (#14169)
2025-06-13 Georgi Gerganovvocab : fix build (#14175)
2025-06-13 Svetlozar Georgievsycl: fix docker image (#14144)
2025-06-13 Guy GoldenbergMerge commit from fork
2025-06-13 Georgi Gerganovbatch : add LLAMA_BATCH_DEBUG environment variable...
2025-06-13 ddpasadocs : Update multimodal.md (#14122)
2025-06-13 Georgi Gerganovbatch : rework llama_batch_allocr (#14153)
2025-06-13 Georgi Gerganovreadme : remove survey link (#14168)
2025-06-13 Christian Kastnercmake: Add ability to pass in LLAMA_BUILD_NUMBER/COMMIT...
2025-06-13 Đinh Trọng Huypooling : make cls_b and cls_out_b optional (#14165)
2025-06-13 Georgi Gerganovserver : fix SWA condition for full context reprocess...
2025-06-13 Anton Mitkovsycl: Adding additional cpy dbg print output (#14034)
2025-06-13 Ewan CrawfordSYCL: Bump oneMath commit (#14152)
2025-06-13 Christian Kastnercmake : Improve build-info.cpp generation (#14156)
2025-06-13 Georgi Gerganovvocab : prevent heap overflow when vocab is too small...
2025-06-12 Anton Mitkovsycl: Remove not needed copy f16->f32 for dnnl mul...
2025-06-12 Georgi Gerganovreadme : remove project status link (#14149)
2025-06-12 Georgi Gerganovserver : re-enable SWA speculative decoding (#14131)
2025-06-12 Georgi Gerganovcontext : simplify output counting logic during decode...
2025-06-12 Georgi Gerganovbatch : remove logits_all flag (#14141)
2025-06-12 Georgi Gerganovcmake : handle whitepsaces in path during metal build...
2025-06-12 Georgi Gerganovkv-cache : fix split_equal handling in unified implemen...
2025-06-12 compiladecontext : round n_tokens to next multiple of n_seqs...
2025-06-11 bandoticommon: fix issue with regex_escape routine on windows...
2025-06-11 Christian KastnerImplement GGML_CPU_ALL_VARIANTS for ARM (#14080)
2025-06-11 Sigbjørn Skjæretchore : clean up relative source dir paths (#14128)
2025-06-11 Sigbjørn Skjærettests : add test-tokenizers-repo (#14017)
2025-06-11 Jeff Bolzvulkan: Better thread-safety for command pools/buffers...
2025-06-11 Amanwebui: Wrap long numbers instead of infinite horizontal...
2025-06-11 Georgi Gerganovkv-cache : relax SWA masking condition (#14119)
2025-06-11 Taylorserver : pass default --keep argument (#14120)
2025-06-11 Georgi Gerganovkv-cache : add LLAMA_KV_CACHE_DEBUG environment variabl...
2025-06-11 Jeff Bolzvulkan: Track descriptor pools/sets per-context (#14109)
2025-06-10 lhezopencl: add `mul_mv_id_q4_0_f32_8x_flat` (#14003)
2025-06-10 compiladekv-cache : avoid modifying recurrent cells when setting...
2025-06-10 Sigbjørn Skjæretconvert : fix duplicate key DeepSeek-R1 conversion...
2025-06-10 Sigbjørn Skjæretllama : support GEGLU for jina-bert-v2 (#14090)
2025-06-10 Jeff Bolzvulkan: force device 0 in CI (#14106)
2025-06-10 Juk ArmstrongFixed spec timings to: accepted/tested instead of accep...
2025-06-10 Georgi Gerganovsync : ggml
2025-06-10 Georgi Gerganovggml : fix weak alias win32 (whisper/0)
2025-06-10 0cc4mVulkan: Don't default to CPU device (like llvmpipe...
2025-06-10 Isaac McFadyenrpc : nicer error messages for RPC server crash (#14076)
2025-06-10 Georgi Gerganovsync : ggml
2025-06-10 Kai PastorAdd in-build ggml::ggml ALIAS library (ggml/1260)
2025-06-09 Georgi Gerganovmetal : use less stack memory in FA kernel (#14088)
2025-06-09 Georgi Gerganovkv-cache : fix shift and defrag logic (#14081)
2025-06-09 Diego Devesallama : allow building all tests on windows when not...
2025-06-09 xctanggml-cpu : split arch-specific implementations (#13892)
2025-06-09 Diego Devesacuda : fix device sync on buffer clear (#14033)
2025-06-09 Georgi Gerganovgraph : fix geglu (#14077)
2025-06-09 Xinpeng DouCANN: Simplify the environment variable setting(#13104)
2025-06-09 R0CKSTARwebui: fix sidebar being covered by main content (...
2025-06-09 Georgi Gerganovserver : fix LRU check (#14079)
2025-06-09 Nicolò Scipionesycl: Add reorder to Q6_K mmvq implementation (#13885)
2025-06-09 Đinh Trọng Huyadd geglu activation function (#14074)
2025-06-09 Yuanhao JiCANN: Enable labeler for Ascend NPU (#13914)
2025-06-08 Diego Devesacuda : fix buffer type check with integrated GPUs ...
2025-06-07 吴小白ci: add LoongArch cross-compile build (#13944)
2025-06-07 Akarshan BiswasSYCL: Implement few same quantized type copy kernels...
2025-06-07 Sigbjørn Skjæretllama : fix llama_model_chat_template with template...
2025-06-06 Georgi Gerganovllama : deprecate llama_kv_self_ API (#14030)
2025-06-06 Georgi Gerganovcontext : fix SWA-related warning for multiple sequence...
2025-06-06 Sigbjørn Skjæretllama : support multiple classifier outputs and labels...
2025-06-05 Sigbjørn Skjæretgguf-py : add add_classifier_output_labels method to...
2025-06-05 Masato Nakasakavulkan: Enable VK_KHR_cooperative_matrix extension...
2025-06-05 pockers21ci: fix CUDA build failure on autodl cloud machines...
2025-06-05 Georgi Gerganovmemory : migrate from llama_kv_cache to more generic...
2025-06-05 Diego Devesallama : allow using mmap without PrefetchVirtualMemory...
2025-06-05 Olexandr88readme : add badge (#13938)
2025-06-05 Sigbjørn Skjæretvocab : warn about missing mask token (#14022)
2025-06-05 Georgi Gerganovcontext : fix pos_min initialization upon error decode...
2025-06-05 Jeff Bolzvulkan: automatically deduce size of push constants...
2025-06-04 Ervin Áron... ggml-vulkan: adds support for op CONV_TRANSPOSE_1D...
2025-06-04 Georgi Gerganovkv-cache : refactor the update/defrag mechanism (#13988)
2025-06-04 Diego Devesaci : remove cuda 11.7 releases, switch runner to window...
2025-06-04 Diego Devesareleases : use dl backend for linux release, remove...
2025-06-04 Xuan-Son Nguyenllama-graph : use ggml_repeat_4d (#13998)
2025-06-04 Johannes GäßlerCUDA: fix FTZ in FA for Gemma 3 (#13991)
2025-06-04 Georgi Gerganovkv-cache : fix unified::seq_rm to work with seq_id...
2025-06-03 Jeff Bolzvulkan: fix warnings in perf logger querypool code...
next