]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
pkg/ggml/sources/llama.cpp
2026-01-01 ttmodel: support youtu-vl model (#18479)
2026-01-01 Piotr Wilkin... Add conversion support for IQuestCoderForCausalLM ...
2026-01-01 o7simodel : add support for JinaBertModel with non-gated...
2026-01-01 o7siconvert : fix encoding of WPM vocab for BERT models...
2026-01-01 HelloKSmodel: add Solar Open model (#18511)
2026-01-01 Anri Lombardwebui: fix code copy stripping XML/HTML tags (#18518)
2026-01-01 Aman Guptaggml-cuda: remove unneccesary prints on ggml_cuda_init...
2026-01-01 Jeff Bolzvulkan: extend topk_moe to handle sigmoid w/exp_probs_b...
2026-01-01 triplenomllama: handle short reads in direct I/O path (#18504) upstream/0.0.7599
2025-12-31 Anri Lombardchat: make tool description and parameters optional...
2025-12-31 Georgi Gerganovsync : ggml
2025-12-31 Georgi Gerganovggml : bump version to 0.9.5 (ggml/1410)
2025-12-31 Anri Lombardquantize: prevent input/output file collision (#18451)
2025-12-31 Sigbjørn Skjæretconvert : lint fix (#18507)
2025-12-31 Henry147147mtmd : Adding support for Nvidia Music Flamingo Model...
2025-12-31 gatbontonpcmetal : add count_equal op (#18314)
2025-12-31 Johannes GäßlerCUDA: fix KQ max calculation (#18487)
2025-12-31 Georgi Gerganovmetal : remove BF16 x F16 kernels (#18456)
2025-12-31 Aman Guptasycl: add newline at the end of CMakeLists.txt (#18503)
2025-12-31 Rahul SatheWork around broken IntelSYCLConfig.cmake in Intel oneAP...
2025-12-30 Sigbjørn Skjæretdocker : add CUDA 13.1 image build (#18441)
2025-12-30 Bart Louwersdocs : document that JSON Schema is not available to...
2025-12-30 Aldehir Rojascommon : default content to an empty string (#18485)
2025-12-30 Daniel Beveniusllama : fix typo in comment in llama-kv-cache.h [no...
2025-12-30 Xuan-Son Nguyenlora: count lora nodes in graph_max_nodes (#18469)
2025-12-30 Jay Zenithsampling: reuse token data buffer in llama_sampler_samp...
2025-12-30 Jeff Bolzserver: fix files built redundantly (#18474)
2025-12-30 Charles Xukleidiai: add and integrate SVE 256-bit vector-length...
2025-12-30 Aman GuptaCUDA: add log line when mxfp4 acceleration is used...
2025-12-30 Daniel Beveniusmodel-conversion : use CONVERTED_MODEL for compare...
2025-12-29 Xuan-Son Nguyenwebui: fix prompt progress ETA calculation (#18468)
2025-12-29 PascalWebui/prompt processing progress (#18300)
2025-12-29 Johannes GäßlerCUDA: fix replacment of bad archs in CMake (#18457)
2025-12-29 wbtekserver : Cmdline arg -to changes http read timeout...
2025-12-29 Xuan-Son Nguyencontributing: tighten AI usage policy (#18388)
2025-12-29 Naco Sirenandroid: routine maintenance - Dec 2025 (#18338)
2025-12-29 Georgi Gerganovserver : handle closed connection for tasks (#18459)
2025-12-29 Daniel Beveniusmodel-conversion : add device option to embd run orig...
2025-12-29 Héctor Estrada... retrieval : use at most n_seq_max chunks (#18400)
2025-12-29 o7sicommon: fix return value check for setpriority (#18412)
2025-12-29 Johannes GäßlerCUDA: Blackwell features for non-native builds (#18436)
2025-12-29 Aman Guptacuda: fix race condition in cumsum (#18448)
2025-12-28 Tim Neumannci : re-enable rocm build on amd64 (#18439)
2025-12-28 uvosHIP: Use mmq on MFMA devices for MUL_MAT_ID in cases...
2025-12-28 momongamodel : Plamo3 support (#17304)
2025-12-28 Aman GuptaRevert "ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if...
2025-12-28 o7sirpc: fix segfault on invalid endpoint format (#18387)
2025-12-28 Johannes Gäßlerllama-fit-params: fix step size for last device (#18415)
2025-12-28 Johannes Gäßlergithub: update issue templates [no ci] (#18410)
2025-12-28 Xuan-Son Nguyenmtmd: clarify that we no longer accept AI-generated...
2025-12-28 Boian Berberovcmake: Added more x86_64 CPU backends when building...
2025-12-28 QDeltaggml-cuda: use CMAKE_CUDA_ARCHITECTURES if set when...
2025-12-27 lhezopencl: allow resizing transpose buffers (#18384)
2025-12-27 Johannes Gäßlerllama-fit-params: fix overflow check (#18354)
2025-12-27 Johannes Gäßlerllama: fix magic number of 999 for GPU layers (#18266)
2025-12-27 Aman Guptaggml-cuda: Use same regex for GGML_NATIVE=OFF (#18407)
2025-12-27 Johannes Gäßlerllama_fit_params: return enum for fail vs. error (...
2025-12-27 Johannes Gäßlerllama-fit-params: fix Gemma 3 calculation (#18372)
2025-12-26 Jeff Bolzvulkan: preprocess mul_mat_id experts and discard workg...
2025-12-26 Jeff Bolzvulkan: optimize decodeFuncB in coopmat2 mul_mat_id...
2025-12-26 Jeff Bolzvulkan: Use BK=32 for coopmat2 mul_mat_id (#18332)
2025-12-26 Evevulkan: small dequantization improvements (#18380)
2025-12-26 Jeff Bolzvulkan: Support UPSCALE w/antialias (#18327)
2025-12-26 Jeff Bolzvulkan: handle rope with large number of rows (#18306)
2025-12-26 o7siserver : fix crash when seq_rm fails for hybrid/recurre...
2025-12-26 Francisco Herreradocs: added note for pre SYCL Intel hardware (#18016)
2025-12-26 0MarbleCANN: implement the SSM_CONV operator (#17737)
2025-12-25 Aman Guptaggml-cuda: fix regex for arch list (#18371)
2025-12-25 Aman Guptacuda: optimize cumsum cub path (#18362)
2025-12-25 Aman Guptaggml-cuda: fix blackwell native builds (#18361)
2025-12-25 Penglin CaiCANN: Add support for CONV_TRANSPOSE_1D when kernel...
2025-12-25 Aadeshveer... ggml : optimize cuda cumsum fallback kernel (#18343)
2025-12-24 Xuan-Son Nguyenserver: (router) add stop-timeout option (#18350)
2025-12-24 Xuan-Son Nguyenmodel: support MiMo-V2-Flash (#18328)
2025-12-24 Aadeshveer... fit-params : fix race condition in fit-params output...
2025-12-24 Aman GuptaCUDA: experimental native mxfp4 support for blackwell...
2025-12-24 Saba Fallahmodel : support for LlamaBidirectionalModel architectur...
2025-12-24 Jeff Bolzvulkan: fix command buffer corruption in ggml_backend_v...
2025-12-24 Wang WeixuanCANN : refactor ACL graph cache (#17752)
2025-12-24 Jesse Ikonendocs: Fix typos in SYCL documentation (#18269)
2025-12-24 Ruben Ortlamvulkan: use fewer FA rows for small cache runs (#18280)
2025-12-24 TianHao324CANN: Uses yarn_ramp cache in ROPE (#17725)
2025-12-24 ddh0common: add `LLAMA_ARG_OVERRIDE_TENSOR` env var for...
2025-12-23 Xuan-Son Nguyenserver: return_progress to also report 0% processing...
2025-12-23 Pascalwebui: apply webui_settings on first load (#18223)
2025-12-23 Xuan-Son Nguyenserver: fix crash with model not having BOS/EOS (#18321)
2025-12-23 Daniel Beveniusmodel-conversion : add device option to run-org-model...
2025-12-23 Chris Rohlfrpc : add check for rpc buffer type (#18242)
2025-12-23 nullnameggml-hexagon: create generalized functions for cpu...
2025-12-23 Daniel Beveniusmodel-conversion : add trust_remote_code for embedding...
2025-12-23 Neo Zhang[SYCL] replace llama-cli by llama-completion to rm...
2025-12-23 Alessandro98-gitmodel : fix div-by-zero for Nemotron V2 (#18309)
2025-12-22 Ryan Mangenomodel : Granite Embedding support (#15641)
2025-12-22 compiladegguf-py : do not align the data start offset (#18291)
2025-12-22 Shouyuggml-hexagon: gelu optimization (#18151)
2025-12-22 Xuan-Son Nguyengen-docs: automatically update markdown file (#18294)
2025-12-22 Taimur Ahmadllamafile: add rvv support for sgemm kernels (#18199)
2025-12-22 lhezopencl: unpack q4_0 for adreno in get_tensor (#18278)
2025-12-22 Jeff Bolzvulkan: Extend rope fusions to allow mrope (#18264)
2025-12-22 Xuan-Son Nguyenserver: prevent data race from HTTP threads (#18263)
next