git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2025-05-01  Xuan-Son Nguyen  mtmd : add vision support for Mistral Small 3.1...
2025-05-01  Xuan-Son Nguyen  arg : remove CURLINFO_EFFECTIVE_METHOD (#13228)
2025-05-01  Jared Van Bortel  llama-model : fix the reported size class for nomic...
2025-05-01  Georgi Gerganov  sync : ggml
2025-05-01  Diego Devesa  ggml : fix ggml_gallocr_ptr type (ggml/1205)
2025-05-01  Georgi Gerganov  cuda : fix unused variable compile warning (whisper/0)
2025-04-30  Johannes Gäßler  CUDA: batched+noncont MMQ, refactor bs>1 MoE code ...
2025-04-30  Xuan-Son Nguyen  arg : -hf do not fail if url mismatch (#13219)
2025-04-30  ddh0  fix typo: `n_ctx_pre_seq` -> `n_ctx_per_seq` (#13221)
2025-04-30  Xuan-Son Nguyen  convert : improve model arch handling (#13122)
2025-04-30  Tatsuya Tanaka  llava : remove duplicate include (#13207)
2025-04-30  Olivier Chafik  common : add -jf / --json-schema-file flag (#12011)
2025-04-30  Jeff Bolz  vulkan: use uint array index to avoid glslang bug ...
2025-04-30  shalinib-ibm  ggml : fix ppc64le build (#13176)
2025-04-30  Xuan-Son Nguyen  convert : correct typo image_mean --> image_std (#13208)
2025-04-30  Aaron Teo  feat(ggml-cpu): enable z17 compile (#13182)
2025-04-30  Xuan-Son Nguyen  arg : allow using -hf offline (#13202)
2025-04-30  Xuan-Son Nguyen  docker : do not build tests (#13204)
2025-04-30  xiaofei  rpc : fix cache directory initialization (#13188)
2025-04-29  Johannes Gäßler  scripts: n_depth for compare-llama-bench [no ci] (...
2025-04-29  matteo  server : Prefilling assistant message in openai compati...
2025-04-29  Georgi Gerganov  sampling : when top-k <= 0 -> noop (#13173)
2025-04-29  Alberto Cabrera...  llama-bench: fixed size of fields to correctly map...
2025-04-29  Johannes Gäßler  CUDA: fix non-cont. inputs for batched mat mul (#13155)
2025-04-29  Sigbjørn Skjæret  llama : llm_type order by size (#13177)
2025-04-29  Xuan-Son Nguyen  mtmd : add qwen2vl and qwen2.5vl (#13141)
2025-04-29  Sigbjørn Skjæret  llama : set qwen3 model type sizes (#13175)
2025-04-29  Xuan-Son Nguyen  llama-graph : fix text position for mrope (#13159)
2025-04-28  AT  model : Nomic Embed Text V2 with Mixture-of-Experts...
2025-04-28  Xuan-Son Nguyen  clip : fix model size display (#13153)
2025-04-28  Ville Vesilehto  fix(rpc): Improve input validation and error handling...
2025-04-28  Vishal Agarwal  llama-bench: add `-d` depth arg (#13096)
2025-04-28  Xuan-Son Nguyen  mtmd : fix glm-edge redundant token count (#13139)
2025-04-28  pockers21  context : do not clear output buffer on reserve (#13152)
2025-04-28  Xuan-Son Nguyen  llama : (mrope) allow using normal 1D position for...
2025-04-28  Xuan-Son Nguyen  clip : refactor set input for cgraph + fix qwen2.5vl...
2025-04-28  Akarshan Biswas  SYCL: Add all missing unary kernels (#13074)
2025-04-28  Georgi Gerganov  readme : update hot topics (#13150)
2025-04-28  Georgi Gerganov  common : fix noreturn compile warning (#13151)
2025-04-28  Xuan-Son Nguyen  llama-chat : fix typo GML --> GLM (#13143)
2025-04-28  R0CKSTAR  musa: fix typo in cc control (#13144)
2025-04-28  Johannes Gäßler  CUDA: fix q_nope_absorbed prec for DS 2 Lite f16 (...
2025-04-28  Xuan-Son Nguyen  arg : fix unused variable (#13142)
2025-04-27  4onen  llama-bench : Add `--override-tensors` arg (#12922)
2025-04-27  matteo  llama-chat : fix wrong template in GLM4-0414 (#13140)
2025-04-27  R0CKSTAR  musa: fix build warning (#13129)
2025-04-27  LostRuins Concedo  Fixes Qwen2.5VL segfault during inference with https...
2025-04-27  HimariO  clip : Add Qwen2.5VL support (#12402)
2025-04-26  Xuan-Son Nguyen  common : add common_remote_get_content (#13123)
2025-04-26  Xuan-Son Nguyen  clip : improve projector naming (#13118)
2025-04-26  SXX  ggml: move fp16/bf16 conversion optimizations to CPU...
2025-04-26  frob  grammar : handle maxItems == 0 in JSON schema (#13117)
2025-04-25  Diego Devesa  llama : fix K-shift with quantized K and BLAS backend...
2025-04-25  City  Force FP32 compute in GLM4 FFN Down (#13101)
2025-04-25  Xuan-Son Nguyen  clip : fix pixtral on some GPU backends (#13097)
2025-04-25  Neo Zhang Jianyu  change the reorder tensor from init to execute OP ...
2025-04-25  Radoslav Gerganov  rpc : do not wait for response when sending RPC_CMD_SET...
2025-04-24  Xuan-Son Nguyen  clip : remove boi/eoi embeddings for GLM-edge model...
2025-04-24  Georgi Gerganov  embeddings : fix batch sizes (#13076) upstream/0.0.5185
2025-04-24  Georgi Gerganov  ggml : fix trailing whitespaces (#0)
2025-04-24  Georgi Gerganov  sync : ggml
2025-04-24  Acly  ggml : Depthwise 2D convolution (ggml/1152)
2025-04-24  Johannes Gäßler  CUDA: use switch statements in constexpr functions...
2025-04-24  Georgi Gerganov  cmake : do not include ./src as public for libllama...
2025-04-24  Georgi Gerganov  clang-tidy : disable warning about missing math parenth...
2025-04-24  Xuan-Son Nguyen  arg : add --no-mmproj-offload (#13093)
2025-04-24  Xuan-Son Nguyen  arg : clean up handling --mmproj with -hf (#13082)
2025-04-24  Georgi Gerganov  metal : fix floating-point range of attention scores...
2025-04-24  Eve  vulkan: matmul gcn tuning (#13016)
2025-04-23  pl752  llama-mtmd-cli: Sigint rework in mtmd vision example...
2025-04-23  Xuan-Son Nguyen  mtmd : Support Pixtral 12B (#13065)
2025-04-23  piDack  convert : Append mult-eos,half-rope,bos to GLM4-0414...
2025-04-23  Radoslav Gerganov  rpc : add command line option for number of threads...
2025-04-22  Johannes Gäßler  CUDA: noncont MMVQ + batched bs1 MUL_MAT_ID (#13014)
2025-04-22  Xuan-Son Nguyen  mtmd : support SmolVLM (version 1 and 2) (#13050)
2025-04-22  Georgi Gerganov  security : add note about RPC and server functionality...
2025-04-22  Georgi Gerganov  metal : add memory pool for temp allocs (#12850)
2025-04-22  Xuan-Son Nguyen  llava : update documentations (#13055)
2025-04-21  Diego Devesa  ggml : add SSE 4.2 and x64 base variant for CPUs withou...
2025-04-21  Akarshan Biswas  SYCL: Add non-contiguous support in ROPE (#12993)
2025-04-21  Xuan-Son Nguyen  mtmd : merge llava, gemma3 and minicpmv CLI into single...
2025-04-20  Xuan-Son Nguyen  convert : experimental support for `--mmproj` flag...
2025-04-20  Jeffrey Morgan  llava: fix errors in clip.h on certain compilers (...
2025-04-20  Jeff Bolz  vulkan: support noncontiguous rms_norm (#13031)
2025-04-20  Jeffrey Morgan  metal: add neg operator (#13029)
2025-04-19  bandoti  Disable CI cross-compile builds (#13022)
2025-04-19  Sigbjørn Skjæret  gguf-py : fix upload python package workflow (#13020) gguf-v0.16.2
2025-04-19  Xuan-Son Nguyen  clip : refactor, add `image_manipulation` and `llava_uh...
2025-04-18  Daniel Tang  main : Fix Ctrl+D/newline handling (#12951)
2025-04-18  Chris Thompson  gguf-py : GGUF Editor GUI - Python + Qt6 (#12930) gguf-v0.16.1
2025-04-18  Xuan-Son Nguyen  server : use std::move whenever possible (#12936)
2025-04-18  Akarshan Biswas  SYCL: Refactor and enable FP16 in binary broadcast...
2025-04-18  Xuan-Son Nguyen  mtmd : add methods to access `mtmd_image_tokens` (...
2025-04-18  Radoslav Gerganov  rpc : add RPC_CMD_HELLO (#12955)
2025-04-17  Georgi Gerganov  graph : make FA compatible with MLA + add initial Metal...
2025-04-17  Alan Gray  ggml: Re-enable CUDA graphs in presence of CONT and...
2025-04-17  hipudding  CANN: Add support for async operator submission (#12864)
2025-04-17  Mikko Juola  llama : recognize IBM Granite 3.3 FIM tokens (#12988)
2025-04-16  kimminsu  opencl: fix incorrect local_size index in profiling...
2025-04-16  Jeff Bolz  vulkan: enable coopmat2 FA gqa and split_k optimization...