git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2025-05-11	Anthony Umfer	tools : fix uninitialized llama_batch in server (#13436)	commit \| commitdiff \| tree
2025-05-11	Sigbjørn Skjæret	scripts : exit compare-llama-bench.py gracefully when...	commit \| commitdiff \| tree
2025-05-11	Johannes Gäßler	CUDA: fix crash with partial offloading of MoE (#13439)	commit \| commitdiff \| tree
2025-05-11	David Huang	Add `--no-op-offload` to improve `-ot` pp perf in MoE...	commit \| commitdiff \| tree
2025-05-11	City	mtmd : support InternVL 3 38B and 78B mmproj (#13443)	commit \| commitdiff \| tree
2025-05-11	Xuan-Son Nguyen	mtmd : move helpers to dedicated file (#13442)	commit \| commitdiff \| tree
2025-05-10	Thomas Germer	docs : Fix typo in InternVL3 model name (#13440)	commit \| commitdiff \| tree
2025-05-10	Johannes Gäßler	CUDA: fix race conditions FlashAttention kernels (...	commit \| commitdiff \| tree
2025-05-10	Sigbjørn Skjæret	vocab : add ByteDance-Seed/Seed-Coder (#13423)	commit \| commitdiff \| tree
2025-05-10	Xuan-Son Nguyen	mtmd : add hard limit on image resolution for qwen2vl...	commit \| commitdiff \| tree
2025-05-10	Xuan-Son Nguyen	server : update docs (#13432)	commit \| commitdiff \| tree
2025-05-10	Sigbjørn Skjæret	llguidance : set tokenizer slices to default (#13424)	commit \| commitdiff \| tree
2025-05-10	Thammachart...	ci: free_disk_space flag enabled for intel variant...	commit \| commitdiff \| tree
2025-05-10	Xuan-Son Nguyen	mtmd : support InternVL 2.5 and 3 (#13422)	commit \| commitdiff \| tree
2025-05-10	Johannes Gäßler	CUDA: fix FlashAttention on Turing (#13415)	commit \| commitdiff \| tree
2025-05-10	Xuan-Son Nguyen	arg : add env var to control mmproj (#13416)	commit \| commitdiff \| tree
2025-05-10	Jeff Bolz	vulkan: scalar flash attention implementation (#13324)	commit \| commitdiff \| tree
2025-05-09	Helton Reis	chore(llguidance): use tagged version that does not...	commit \| commitdiff \| tree
2025-05-09	Xuan-Son Nguyen	server : vision support via libmtmd (#12898)	commit \| commitdiff \| tree
2025-05-09	Alberto Cabrera...	sycl : implementation of reordered Q4_0 MMVQ for Intel...	commit \| commitdiff \| tree
2025-05-09	Georgi Gerganov	metal : optimize MoE for large batches (#13388)	commit \| commitdiff \| tree
2025-05-09	Johannes Gäßler	CUDA: FA support for Deepseek (Ampere or newer) (#13306)	commit \| commitdiff \| tree
2025-05-09	Diego Devesa	llama : do not crash if there is no CPU backend (#13395)	commit \| commitdiff \| tree
2025-05-09	Johannes Gäßler	CUDA: fix crash on large batch size for MoE models...	commit \| commitdiff \| tree
2025-05-09	Bartowski	imatrix : Add --parse-special for enabling parsing...	commit \| commitdiff \| tree
2025-05-09	R0CKSTAR	llama-run: add support for downloading models from...	commit \| commitdiff \| tree
2025-05-09	Xuan-Son Nguyen	mtmd : fix batch_view for m-rope (#13397)	commit \| commitdiff \| tree
2025-05-09	Xuan-Son Nguyen	llama : one-off chat template fix for Mistral-Small...	commit \| commitdiff \| tree
2025-05-09	Radoslav Gerganov	rpc : add rpc_msg_set_tensor_hash_req (#13353)	commit \| commitdiff \| tree
2025-05-09	Jeff Bolz	vulkan: Allow up to 4096 elements for mul_mat_id row_id...	commit \| commitdiff \| tree
2025-05-09	Xuan-Son Nguyen	server : (webui) rename has_multimodal --> modalities...	commit \| commitdiff \| tree
2025-05-08	Diego Devesa	ci : limit write permission to only the release step... upstream/0.0.5318	commit \| commitdiff \| tree
2025-05-08	Matt Clayton	mtmd : Expose helper_decode_image_chunk (#13366)	commit \| commitdiff \| tree
2025-05-08	Xuan-Son Nguyen	server : (webui) fix a very small misalignment (#13387)	commit \| commitdiff \| tree
2025-05-08	Xuan-Son Nguyen	server : (webui) revamp the input area, plus many small...	commit \| commitdiff \| tree
2025-05-08	Sigbjørn Skjæret	convert : support rope_scaling type and rope_type ...	commit \| commitdiff \| tree
2025-05-08	welix	mtmd : fix the calculation of n_tokens for smolvlm...	commit \| commitdiff \| tree
2025-05-08	Georgi Gerganov	context : allow cache-less context for embeddings ...	commit \| commitdiff \| tree
2025-05-08	Georgi Gerganov	context : remove logits_all flag (#13284)	commit \| commitdiff \| tree
2025-05-08	Diego Devesa	ci : move release workflow to a separate file (#13362)	commit \| commitdiff \| tree
2025-05-08	Diego Devesa	llama : print size and type of overridden tensors ...	commit \| commitdiff \| tree
2025-05-08	Alberto Cabrera...	sycl: addressing non-contiguous src1 mul_mats (nc and...	commit \| commitdiff \| tree
2025-05-07	Diego Devesa	docker : disable arm64 and intel images (#13356)	commit \| commitdiff \| tree
2025-05-07	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-05-07	Daniel Bevenius	whisper: remove MSVC warnings pragmas (whisper/3090)	commit \| commitdiff \| tree
2025-05-07	Jared Tweed	cmake : removed stdc++fs (whisper/3097)	commit \| commitdiff \| tree
2025-05-07	Sigbjørn Skjæret	llama : deci : support ffn-free with attention (#13296)	commit \| commitdiff \| tree
2025-05-07	Ycros	common : Add a warning when we can't match samplers...	commit \| commitdiff \| tree
2025-05-07	R0CKSTAR	cuda : remove nrows_x in mul_mat_q_process_tile (#13325)	commit \| commitdiff \| tree
2025-05-07	Georgi Gerganov	examples : remove infill (#13283)	commit \| commitdiff \| tree
2025-05-07	piDack	llama : support tie embedding for chatglm models (...	commit \| commitdiff \| tree
2025-05-06	Johannes Gäßler	CUDA: mix virt/real CUDA archs for GGML_NATIVE=OFF...	commit \| commitdiff \| tree
2025-05-06	Xuan-Son Nguyen	clip : refactor graph builder (#13321)	commit \| commitdiff \| tree
2025-05-06	DocShotgun	sampling : make top_n_sigma no-op at <=0 or a single...	commit \| commitdiff \| tree
2025-05-06	oobabooga	sampling : don't consider -infinity values in top_n_sig...	commit \| commitdiff \| tree
2025-05-06	Diego Devesa	cmake : remove arm64 msvc presets (#13342)	commit \| commitdiff \| tree
2025-05-06	Akarshan Biswas	SYCL: Disable reorder optimize by default and stop...	commit \| commitdiff \| tree
2025-05-06	Xuan-Son Nguyen	llama : fix build_ffn without gate (#13336)	commit \| commitdiff \| tree
2025-05-06	Johannes Gäßler	CUDA: fix bad asserts for partial offload (#13337)	commit \| commitdiff \| tree
2025-05-06	Sigbjørn Skjæret	convert : qwen2/3moe : set yarn metadata if present...	commit \| commitdiff \| tree
2025-05-06	Johannes Gäßler	CUDA: fix --split-mode row for MMQ (#13323)	commit \| commitdiff \| tree
2025-05-06	compilade	gguf-py : avoid requiring pyside6 for other scripts... gguf-v0.16.3	commit \| commitdiff \| tree
2025-05-05	Johannes Gäßler	CUDA: fix logic for clearing padding with -ngl 0 (...	commit \| commitdiff \| tree
2025-05-05	oobabooga	sampling : Integrate Top-nσ into main sampling chain...	commit \| commitdiff \| tree
2025-05-05	igardev	server : Webui - change setText command from parent...	commit \| commitdiff \| tree
2025-05-05	Xuan-Son Nguyen	mtmd : rename llava directory to mtmd (#13311)	commit \| commitdiff \| tree
2025-05-05	Xuan-Son Nguyen	clip : fix confused naming ffn_up and ffn_down (#13290)	commit \| commitdiff \| tree
2025-05-05	Sigbjørn Skjæret	convert : bailingmoe : set yarn metadata if present...	commit \| commitdiff \| tree
2025-05-05	Akarshan Biswas	SYCL: Disable mul_mat kernels for noncontiguous tensor...	commit \| commitdiff \| tree
2025-05-04	Xuan-Son Nguyen	mtmd : add C public API (#13184)	commit \| commitdiff \| tree
2025-05-04	Diego Devesa	rpc : use backend registry, support dl backends (#13304)	commit \| commitdiff \| tree
2025-05-04	Aaron Teo	ggml : activate s390x simd for Q3_K (#13301)	commit \| commitdiff \| tree
2025-05-04	Diego Devesa	llava/mtmd : fixes to fully support dl backends (#13303)	commit \| commitdiff \| tree
2025-05-04	Diego Devesa	llama : build windows releases with dl backends (#13220)	commit \| commitdiff \| tree
2025-05-04	Johannes Gäßler	CUDA: fix race condition in MMQ stream-k fixup (#13299)	commit \| commitdiff \| tree
2025-05-04	Johannes Gäßler	CUDA: fix race condition in MMQ ids_dst (#13294)	commit \| commitdiff \| tree
2025-05-04	Jeff Bolz	vulkan: Additional type support for unary, binary,...	commit \| commitdiff \| tree
2025-05-03	Johannes Gäßler	imatrix: fix oob writes if src1 is not contiguous ...	commit \| commitdiff \| tree
2025-05-03	Xuan-Son Nguyen	clip : revert the change of BOI/EOI token for GLM-edge...	commit \| commitdiff \| tree
2025-05-03	ymcki	llama : Llama-3_1-Nemotron-Ultra-253B-v1 support (...	commit \| commitdiff \| tree
2025-05-02	Diego Devesa	llama : move end-user examples to tools directory ...	commit \| commitdiff \| tree
2025-05-02	Georgi Gerganov	sync : ggml (#13268)	commit \| commitdiff \| tree
2025-05-02	Georgi Gerganov	context : fix reorder logic (#13267)	commit \| commitdiff \| tree
2025-05-02	shalinib-ibm	ggml : Enable MMA for BF16 in llamafile_sgemm (#13148)	commit \| commitdiff \| tree
2025-05-02	Jared Van Bortel	llama-model : support Qwen2 embedding models and poolin...	commit \| commitdiff \| tree
2025-05-02	Jared Van Bortel	convert : use correct context length for nomic-embed...	commit \| commitdiff \| tree
2025-05-02	Xuan-Son Nguyen	convert : converting mmproj for Qwen2/2.5VL from conver...	commit \| commitdiff \| tree
2025-05-02	Georgi Gerganov	kv-cache : separate recurrent vs non-recurrent impl...	commit \| commitdiff \| tree
2025-05-02	Sigbjørn Skjæret	llama : orion rope type is neox (#13261)	commit \| commitdiff \| tree
2025-05-02	Sigbjørn Skjæret	llama : plamo rope type is neox (#13260)	commit \| commitdiff \| tree
2025-05-02	piDack	llama-chat : reset glmedge chat template (#13253)	commit \| commitdiff \| tree
2025-05-02	Shakil Ahmed	mtmd-cli : fix out_of_range when input image path is...	commit \| commitdiff \| tree
2025-05-02	Georgi Gerganov	server : add cache reuse card link to help (#13230)	commit \| commitdiff \| tree
2025-05-02	Xuan-Son Nguyen	convert : explicitly disable trust_remote_code for...	commit \| commitdiff \| tree
2025-05-01	bandoti	ci: fix cross-compile sync issues (#12804)	commit \| commitdiff \| tree
2025-05-01	Justin Santa...	rpc : avoid uninitialized memory in serialize_tensor...	commit \| commitdiff \| tree
2025-05-01	Jesse Gross	ggml: Don't assert fail when tensor data changes (...	commit \| commitdiff \| tree
2025-05-01	Diego Devesa	build : fix build info on windows (#13239)	commit \| commitdiff \| tree
2025-05-01	Loïc Carrère	clip : (minicpmv) Re-enable upscaling of images smaller...	commit \| commitdiff \| tree
2025-05-01	matteo	llama-chat : update GLM4 chat template (#13238)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom