git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2025-05-20	Georgi Gerganov	kv-cache : add SWA support (#13194)	commit \| commitdiff \| tree
2025-05-20	Xinpeng Dou	CANN: Update CANN model support (#13162)	commit \| commitdiff \| tree
2025-05-20	Nicolò Scipione	sycl : Overcoming workaround for mmap() allocation...	commit \| commitdiff \| tree
2025-05-19	psocolovsky	common : add load_progress_callback (#13617)	commit \| commitdiff \| tree
2025-05-19	0cc4m	Vulkan: Add f32 accumulator support to quantized mul...	commit \| commitdiff \| tree
2025-05-19	Alberto Cabrera...	sycl : backend documentation review (#13544)	commit \| commitdiff \| tree
2025-05-19	Xuan-Son Nguyen	mtmd : add vision support for llama 4 (#13282)	commit \| commitdiff \| tree
2025-05-19	Alberto Cabrera...	ci : upgraded oneAPI version in SYCL workflows and...	commit \| commitdiff \| tree
2025-05-19	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-05-19	Johannes Gäßler	mnist: fix segmentation fault (ggml/1227)	commit \| commitdiff \| tree
2025-05-19	Diego Devesa	ggml : fix apple OS check in ggml_print_backtrace ...	commit \| commitdiff \| tree
2025-05-19	Daniel Tang	ggml : Fix missing backtrace on Linux (ggml/1228)	commit \| commitdiff \| tree
2025-05-19	Nick	fix: check model pointer validity before use (#13631)	commit \| commitdiff \| tree
2025-05-19	Chenguang Li	CANN: Support MOE Model MUL_MAT_ID (#13042)	commit \| commitdiff \| tree
2025-05-17	Isaac McFadyen	server : added --no-prefill-assistant flag (#13608)	commit \| commitdiff \| tree
2025-05-17	Gilad S.	cmake: use the current build config for vulkan-shaders...	commit \| commitdiff \| tree
2025-05-17	Georgi Gerganov	parallel : add option for non-shared and larger prompts...	commit \| commitdiff \| tree
2025-05-17	Jeff Bolz	vulkan: move common FA code to flash_attn_base.comp...	commit \| commitdiff \| tree
2025-05-17	Jeff Bolz	vulkan: use scalar FA rather than coopmat2 when N=...	commit \| commitdiff \| tree
2025-05-16	Z	llguidance : official v0.7.20 release (no actual change...	commit \| commitdiff \| tree
2025-05-16	Xuan-Son Nguyen	server : do not return error out of context (with ctx...	commit \| commitdiff \| tree
2025-05-16	Xuan-Son Nguyen	webui : improve accessibility for visually impaired...	commit \| commitdiff \| tree
2025-05-16	Xuan-Son Nguyen	readme : add list of dependencies and their license...	commit \| commitdiff \| tree
2025-05-16	Diego Devesa	releases : use arm version of curl for arm releases...	commit \| commitdiff \| tree
2025-05-16	Georgi Gerganov	metal : add FA-vec kernel for head size 64 (#13583)	commit \| commitdiff \| tree
2025-05-16	Diego Devesa	llama : print hint when loading a model when no backend...	commit \| commitdiff \| tree
2025-05-16	Sigbjørn Skjæret	ci : add ppc64el to build-linux-cross (#13575)	commit \| commitdiff \| tree
2025-05-16	Łukasz Ślusarczyk	sycl : fixed compilation warnings (#13582)	commit \| commitdiff \| tree
2025-05-15	Olivier Chafik	minja: sync (qwen3) (#13573)	commit \| commitdiff \| tree
2025-05-15	Diego Devesa	gguf : use ggml log system (#13571)	commit \| commitdiff \| tree
2025-05-15	Daniel Tang	gguf-py : fix disconnect-before-connect in editor-gui...	commit \| commitdiff \| tree
2025-05-15	Xuan-Son Nguyen	convert : fix conversion for llama 4 (#13567)	commit \| commitdiff \| tree
2025-05-15	Atharva Dubey	sycl: simplify bin_bcast_kernel (#13383)	commit \| commitdiff \| tree
2025-05-15	Svetlozar Georgiev	sycl: reordered Q4_K MMVQ (#13109)	commit \| commitdiff \| tree
2025-05-15	Łukasz Ślusarczyk	sycl: use oneDNN for matrices multiplication (#12972)	commit \| commitdiff \| tree
2025-05-15	Diego Devesa	llama-bench : fix -ot with dl backends (#13563)	commit \| commitdiff \| tree
2025-05-15	Xuan-Son Nguyen	webui : handle PDF input (as text or image) + convert...	commit \| commitdiff \| tree
2025-05-15	Piotr Wilkin...	server : proper error handling for missing elements...	commit \| commitdiff \| tree
2025-05-15	Georgi Gerganov	bench : handle decode errors (#13548)	commit \| commitdiff \| tree
2025-05-15	Olivier Chafik	`server`: inject date_string in llama 3.x template...	commit \| commitdiff \| tree
2025-05-14	Georgi Gerganov	kv-cache : fix out-of-bounds view during reserve graph...	commit \| commitdiff \| tree
2025-05-14	Yibo Cai	arm64: optimize q6_k_q8_k kernel with i8mm (#13519)	commit \| commitdiff \| tree
2025-05-14	Olivier Chafik	`common`: add partial regex support (#12808)	commit \| commitdiff \| tree
2025-05-14	Sigbjørn Skjæret	editorconfig : fix trailing whitespace from #13542...	commit \| commitdiff \| tree
2025-05-14	Gilad S.	fix: crash when calling `llama_state_get_size` on a...	commit \| commitdiff \| tree
2025-05-14	Johannes Gäßler	CUDA: fix crash on large batch size for quant. MoE...	commit \| commitdiff \| tree
2025-05-14	Diego Devesa	llama : fix quantize with dl backends (#13539)	commit \| commitdiff \| tree
2025-05-14	Johannes Gäßler	CUDA: faster Deepseek FA, add Turing support (#13435)	commit \| commitdiff \| tree
2025-05-14	Gabe Goodhart	fix: Move build_inp_pos to the top of the graph section...	commit \| commitdiff \| tree
2025-05-14	Georgi Gerganov	server : passthrough the /models endpoint during loadin...	commit \| commitdiff \| tree
2025-05-14	Xuan-Son Nguyen	server : fix cache_tokens bug with no cache_prompt...	commit \| commitdiff \| tree
2025-05-14	bandoti	cmake: simplify vulkan shader test logic (#13263)	commit \| commitdiff \| tree
2025-05-14	Jeff Bolz	vulkan: KHR_coopmat flash attention (#13506)	commit \| commitdiff \| tree
2025-05-14	Xuan-Son Nguyen	webui : use fflate for more deterministic gzip compress...	commit \| commitdiff \| tree
2025-05-14	Luca Stefani	webui: Allow pasting file from clipboard (#13526)	commit \| commitdiff \| tree
2025-05-14	ddpasa	docs: Update link to ggml-org in multimodal.md (#13513)	commit \| commitdiff \| tree
2025-05-14	Sigbjørn Skjæret	scripts : fix compare-llama-bench.py show parameter...	commit \| commitdiff \| tree
2025-05-14	Jeff Bolz	vulkan: workaround FA compile failures on macos (#13517)	commit \| commitdiff \| tree
2025-05-13	Ed Addario	quantize : improve tensor-type pattern matching (#13033)	commit \| commitdiff \| tree
2025-05-13	Xuan-Son Nguyen	clip : clip.h become private API (⚠️ breaking change...	commit \| commitdiff \| tree
2025-05-13	Georgi Gerganov	metal : use FA-vec kernel up to batch size 20 (#13496)	commit \| commitdiff \| tree
2025-05-13	Georgi Gerganov	metal : optimize multi-sequence FA vec kernel (#13493)	commit \| commitdiff \| tree
2025-05-13	Dan Johansson	ggml-cpu: Update KleidiAI to v1.6 and fix include direc...	commit \| commitdiff \| tree
2025-05-13	Georgi Gerganov	batched-bench : fix pp batch contents (#13492)	commit \| commitdiff \| tree
2025-05-13	Xuan-Son Nguyen	mtmd : remove libllava, remove clip-quantize-cli (...	commit \| commitdiff \| tree
2025-05-13	Sigbjørn Skjæret	scripts : support arbitrary input file formats in compa...	commit \| commitdiff \| tree
2025-05-13	Gabe Goodhart	model : Granite MoE shared (#13269)	commit \| commitdiff \| tree
2025-05-13	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-05-12	Diego Devesa	llama-bench : add defrag-thold, check for invalid range...	commit \| commitdiff \| tree
2025-05-12	lhez	opencl: remove unnecessary assert for `add` (#13257)	commit \| commitdiff \| tree
2025-05-12	Xuan-Son Nguyen	clip : cap max image size 1024 for qwen vl model (...	commit \| commitdiff \| tree
2025-05-12	Johannes Gäßler	llama/ggml: add LLM training support (#10544)	commit \| commitdiff \| tree
2025-05-12	Georgi Gerganov	context : fix state io for memory-less contexts (#13470)	commit \| commitdiff \| tree
2025-05-12	Anudit Nagar	server : allow content to be null in oaicompat_completi...	commit \| commitdiff \| tree
2025-05-12	Diego Devesa	llama-bench : accept ranges for integer parameters...	commit \| commitdiff \| tree
2025-05-12	Dan Johansson	ggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel...	commit \| commitdiff \| tree
2025-05-12	Johannes Gäßler	CUDA: fix misaligned synchronization in FA (#13469)	commit \| commitdiff \| tree
2025-05-12	Xuan-Son Nguyen	ggml : add mrope kernel for metal (#13457)	commit \| commitdiff \| tree
2025-05-12	Atharva Dubey	enable dpcpp nightly builds with libraries (#13406)	commit \| commitdiff \| tree
2025-05-11	City	mtmd : Use RMS norm for InternVL 3 38B and 78B mmproj...	commit \| commitdiff \| tree
2025-05-11	Anthony Umfer	tools : fix uninitialized llama_batch in server (#13436)	commit \| commitdiff \| tree
2025-05-11	Sigbjørn Skjæret	scripts : exit compare-llama-bench.py gracefully when...	commit \| commitdiff \| tree
2025-05-11	Johannes Gäßler	CUDA: fix crash with partial offloading of MoE (#13439)	commit \| commitdiff \| tree
2025-05-11	David Huang	Add `--no-op-offload` to improve `-ot` pp perf in MoE...	commit \| commitdiff \| tree
2025-05-11	City	mtmd : support InternVL 3 38B and 78B mmproj (#13443)	commit \| commitdiff \| tree
2025-05-11	Xuan-Son Nguyen	mtmd : move helpers to dedicated file (#13442)	commit \| commitdiff \| tree
2025-05-10	Thomas Germer	docs : Fix typo in InternVL3 model name (#13440)	commit \| commitdiff \| tree
2025-05-10	Johannes Gäßler	CUDA: fix race conditions FlashAttention kernels (...	commit \| commitdiff \| tree
2025-05-10	Sigbjørn Skjæret	vocab : add ByteDance-Seed/Seed-Coder (#13423)	commit \| commitdiff \| tree
2025-05-10	Xuan-Son Nguyen	mtmd : add hard limit on image resolution for qwen2vl...	commit \| commitdiff \| tree
2025-05-10	Xuan-Son Nguyen	server : update docs (#13432)	commit \| commitdiff \| tree
2025-05-10	Sigbjørn Skjæret	llguidance : set tokenizer slices to default (#13424)	commit \| commitdiff \| tree
2025-05-10	Thammachart...	ci: free_disk_space flag enabled for intel variant...	commit \| commitdiff \| tree
2025-05-10	Xuan-Son Nguyen	mtmd : support InternVL 2.5 and 3 (#13422)	commit \| commitdiff \| tree
2025-05-10	Johannes Gäßler	CUDA: fix FlashAttention on Turing (#13415)	commit \| commitdiff \| tree
2025-05-10	Xuan-Son Nguyen	arg : add env var to control mmproj (#13416)	commit \| commitdiff \| tree
2025-05-10	Jeff Bolz	vulkan: scalar flash attention implementation (#13324)	commit \| commitdiff \| tree
2025-05-09	Helton Reis	chore(llguidance): use tagged version that does not...	commit \| commitdiff \| tree
2025-05-09	Xuan-Son Nguyen	server : vision support via libmtmd (#12898)	commit \| commitdiff \| tree
2025-05-09	Alberto Cabrera...	sycl : implementation of reordered Q4_0 MMVQ for Intel...	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom