git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

2026-03-08	Jeff Bolz	vulkan: Fix data races in coopmat1 mul_mat(_id) (#20084)	commit \| commitdiff \| tree
2026-03-08	Johannes Gäßler	llama: end-to-end tests (#19802)	commit \| commitdiff \| tree
2026-03-08	Christopher...	readme : update infra list (#20212)	commit \| commitdiff \| tree
2026-03-08	Piotr Wilkin...	Revert to OAI-compatible args (#20213)	commit \| commitdiff \| tree
2026-03-08	decahedron1	server : correct index on finish in OAI completion...	commit \| commitdiff \| tree
2026-03-08	Neo Zhang	[SYCL] support Flash Attention for fp32/fp16/Q4/Q5/Q8...	commit \| commitdiff \| tree
2026-03-07	Aman Gupta	ggml: add GATED_DELTA_NET op (#19504)	commit \| commitdiff \| tree
2026-03-07	lhez	opencl: add l2_norm (#20160)	commit \| commitdiff \| tree
2026-03-07	Piotr Wilkin...	Autoparser: True streaming (#20177)	commit \| commitdiff \| tree
2026-03-06	Piotr Wilkin...	Autoparser: add optional argument reshuffle capability...	commit \| commitdiff \| tree
2026-03-06	Bartowski	quants : Add memsets and other fixes for IQ quants...	commit \| commitdiff \| tree
2026-03-06	Piotr Wilkin...	Add @pwilkin to CODEOWNERS for autoparser code (#20174)	commit \| commitdiff \| tree
2026-03-06	Piotr Wilkin...	Autoparser - complete refactoring of parser architectur...	commit \| commitdiff \| tree
2026-03-06	Todor Boinovski	hexagon: add f32 ssm_conv op (#20122)	commit \| commitdiff \| tree
2026-03-06	Tom Vaucourt	server : preserve anthropic thinking blocks in conversi...	commit \| commitdiff \| tree
2026-03-06	Max Krasnyansky	cpu: skip redundant ROPE cache updates (#20149)	commit \| commitdiff \| tree
2026-03-06	Aman Gupta	ggml-cuda: add mem check for fusion (#19916)	commit \| commitdiff \| tree
2026-03-06	Aaron Teo	ggml: update comments for backends which have no memory...	commit \| commitdiff \| tree
2026-03-06	shalinib-ibm	ggml-cpu: Fix gcc 15 ICE on ppc64le (#20083) (#20130)	commit \| commitdiff \| tree
2026-03-06	Aman Gupta	CUDA: use shared mem for ssm_conv (#20128)	commit \| commitdiff \| tree
2026-03-06	Tim Neumann	context: ignore zero scale LoRAs when checking sameness...	commit \| commitdiff \| tree
2026-03-06	Piotr Wilkin...	Checkpoint every n tokens: squash (#20087)	commit \| commitdiff \| tree
2026-03-06	Aleksander...	webui: Agentic Loop + MCP Client with support for Tools...	commit \| commitdiff \| tree
2026-03-06	Johannes Gäßler	ggml-cpu: fix data race for debug asserts (#20148)	commit \| commitdiff \| tree
2026-03-06	Georgi Gerganov	kv-cache : fix M-RoPE checkpoints (#20132)	commit \| commitdiff \| tree
2026-03-06	Roj234	cli : Don't clear system prompt when using '/clear...	commit \| commitdiff \| tree
2026-03-06	lhez	opencl: add neg, exp and diag (#20127)	commit \| commitdiff \| tree
2026-03-06	YardenTal44	hexagon: add fp16 support for binary ops: add,sub,mul...	commit \| commitdiff \| tree
2026-03-05	ymcki	models : kda chunk size = 16 (#19827)	commit \| commitdiff \| tree
2026-03-05	Andreas Kieslinger	CUDA: Improve performance via less synchronizations...	commit \| commitdiff \| tree
2026-03-05	Eric Zhang	model : update Qwen3.5 model type detection (#20126)	commit \| commitdiff \| tree
2026-03-05	Sigbjørn Skjæret	cli : add command and file auto-completion (#19985)	commit \| commitdiff \| tree
2026-03-05	Sigbjørn Skjæret	convert : register Qwen 3.5 ForCausalLM for text only...	commit \| commitdiff \| tree
2026-03-05	Aleksander...	webui: Improvements for Models Selector UI (#20066)	commit \| commitdiff \| tree
2026-03-05	Marcel Petrick	chore : correct typos [no ci] (#20041)	commit \| commitdiff \| tree
2026-03-05	Max Krasnyansky	hexagon: Flash Attention optimizations (dma, mpyacc...	commit \| commitdiff \| tree
2026-03-05	lhez	opencl: add `SET`, support i32 for `CPY`, minor refacto...	commit \| commitdiff \| tree
2026-03-04	Todor Boinovski	hexagon: add llama-completion runner script (#20095)	commit \| commitdiff \| tree
2026-03-04	Nikhil Jain	[WebGPU] Fix wait logic for inflight jobs (#20096)	commit \| commitdiff \| tree
2026-03-04	Masashi Yoshimura	Add concat op to webgpu. (#20068)	commit \| commitdiff \| tree
2026-03-04	Sigbjørn Skjæret	tools : add missing clocale include in mtmd-cli [no...	commit \| commitdiff \| tree
2026-03-04	Johannes Gäßler	ggml: fix ggml_is_contiguous_n for ne == 1 (#20092)	commit \| commitdiff \| tree
2026-03-04	Adrien Gallouët	ggml : use a simple std::thread in AMX without OpenMP...	commit \| commitdiff \| tree
2026-03-04	ddh0	impl : use 6 digits for tensor dims (#20094)	commit \| commitdiff \| tree
2026-03-04	SamareshSingh	Fix locale-dependent float printing in GGUF metadata...	commit \| commitdiff \| tree
2026-03-04	standby24x7	completion : Fix a typo in warning message (#20082)	commit \| commitdiff \| tree
2026-03-03	Mickael Desgranges	docs: Fix intel documentation link (#20040)	commit \| commitdiff \| tree
2026-03-03	Charles Xu	kleidiai : add sme fp16 compute path for q4_0 gemm...	commit \| commitdiff \| tree
2026-03-03	shaofeiqi	opencl: add optimized q4_1 mm kernel for adreno (#19840)	commit \| commitdiff \| tree
2026-03-03	Abhijit Ramesh	ggml webgpu: fix workgroup dispatch limit for large...	commit \| commitdiff \| tree
2026-03-02	Nikhil Jain	ggml webgpu: Clean up per-thread parameter buffer pool...	commit \| commitdiff \| tree
2026-03-02	Masashi Yoshimura	ggml-webgpu: Support non-contiguous `src0` and overlapp...	commit \| commitdiff \| tree
2026-03-02	Ruben Ortlam	vulkan: tune MMVQ for Intel Windows (#19988)	commit \| commitdiff \| tree
2026-03-02	Adrien Gallouët	scripts : improve get-wikitext-2.sh (#19952)	commit \| commitdiff \| tree
2026-03-02	Aaron Teo	ggml-cpu: optimise s390x multiply extend instructions...	commit \| commitdiff \| tree
2026-03-01	Ruben Ortlam	vulkan: improve partial offloading performance on AMD...	commit \| commitdiff \| tree
2026-03-01	oobabooga	cuda: cap grid.y at 65535 in non-contiguous dequantize...	commit \| commitdiff \| tree
2026-02-28	Dmitry Atamanov	vendors : update miniaudio library to 0.11.24 (#19914)	commit \| commitdiff \| tree
2026-02-28	Adrien Gallouët	vendor : update cpp-httplib to 0.35.0 (#19969)	commit \| commitdiff \| tree
2026-02-28	Bartowski	tests : model metadata loading from huggingface (#19796)	commit \| commitdiff \| tree
2026-02-27	Jayant Lohia	CUDA: add CDNA3 MFMA support for flash attention MMA...	commit \| commitdiff \| tree
2026-02-27	Roj234	server: Add pragma once to server-context.h (#19944)	commit \| commitdiff \| tree
2026-02-27	Sami Kama	server: Mirroring /v1/responses to /responses to match...	commit \| commitdiff \| tree
2026-02-27	Daniel Bevenius	ci : use ubuntu-latest for gguf-publish workflow (...	commit \| commitdiff \| tree
2026-02-27	Aman Gupta	ggml-cpu: add repack for mxfp4 (#19738)	commit \| commitdiff \| tree
2026-02-27	Daniel Bevenius	gguf-py : bump version to 0.18.0 (#19950) gguf-v0.18.0	commit \| commitdiff \| tree
2026-02-27	Pascal	server : support multiple model aliases via comma-separ...	commit \| commitdiff \| tree
2026-02-27	Jan Patrick...	tests : enable test-chat out of tree build (#19558)	commit \| commitdiff \| tree
2026-02-27	Neo Zhang	replace the magic number 768 by max work group size...	commit \| commitdiff \| tree
2026-02-27	Vishal Singh	ggml-zendnn: update code for latest ZenDNN API (#19923)	commit \| commitdiff \| tree
2026-02-26	Adrien Gallouët	ggml : fix AMX and add batched support (#19925)	commit \| commitdiff \| tree
2026-02-26	Ruben Ortlam	vulkan: fix fp16 Flash Attention on Windows AMD RDNA2...	commit \| commitdiff \| tree
2026-02-26	Georgi Gerganov	mtmd : fix padding of n_tokens (#19930)	commit \| commitdiff \| tree
2026-02-26	Georgi Gerganov	server : fix ctx checkpoint restore logic (#19924)	commit \| commitdiff \| tree
2026-02-26	Georgi Gerganov	kv-cache : fix can_shift() check to take into account...	commit \| commitdiff \| tree
2026-02-26	Aman Gupta	llama: Add option to merge gate and exp weights (#19139)	commit \| commitdiff \| tree
2026-02-26	Kevin Pouget	ggml-virtgpu: improve the reliability of the code ...	commit \| commitdiff \| tree
2026-02-26	drrros	server: fix load-on-startup not respected in ini file...	commit \| commitdiff \| tree
2026-02-26	Eric Zhang	jinja : correct default size for string slices (#19913)	commit \| commitdiff \| tree
2026-02-26	Maximilian...	model : add Jina Embeddings v5 Nano (partial EuroBERT...	commit \| commitdiff \| tree
2026-02-26	Georgi Gerganov	gguf : avoid too many file size calls (#19919)	commit \| commitdiff \| tree
2026-02-26	yggdrasil75	server : fix typo in server README.md (#19900)	commit \| commitdiff \| tree
2026-02-26	Neo Zhang	support permuted, remove check s0/s10 (#19889)	commit \| commitdiff \| tree
2026-02-25	Jeff Bolz	vulkan: check for memory overlap before doing fusion...	commit \| commitdiff \| tree
2026-02-25	ddh0	common : add more aliases for sampler CLI params (...	commit \| commitdiff \| tree
2026-02-25	Slobodan Josic	ci : update the ROCm/HIP toolchain versions [no ci...	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	server : enable multi-modal prompt caching (#19877)	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	server : support multi-modal context checkpoints (...	commit \| commitdiff \| tree
2026-02-25	Xuan-Son Nguyen	scripts: update corpus of compare-logprobs (#19326)	commit \| commitdiff \| tree
2026-02-25	Mario Limonciello	ci : update Windows ROCm build to 26.Q1 [no ci] (#19810)	commit \| commitdiff \| tree
2026-02-25	Aldehir Rojas	gguf : fix ftell/fseek for Windows (#19870)	commit \| commitdiff \| tree
2026-02-24	Georgi Gerganov	models : fix graph splits (#19866)	commit \| commitdiff \| tree
2026-02-24	Pascal	server: fix query params lost when proxying requests...	commit \| commitdiff \| tree
2026-02-24	Georgi Gerganov	ggml/gguf : prevent integer overflows (#19856)	commit \| commitdiff \| tree
2026-02-24	Tarek Dakhran	model : update label for LFM2-24B-A2B (#19848)	commit \| commitdiff \| tree
2026-02-24	Radoslav Gerganov	server : support max_completion_tokens request property...	commit \| commitdiff \| tree
2026-02-24	Ruben Ortlam	Vulkan Scalar Flash Attention Refactor (#19625)	commit \| commitdiff \| tree
2026-02-24	Jeff Bolz	vulkan: fix coopmat1 without bf16 support (#19793)	commit \| commitdiff \| tree
2026-02-24	Jeff Bolz	vulkan: fix data race in mul_mat_id shader (#19790)	commit \| commitdiff \| tree
2026-02-24	Max Krasnyansky	hexagon refactor all Ops to use local context struct...	commit \| commitdiff \| tree

Packaging of ggml-org/llama.cpp
