git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2026-03-03	Mickael Desgranges	docs: Fix intel documentation link (#20040)	commit \| commitdiff \| tree
2026-03-03	Charles Xu	kleidiai : add sme fp16 compute path for q4_0 gemm...	commit \| commitdiff \| tree
2026-03-03	shaofeiqi	opencl: add optimized q4_1 mm kernel for adreno (#19840)	commit \| commitdiff \| tree
2026-03-03	Abhijit Ramesh	ggml webgpu: fix workgroup dispatch limit for large...	commit \| commitdiff \| tree
2026-03-02	Nikhil Jain	ggml webgpu: Clean up per-thread parameter buffer pool...	commit \| commitdiff \| tree
2026-03-02	Masashi Yoshimura	ggml-webgpu: Support non-contiguous `src0` and overlapp...	commit \| commitdiff \| tree
2026-03-02	Ruben Ortlam	vulkan: tune MMVQ for Intel Windows (#19988)	commit \| commitdiff \| tree
2026-03-02	Adrien Gallouët	scripts : improve get-wikitext-2.sh (#19952)	commit \| commitdiff \| tree
2026-03-02	Aaron Teo	ggml-cpu: optimise s390x multiply extend instructions...	commit \| commitdiff \| tree
2026-03-01	Ruben Ortlam	vulkan: improve partial offloading performance on AMD...	commit \| commitdiff \| tree
2026-03-01	oobabooga	cuda: cap grid.y at 65535 in non-contiguous dequantize...	commit \| commitdiff \| tree
2026-02-28	Dmitry Atamanov	vendors : update miniaudio library to 0.11.24 (#19914)	commit \| commitdiff \| tree
2026-02-28	Adrien Gallouët	vendor : update cpp-httplib to 0.35.0 (#19969)	commit \| commitdiff \| tree
2026-02-28	Bartowski	tests : model metadata loading from huggingface (#19796)	commit \| commitdiff \| tree
2026-02-27	Jayant Lohia	CUDA: add CDNA3 MFMA support for flash attention MMA...	commit \| commitdiff \| tree
2026-02-27	Roj234	server: Add pragma once to server-context.h (#19944)	commit \| commitdiff \| tree
2026-02-27	Sami Kama	server: Mirroring /v1/responses to /responses to match...	commit \| commitdiff \| tree
2026-02-27	Daniel Bevenius	ci : use ubuntu-latest for gguf-publish workflow (...	commit \| commitdiff \| tree
2026-02-27	Aman Gupta	ggml-cpu: add repack for mxfp4 (#19738)	commit \| commitdiff \| tree
2026-02-27	Daniel Bevenius	gguf-py : dump version to 0.18.0 (#19950) gguf-v0.18.0	commit \| commitdiff \| tree
2026-02-27	Pascal	server : support multiple model aliases via comma-separ...	commit \| commitdiff \| tree
2026-02-27	Jan Patrick...	tests : enable test-chat out of tree build (#19558)	commit \| commitdiff \| tree
2026-02-27	Neo Zhang	replace the magic nunber 768 by max work group size...	commit \| commitdiff \| tree
2026-02-27	Vishal Singh	ggml-zendnn: update code for latest ZenDNN API (#19923)	commit \| commitdiff \| tree
2026-02-26	Adrien Gallouët	ggml : fix AMX and add batched support (#19925)	commit \| commitdiff \| tree
2026-02-26	Ruben Ortlam	vulkan: fix fp16 Flash Attention on Windows AMD RDNA2...	commit \| commitdiff \| tree
2026-02-26	Georgi Gerganov	mtmd : fix padding of n_tokens (#19930)	commit \| commitdiff \| tree
2026-02-26	Georgi Gerganov	server : fix ctx checkpoint restore logic (#19924)	commit \| commitdiff \| tree
2026-02-26	Georgi Gerganov	kv-cache : fix can_shift() check to take into account...	commit \| commitdiff \| tree
2026-02-26	Aman Gupta	llama: Add option to merge gate and exp weights (#19139)	commit \| commitdiff \| tree
2026-02-26	Kevin Pouget	ggml-virtgpu: improve the reliability of the code ...	commit \| commitdiff \| tree
2026-02-26	drrros	server: fix load-on-startup not respected in ini file...	commit \| commitdiff \| tree
2026-02-26	Eric Zhang	jinja : correct default size for string slices (#19913)	commit \| commitdiff \| tree
2026-02-26	Maximilian...	model : add Jina Embeddings v5 Nano (partial EuroBERT...	commit \| commitdiff \| tree
2026-02-26	Georgi Gerganov	gguf : avoid too many file size calls (#19919)	commit \| commitdiff \| tree
2026-02-26	yggdrasil75	server : fix typo in server README.md (#19900)	commit \| commitdiff \| tree
2026-02-26	Neo Zhang	support permuted, remove check s0/s10 (#19889)	commit \| commitdiff \| tree
2026-02-25	Jeff Bolz	vulkan: check for memory overlap before doing fusion...	commit \| commitdiff \| tree
2026-02-25	ddh0	common : add more aliases for sampler CLI params (...	commit \| commitdiff \| tree
2026-02-25	Slobodan Josic	ci : update the ROCm/HIP toolchain versions [no ci...	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	server : enable multi-modal prompt caching (#19877)	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	server : support multi-modal context checkpoints (...	commit \| commitdiff \| tree
2026-02-25	Xuan-Son Nguyen	scripts: update corpus of compare-logprobs (#19326)	commit \| commitdiff \| tree
2026-02-25	Mario Limonciello	ci : update Windows ROCm build to 26.Q1 [no ci] (#19810)	commit \| commitdiff \| tree
2026-02-25	Aldehir Rojas	gguf : fix ftell/fseek for Windows (#19870)	commit \| commitdiff \| tree
2026-02-24	Georgi Gerganov	models : fix graph splits (#19866)	commit \| commitdiff \| tree
2026-02-24	Pascal	server: fix query params lost when proxying requests...	commit \| commitdiff \| tree
2026-02-24	Georgi Gerganov	ggml/gguf : prevent integer overflows (#19856)	commit \| commitdiff \| tree
2026-02-24	Tarek Dakhran	model : update label for LFM2-24B-A2B (#19848)	commit \| commitdiff \| tree
2026-02-24	Radoslav Gerganov	server : support max_completion_tokens request property...	commit \| commitdiff \| tree
2026-02-24	Ruben Ortlam	Vulkan Scalar Flash Attention Refactor (#19625)	commit \| commitdiff \| tree
2026-02-24	Jeff Bolz	vulkan: fix coopmat1 without bf16 support (#19793)	commit \| commitdiff \| tree
2026-02-24	Jeff Bolz	vulkan: fix data race in mul_mat_id shader (#19790)	commit \| commitdiff \| tree
2026-02-24	Max Krasnyansky	hexagon refactor all Ops to use local context struct...	commit \| commitdiff \| tree
2026-02-23	Aleksander...	feat: Add code blocks full height setting to parameter...	commit \| commitdiff \| tree
2026-02-23	Adrien Gallouët	vendor : update cpp-httplib to 0.34.0 (#19830)	commit \| commitdiff \| tree
2026-02-23	Daniel Bevenius	tests : fix typos in comments in test-backend-sampler...	commit \| commitdiff \| tree
2026-02-23	Aleksander...	webui: Add setting to have full height Code Blocks...	commit \| commitdiff \| tree
2026-02-23	Daniel Bevenius	model-conversion : merge inspect-org-model.py with...	commit \| commitdiff \| tree
2026-02-23	Alberto Cabrera...	ggml-cpu: arm64: q5_K repack gemm and gemv (and generic...	commit \| commitdiff \| tree
2026-02-23	Daniel Bevenius	llama : remove write/read of output ids/logits/embeddin...	commit \| commitdiff \| tree
2026-02-22	Sigbjørn Skjæret	cli : provide model with text filename (#19783)	commit \| commitdiff \| tree
2026-02-22	Xuan-Son Nguyen	jinja: correct stats for tojson and string filters...	commit \| commitdiff \| tree
2026-02-22	Aldehir Rojas	common : fix improper trimming in XML parser on complet...	commit \| commitdiff \| tree
2026-02-22	Kilian Krampf	Fix wrong cli-argument in documentation (#19804)	commit \| commitdiff \| tree
2026-02-22	HelloKS	model : add Kanana-2 model support (#19803)	commit \| commitdiff \| tree
2026-02-22	Sigbjørn Skjæret	ci : fix rocm archive name [no ci] (#19808)	commit \| commitdiff \| tree
2026-02-22	Aldehir Rojas	server : merge contiguous Responses input items into...	commit \| commitdiff \| tree
2026-02-22	Sigbjørn Skjæret	ci : fix rocm release path [no ci] (#19784)	commit \| commitdiff \| tree
2026-02-21	Mario Limonciello	Update ROCm docker container to 7.2 release (#19418)	commit \| commitdiff \| tree
2026-02-21	Mario Limonciello	Add a build target to generate ROCm artifacts using...	commit \| commitdiff \| tree
2026-02-21	Adrien Gallouët	vendor : update cpp-httplib to 0.33.1 (#19778)	commit \| commitdiff \| tree
2026-02-21	Gaurav Garg	Improve CUDA graph capture (#19754)	commit \| commitdiff \| tree
2026-02-21	crsawyer	fix: UI single model selection in router mode (#19767)	commit \| commitdiff \| tree
2026-02-21	Mengsheng Wu	hexagon : fix build release (#19444) (#19587)	commit \| commitdiff \| tree
2026-02-20	Aldehir Rojas	common : merge qwen3-coder and nemotron nano 3 parsers...	commit \| commitdiff \| tree
2026-02-20	Taimur Ahmad	ggml-cpu: add RVV vec dot kernels for quantization...	commit \| commitdiff \| tree
2026-02-20	ddh0	quantize : add --dry-run option (#19526)	commit \| commitdiff \| tree
2026-02-20	Jeff Bolz	test: mul_mat tests with huge batch size (#19519)	commit \| commitdiff \| tree
2026-02-19	crsawyer	WebUI hide models in router mode (#19374)	commit \| commitdiff \| tree
2026-02-19	Jesse Posner	common : fix Step-3.5-Flash format detection and thinki...	commit \| commitdiff \| tree
2026-02-19	abhijitb11	common : fix gpt-oss Jinja error when assistant message...	commit \| commitdiff \| tree
2026-02-19	Masashi Yoshimura	ggml-webgpu: Add unary op (SQR, SQRT, SIN, COS) support...	commit \| commitdiff \| tree
2026-02-19	megemini	model: Add PaddleOCR-VL model support (#18825)	commit \| commitdiff \| tree
2026-02-19	Ruben Ortlam	vulkan: fix MMQ shader push constants and multi-dispatc...	commit \| commitdiff \| tree
2026-02-19	Georgi Gerganov	models : fix qwen3.5 beta/gate shapes (#19730)	commit \| commitdiff \| tree
2026-02-19	Saba Fallah	mtmd: build_attn modified, flash_attn on/off via ctx_pa...	commit \| commitdiff \| tree
2026-02-19	3 a l i	model : add JAIS-2 architecture support (#19488)	commit \| commitdiff \| tree
2026-02-19	Johannes Gäßler	CUDA: fix kernel selection logic for tile FA (#19686)	commit \| commitdiff \| tree
2026-02-19	Tarek Dakhran	mtmd : chat : Fix extra \n between text and media marke...	commit \| commitdiff \| tree
2026-02-19	Aleksander...	webui: Fix Attachments not being included in completion...	commit \| commitdiff \| tree
2026-02-19	Tarek Dakhran	model : add tokenizer from LFM2.5-Audio-1.5B (#19687)	commit \| commitdiff \| tree
2026-02-19	Daniel Bevenius	llama : use output_resolve_row() in get_logits_ith...	commit \| commitdiff \| tree
2026-02-19	Ryan Mangeno	model : full modern bert support (#18330)	commit \| commitdiff \| tree
2026-02-19	shalinib-ibm	llamafile: powerpc: add FP16 MMA path for Q4/Q8 matmul...	commit \| commitdiff \| tree
2026-02-19	Georgi Gerganov	models : dedup qwen35 graphs (#19660)	commit \| commitdiff \| tree
2026-02-19	ymcki	models : dedup Kimi Linear delta net implementation...	commit \| commitdiff \| tree
2026-02-18	Piotr Wilkin...	Add Jinja support for "indent" string filter (#19529)	commit \| commitdiff \| tree
2026-02-18	Reese Levine	ggml webgpu: Fix bug in dispatching large matrix-vector...	commit \| commitdiff \| tree
2026-02-18	matteo	server: save generated text for the /slots endpoint...	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom