git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

2025-11-05	Georgi Gerganov	server : do not default to multiple slots with speculat...
2025-11-05	Xuan-Son Nguyen	mtmd: improve struct initialization (#16981)
2025-11-05	손희준	docs: Clarify the endpoint that webui uses (#17001)
2025-11-05	Li Pengzhan	model : add openPangu-Embedded (#16941)
2025-11-05	Reese Levine	ggml webgpu: minor set rows optimization (#16810)
2025-11-05	Georgi Gerganov	sync : ggml
2025-11-05	Georgi Gerganov	ggml : fix conv2d_dw SVE path (ggml/1380)
2025-11-05	mnehete32	CUDA: update ops.md (#17005)
2025-11-05	lhez	opencl: update doc (#17011)
2025-11-04	nullname	refactor: replace sprintf with snprintf for safer strin...
2025-11-04	Jeff Bolz	vulkan: remove the need for the dryrun (#16826)
2025-11-04	Georgi Gerganov	server : do context shift only while generating (#17000)
2025-11-04	Georgi Gerganov	readme : update hot topics (#17002)
2025-11-04	Acly	ggml-cpu : bicubic interpolation (#16891)
2025-11-04	Sigbjørn Skjæret	ci : apply model label to models (#16994)
2025-11-04	Sigbjørn Skjæret	chore : fix models indent after refactor (#16992)
2025-11-04	Noah	Fix garbled output with REPACK at high thread counts...
2025-11-04	Aman Gupta	CUDA: avoid mul + bias fusion when doing fusion (#16935)
2025-11-03	lhez	opencl: support imrope (#16914)
2025-11-03	Aleksander...	fix: Viewing multiple PDF attachments (#16974)
2025-11-03	Daniel Bevenius	model-conversion : pass config to from_pretrained ...
2025-11-03	Georgi Gerganov	server : add props.model_alias (#16943)
2025-11-03	theo77186	ggml: CUDA: add head size 72 for flash-attn (#16962)
2025-11-03	Xuan-Son Nguyen	mtmd: add --image-min/max-tokens (#16921)
2025-11-03	Xuan-Son Nguyen	mtmd: pad mask for qwen2.5vl (#16954)
2025-11-03	Jinyang He	ggml : LoongArch fixes (#16958)
2025-11-03	Olivier Chafik	sync: minja (glm 4.6 & minmax m2 templates) (#16949)
2025-11-03	shani-f	SYCL: optimized repeat_back kernel (3× fewer asm instru...
2025-11-02	Sascha Rogmann	feat(webui): improve LaTeX rendering with currency...
2025-11-02	Shagun Bera	test-backend-ops : fix segfault in moe-expert-reduce...
2025-11-02	Sigbjørn Skjæret	ci : disable failing riscv cross build (#16952)
2025-11-02	Zhiyong Wang	model: add Janus Pro for image understanding (#16906)
2025-11-02	Georgi Gerganov	clip : use FA (#16837)
2025-11-02	Georgi Gerganov	server : support unified cache across slots (#16736)
2025-11-02	Aldehir Rojas	common : move gpt-oss reasoning processing to init...
2025-11-02	Adrian Lundberg	docs: remove llama_sampler_accept reference in sampling...
2025-11-02	mnehete32	CUDA: add FLOOR, CEIL, ROUND, TRUNC unary ops (#16917)
2025-11-02	Aaron Teo	devops: fix failing s390x docker build (#16918)
2025-11-02	Aaron Teo	ggml: add s390x cpu-feats (#16774)
2025-11-01	Georgi Gerganov	scripts : add script to bench models (#16894)
2025-11-01	Pascal	webui: auto-refresh /props on inference start to resync...
2025-11-01	Pascal	webui: add HTML/JS preview support to MarkdownContent...
2025-11-01	Adrien Gallouët	vendor : update cpp-httplib to 0.27.0 (#16846)
2025-11-01	Xuan-Son Nguyen	mtmd: refactor preprocessing + support max/min pixels...
2025-11-01	Aleksander...	Add a setting to display message generation statistics...
2025-11-01	Jaromír Hradílek	webui: recognize AsciiDoc files as valid text files...
2025-11-01	Sigbjørn Skjæret	common : allow --system-prompt-file for diffusion-cli...
2025-11-01	Sigbjørn Skjæret	codeowners : update after refactor (#16905)
2025-11-01	Jeff Bolz	vulkan: Fix multi_add invalid descriptor usage (#16899)
2025-11-01	Jeff Bolz	vulkan: fuse mul_mat+add and mul_mat_id+add_id (#16868)
2025-11-01	Oliver Simons	CUDA: Remove unneded bias/gate dims in fused mmvq ...
2025-10-31	Piotr Wilkin...	refactor : llama-model.cpp (#16252)
2025-10-31	Piotr Wilkin...	model : Minimax M2 (#16831)
2025-10-31	Giuseppe Scrivano	model : add Granite Hybrid nano types (#16896)
2025-10-31	Johannes Gäßler	CUDA: Volta tensor core support for MMF (#16843)
2025-10-31	Georgi Gerganov	sync : ggml
2025-10-31	Aman Gupta	CUDA: add expert reduce kernel (#16857)
2025-10-31	Georgi Gerganov	batch : fix consistency checks for the input positions...
2025-10-31	Georgi Gerganov	server : don't print user inputs to console (#16871)
2025-10-31	Daniel Bevenius	server : fix typos in server.cpp comments [no ci] ...
2025-10-31	Jeff Bolz	vulkan: disable spirv-opt for rope shaders (#16872)
2025-10-31	Masato Nakasaka	vulkan: Fix crash when FP16 mul_mat accumulation is...
2025-10-31	Ruben Ortlam	vulkan: fix shmem overrun in mmq id shader (#16873)
2025-10-31	l3utterfly	ggml-hexagon: respect input size when getting/setting...
2025-10-30	Sigbjørn Skjæret	ci : enable free-disk-space on cuda docker build (...
2025-10-30	lhez	opencl: fix boundary handling for mul_mm (#16875)
2025-10-30	RodriMora	convert : update transformers requirements (#16866)
2025-10-30	chansikpark	server : bump request URI max length to 32768 (#16862)
2025-10-30	Georgi Gerganov	server : remove n_past (#16818)
2025-10-30	Max Krasnyansky	cpu: introduce chunking for repack matmuls and enable...
2025-10-30	Shagun Bera	common: fix typo in cli help text (#16864)
2025-10-30	JJJYmmm	model: add support for qwen3vl series (#16780)
2025-10-30	Max Krasnyansky	cpu: introduce chunking for flash attention (#16829)
2025-10-30	Tianyue-Zhao	model: Add support for CogVLM model (#15002)
2025-10-30	Sigbjørn Skjæret	cuda : fix argsort with 64k+ rows (#16849)
2025-10-30	Jan Boon	llama : use std::abs instead of abs (#16853)
2025-10-30	Jeff Bolz	vulkan: Handle argsort with a large number of rows...
2025-10-30	Oliver Simons	Hide latency of bias and gate-loading (#16847)
2025-10-29	Jeff Bolz	vulkan: Fuse rope+set_rows (#16769)
2025-10-29	Xuan-Son Nguyen	llama: fix ASAN error with M-RoPE (#16848)
2025-10-29	Xuan-Son Nguyen	llama: store mrope data in KV cell (#16825)
2025-10-29	Jeff Bolz	vulkan: Update topk_moe fusion to handle gpt's late...
2025-10-29	Ruben Ortlam	Vulkan MMQ Integer Dot Refactor and K-Quant support...
2025-10-29	Max Krasnyansky	Hexagon Op queue & dispatch optimizations (#16820)
2025-10-29	Aman Gupta	CUDA: use fastdiv in set-rows (#16834)
2025-10-29	Sigbjørn Skjæret	vendor : sync minja (#16500)
2025-10-29	Jeff Bolz	vulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffe...
2025-10-29	Aman Gupta	CUDA: Fix bug in topk-moe for gpt-oss (#16821)
2025-10-29	YaelLogic	sycl: add RMS_NORM_BACK operation support (#16808)
2025-10-28	YaelGitAccount	cuda: add SET operation support (#16804)
2025-10-28	Georgi Gerganov	memory : remove KV cache size padding (#16812)
2025-10-28	Georgi Gerganov	llama-bench : clarify benchmarked parts of the computat...
2025-10-28	l3utterfly	initialise buffer.device in ggml_hexagon_session (...
2025-10-28	Sam Malayek	embedding: add raw option for --embd-output-format...
2025-10-28	Johannes Gäßler	llama: consistent ctx <-> buf order for KV cache (...
2025-10-28	Aldehir Rojas	grammar : support array references in json schema ...
2025-10-28	Chenguang Li	CANN: Improve device ID handling and aclnnArange checks...
2025-10-28	Aman Gupta	CUDA: add unused vars to mmvf and mmvq (#16807)
2025-10-28	tamarPal	sycl: add SSM_CONV operation support (#16800)
2025-10-27	Yuri Khrustalev	chat: Add LFM2 tool handling (#16763)

Packaging of ggml-org/llama.cpp