git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2025-10-31	Piotr Wilkin...	model : Minimax M2 (#16831)	commit \| commitdiff \| tree
2025-10-31	Giuseppe Scrivano	model : add Granite Hybrid nano types (#16896)	commit \| commitdiff \| tree
2025-10-31	Johannes Gäßler	CUDA: Volta tensor core support for MMF (#16843)	commit \| commitdiff \| tree
2025-10-31	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-10-31	Aman Gupta	CUDA: add expert reduce kernel (#16857)	commit \| commitdiff \| tree
2025-10-31	Georgi Gerganov	batch : fix consistency checks for the input positions...	commit \| commitdiff \| tree
2025-10-31	Georgi Gerganov	server : don't print user inputs to console (#16871)	commit \| commitdiff \| tree
2025-10-31	Daniel Bevenius	server : fix typos in server.cpp comments [no ci] ...	commit \| commitdiff \| tree
2025-10-31	Jeff Bolz	vulkan: disable spirv-opt for rope shaders (#16872)	commit \| commitdiff \| tree
2025-10-31	Masato Nakasaka	vulkan: Fix crash when FP16 mul_mat accumulation is...	commit \| commitdiff \| tree
2025-10-31	Ruben Ortlam	vulkan: fix shmem overrun in mmq id shader (#16873)	commit \| commitdiff \| tree
2025-10-31	l3utterfly	ggml-hexagon: respect input size when getting/setting...	commit \| commitdiff \| tree
2025-10-30	Sigbjørn Skjæret	ci : enable free-disk-space on cuda docker build (...	commit \| commitdiff \| tree
2025-10-30	lhez	opencl: fix boundary handling for mul_mm (#16875)	commit \| commitdiff \| tree
2025-10-30	RodriMora	convert : update transformers requirements (#16866)	commit \| commitdiff \| tree
2025-10-30	chansikpark	server : bump request URI max length to 32768 (#16862)	commit \| commitdiff \| tree
2025-10-30	Georgi Gerganov	server : remove n_past (#16818)	commit \| commitdiff \| tree
2025-10-30	Max Krasnyansky	cpu: introduce chunking for repack matmuls and enable...	commit \| commitdiff \| tree
2025-10-30	Shagun Bera	common: fix typo in cli help text (#16864)	commit \| commitdiff \| tree
2025-10-30	JJJYmmm	model: add support for qwen3vl series (#16780)	commit \| commitdiff \| tree
2025-10-30	Max Krasnyansky	cpu: introduce chunking for flash attention (#16829)	commit \| commitdiff \| tree
2025-10-30	Tianyue-Zhao	model: Add support for CogVLM model (#15002)	commit \| commitdiff \| tree
2025-10-30	Sigbjørn Skjæret	cuda : fix argsort with 64k+ rows (#16849)	commit \| commitdiff \| tree
2025-10-30	Jan Boon	llama : use std::abs instead of abs (#16853)	commit \| commitdiff \| tree
2025-10-30	Jeff Bolz	vulkan: Handle argsort with a large number of rows...	commit \| commitdiff \| tree
2025-10-30	Oliver Simons	Hide latency of bias and gate-loading (#16847)	commit \| commitdiff \| tree
2025-10-29	Jeff Bolz	vulkan: Fuse rope+set_rows (#16769)	commit \| commitdiff \| tree
2025-10-29	Xuan-Son Nguyen	llama: fix ASAN error with M-RoPE (#16848)	commit \| commitdiff \| tree
2025-10-29	Xuan-Son Nguyen	llama: store mrope data in KV cell (#16825)	commit \| commitdiff \| tree
2025-10-29	Jeff Bolz	vulkan: Update topk_moe fusion to handle gpt's late...	commit \| commitdiff \| tree
2025-10-29	Ruben Ortlam	Vulkan MMQ Integer Dot Refactor and K-Quant support...	commit \| commitdiff \| tree
2025-10-29	Max Krasnyansky	Hexagon Op queue & dispatch optimizations (#16820)	commit \| commitdiff \| tree
2025-10-29	Aman Gupta	CUDA: use fastdiv in set-rows (#16834)	commit \| commitdiff \| tree
2025-10-29	Sigbjørn Skjæret	vendor : sync minja (#16500)	commit \| commitdiff \| tree
2025-10-29	Jeff Bolz	vulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffe...	commit \| commitdiff \| tree
2025-10-29	Aman Gupta	CUDA: Fix bug in topk-moe for gpt-oss (#16821)	commit \| commitdiff \| tree
2025-10-29	YaelLogic	sycl: add RMS_NORM_BACK operation support (#16808)	commit \| commitdiff \| tree
2025-10-28	YaelGitAccount	cuda: add SET operation support (#16804)	commit \| commitdiff \| tree
2025-10-28	Georgi Gerganov	memory : remove KV cache size padding (#16812)	commit \| commitdiff \| tree
2025-10-28	Georgi Gerganov	llama-bench : clarify benchmarked parts of the computat...	commit \| commitdiff \| tree
2025-10-28	l3utterfly	initialise buffer.device in ggml_hexagon_session (...	commit \| commitdiff \| tree
2025-10-28	Sam Malayek	embedding: add raw option for --embd-output-format...	commit \| commitdiff \| tree
2025-10-28	Johannes Gäßler	llama: consistent ctx <-> buf order for KV cache (...	commit \| commitdiff \| tree
2025-10-28	Aldehir Rojas	grammar : support array references in json schema ...	commit \| commitdiff \| tree
2025-10-28	Chenguang Li	CANN: Improve device ID handling and aclnnArange checks...	commit \| commitdiff \| tree
2025-10-28	Aman Gupta	CUDA: add unused vars to mmvf and mmvq (#16807)	commit \| commitdiff \| tree
2025-10-28	tamarPal	sycl: add SSM_CONV operation support (#16800)	commit \| commitdiff \| tree
2025-10-27	Yuri Khrustalev	chat: Add LFM2 tool handling (#16763)	commit \| commitdiff \| tree
2025-10-27	Xuan-Son Nguyen	mtmd : fix idefics3 preprocessing (#16806)	commit \| commitdiff \| tree
2025-10-27	Diego Devesa	llama : disable pipeline parallelism if compute buffer...	commit \| commitdiff \| tree
2025-10-27	Acly	ggml : fix interpolate with align-corners and ne=1...	commit \| commitdiff \| tree
2025-10-27	Johannes Gäßler	HIP: fix AMDGPU_TARGETS, update documentation (#16803)	commit \| commitdiff \| tree
2025-10-27	Xuan-Son Nguyen	model : add LightOnOCR-1B model (#16764)	commit \| commitdiff \| tree
2025-10-27	Johannes Gäßler	llama: fix leaked buffers for mmap + split files (...	commit \| commitdiff \| tree
2025-10-27	Aman Gupta	test-backend-ops: print failed tests at the end (#16785)	commit \| commitdiff \| tree
2025-10-27	tamarPal	sycl: add ROLL operation support (#16665)	commit \| commitdiff \| tree
2025-10-27	shani-f	sycl: add REPEAT_BACK operation support (#16734)	commit \| commitdiff \| tree
2025-10-27	Aman Gupta	CUDA: support for weight clamp in top-k norm (#16702)	commit \| commitdiff \| tree
2025-10-26	Acly	ggml-alloc : make gallocr prefer chunks that allow...	commit \| commitdiff \| tree
2025-10-26	Sigbjørn Skjæret	cuda : use fast copy when src and dst are of different...	commit \| commitdiff \| tree
2025-10-26	leejet	ggml: fix cuda kernel launch configuration for k_comput...	commit \| commitdiff \| tree
2025-10-26	Sigbjørn Skjæret	convert : enable expert group selection for all models...	commit \| commitdiff \| tree
2025-10-26	Sigbjørn Skjæret	graph : add clamping to ffn_moe_weights_sum to avoid...	commit \| commitdiff \| tree
2025-10-26	Sigbjørn Skjæret	model : set res->t_embd in SmallThinker models (#16782)	commit \| commitdiff \| tree
2025-10-26	amirai21	docs : add Jamba to Text-only models list (#16778)	commit \| commitdiff \| tree
2025-10-26	Aman Gupta	CUDA: General GEMV fusion (#16715)	commit \| commitdiff \| tree
2025-10-26	Gilad S.	vulkan: deduplicate Microsoft Direct3D12 devices (...	commit \| commitdiff \| tree
2025-10-25	Galunid	convert : handle mmproj filename/path properly (#16760)	commit \| commitdiff \| tree
2025-10-25	Shunta Saito	model : set res->t_embd in PLaMo2 models (#16766)	commit \| commitdiff \| tree
2025-10-25	Giuseppe Scrivano	vulkan: delete dead code (#16732)	commit \| commitdiff \| tree
2025-10-25	Jeff Bolz	vulkan: Optimize SSM_SCAN (#16645)	commit \| commitdiff \| tree
2025-10-25	compilade	convert : avoid dequantizing mxfp4 for GPT-OSS (#16756)	commit \| commitdiff \| tree
2025-10-24	leejet	ggml: fix CUDA grid launch condition for large block_nu...	commit \| commitdiff \| tree
2025-10-24	Aman Gupta	CUDA: use CUB for arbitary size argsort (#16754)	commit \| commitdiff \| tree
2025-10-24	Florian Badie	webui: support q URL parameter (#16728)	commit \| commitdiff \| tree
2025-10-24	Daniel Bevenius	model-conversion : add trust_remote_code for orig model...	commit \| commitdiff \| tree
2025-10-23	compilade	convert : handle pre-quantized models (#14810)	commit \| commitdiff \| tree
2025-10-23	Johannes Gäßler	server: add memory breakdown print (#16740)	commit \| commitdiff \| tree
2025-10-23	Julien Denize	convert : Make mistral-common dependency optional ...	commit \| commitdiff \| tree
2025-10-23	Xuan-Son Nguyen	mtmd-cli : allow using --jinja (#16718)	commit \| commitdiff \| tree
2025-10-23	Prajwal B Mehendarkar	Manually link -lbsd to resolve flock symbol on AIX...	commit \| commitdiff \| tree
2025-10-23	Aman Gupta	ggml-cuda: use passed ops instead of hardcoded ops...	commit \| commitdiff \| tree
2025-10-23	matteo	server : send partial stop string when <EOG> is reached...	commit \| commitdiff \| tree
2025-10-23	Matthew Michel	sycl: use async memory allocation to fix crashes during...	commit \| commitdiff \| tree
2025-10-22	Max Krasnyansky	Add experimental ggml-hexagon backend for the Hexagon...	commit \| commitdiff \| tree
2025-10-22	Diego Devesa	Revert "ggml : Leverage the existing GGML_F32_VEC helpe...	commit \| commitdiff \| tree
2025-10-22	Pascal	webui: introduce OpenAI-compatible model selector in...	commit \| commitdiff \| tree
2025-10-22	sirus20x6	ggml : Leverage the existing GGML_F32_VEC helpers to...	commit \| commitdiff \| tree
2025-10-22	Acly	tests : fix test-thread-safety when compiling with...	commit \| commitdiff \| tree
2025-10-22	Aman Gupta	CUDA: fix bug in topk-moe softmax (#16711)	commit \| commitdiff \| tree
2025-10-21	Aman Gupta	CUDA: topk-moe: add optional parameter for gpt-oss...	commit \| commitdiff \| tree
2025-10-21	Johannes Gäßler	CUDA: better error for FA kernel with 0 occupancy ...	commit \| commitdiff \| tree
2025-10-21	Aman Gupta	ggml: add ggml_can_fuse_subgraph (#16662)	commit \| commitdiff \| tree
2025-10-21	lhez	opencl: fix warnings and clean up profiling (#16688)	commit \| commitdiff \| tree
2025-10-21	Jeff Bolz	vulkan: Handle FA with all -inf mask values (#16447)	commit \| commitdiff \| tree
2025-10-20	YehuditE	sycl : add PAD_REFLECT_D1 operator support (#16145)	commit \| commitdiff \| tree
2025-10-20	Sigbjørn Skjæret	model : add BailingMoeV2 support (#16063)	commit \| commitdiff \| tree
2025-10-20	Aleksander...	Handle legacy 'context' attachments (#16687)	commit \| commitdiff \| tree
2025-10-20	Diego Devesa	ggml-alloc : fix leak when reusing a tensor with a...	commit \| commitdiff \| tree
2025-10-20	Aleksander...	Prevent premature submission on IME input (#16673)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom