git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

2025-12-29	Georgi Gerganov	server : handle closed connection for tasks (#18459)
2025-12-29	Daniel Bevenius	model-conversion : add device option to embd run orig...
2025-12-29	Héctor Estrada...	retrieval : use at most n_seq_max chunks (#18400)
2025-12-29	o7si	common: fix return value check for setpriority (#18412)
2025-12-29	Johannes Gäßler	CUDA: Blackwell features for non-native builds (#18436)
2025-12-29	Aman Gupta	cuda: fix race condition in cumsum (#18448)
2025-12-28	Tim Neumann	ci : re-enable rocm build on amd64 (#18439)
2025-12-28	uvos	HIP: Use mmq on MFMA devices for MUL_MAT_ID in cases...
2025-12-28	momonga	model : Plamo3 support (#17304)
2025-12-28	Aman Gupta	Revert "ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if...
2025-12-28	o7si	rpc: fix segfault on invalid endpoint format (#18387)
2025-12-28	Johannes Gäßler	llama-fit-params: fix step size for last device (#18415)
2025-12-28	Johannes Gäßler	github: update issue templates [no ci] (#18410)
2025-12-28	Xuan-Son Nguyen	mtmd: clarify that we no longer accept AI-generated...
2025-12-28	Boian Berberov	cmake: Added more x86_64 CPU backends when building...
2025-12-28	QDelta	ggml-cuda: use CMAKE_CUDA_ARCHITECTURES if set when...
2025-12-27	lhez	opencl: allow resizing transpose buffers (#18384)
2025-12-27	Johannes Gäßler	llama-fit-params: fix overflow check (#18354)
2025-12-27	Johannes Gäßler	llama: fix magic number of 999 for GPU layers (#18266)
2025-12-27	Aman Gupta	ggml-cuda: Use same regex for GGML_NATIVE=OFF (#18407)
2025-12-27	Johannes Gäßler	llama_fit_params: return enum for fail vs. error (...
2025-12-27	Johannes Gäßler	llama-fit-params: fix Gemma 3 calculation (#18372)
2025-12-26	Jeff Bolz	vulkan: preprocess mul_mat_id experts and discard workg...
2025-12-26	Jeff Bolz	vulkan: optimize decodeFuncB in coopmat2 mul_mat_id...
2025-12-26	Jeff Bolz	vulkan: Use BK=32 for coopmat2 mul_mat_id (#18332)
2025-12-26	Eve	vulkan: small dequantization improvements (#18380)
2025-12-26	Jeff Bolz	vulkan: Support UPSCALE w/antialias (#18327)
2025-12-26	Jeff Bolz	vulkan: handle rope with large number of rows (#18306)
2025-12-26	o7si	server : fix crash when seq_rm fails for hybrid/recurre...
2025-12-26	Francisco Herrera	docs: added note for pre SYCL Intel hardware (#18016)
2025-12-26	0Marble	CANN: implement the SSM_CONV operator (#17737)
2025-12-25	Aman Gupta	ggml-cuda: fix regex for arch list (#18371)
2025-12-25	Aman Gupta	cuda: optimize cumsum cub path (#18362)
2025-12-25	Aman Gupta	ggml-cuda: fix blackwell native builds (#18361)
2025-12-25	Penglin Cai	CANN: Add support for CONV_TRANSPOSE_1D when kernel...
2025-12-25	Aadeshveer...	ggml : optimize cuda cumsum fallback kernel (#18343)
2025-12-24	Xuan-Son Nguyen	server: (router) add stop-timeout option (#18350)
2025-12-24	Xuan-Son Nguyen	model: support MiMo-V2-Flash (#18328)
2025-12-24	Aadeshveer...	fit-params : fix race condition in fit-params output...
2025-12-24	Aman Gupta	CUDA: experimental native mxfp4 support for blackwell...
2025-12-24	Saba Fallah	model : support for LlamaBidirectionalModel architectur...
2025-12-24	Jeff Bolz	vulkan: fix command buffer corruption in ggml_backend_v...
2025-12-24	Wang Weixuan	CANN : refactor ACL graph cache (#17752)
2025-12-24	Jesse Ikonen	docs: Fix typos in SYCL documentation (#18269)
2025-12-24	Ruben Ortlam	vulkan: use fewer FA rows for small cache runs (#18280)
2025-12-24	TianHao324	CANN: Uses yarn_ramp cache in ROPE (#17725)
2025-12-24	ddh0	common: add `LLAMA_ARG_OVERRIDE_TENSOR` env var for...
2025-12-23	Xuan-Son Nguyen	server: return_progress to also report 0% processing...
2025-12-23	Pascal	webui: apply webui_settings on first load (#18223)
2025-12-23	Xuan-Son Nguyen	server: fix crash with model not having BOS/EOS (#18321)
2025-12-23	Daniel Bevenius	model-conversion : add device option to run-org-model...
2025-12-23	Chris Rohlf	rpc : add check for rpc buffer type (#18242)
2025-12-23	nullname	ggml-hexagon: create generalized functions for cpu...
2025-12-23	Daniel Bevenius	model-conversion : add trust_remote_code for embedding...
2025-12-23	Neo Zhang	[SYCL] replace llama-cli by llama-completion to rm...
2025-12-23	Alessandro98-git	model : fix div-by-zero for Nemotron V2 (#18309)
2025-12-22	Ryan Mangeno	model : Granite Embedding support (#15641)
2025-12-22	compilade	gguf-py : do not align the data start offset (#18291)
2025-12-22	Shouyu	ggml-hexagon: gelu optimization (#18151)
2025-12-22	Xuan-Son Nguyen	gen-docs: automatically update markdown file (#18294)
2025-12-22	Taimur Ahmad	llamafile: add rvv support for sgemm kernels (#18199)
2025-12-22	lhez	opencl: unpack q4_0 for adreno in get_tensor (#18278)
2025-12-22	Jeff Bolz	vulkan: Extend rope fusions to allow mrope (#18264)
2025-12-22	Xuan-Son Nguyen	server: prevent data race from HTTP threads (#18263)
2025-12-22	Xuan-Son Nguyen	server: fix data race in to_json_anthropic (#18283)
2025-12-22	Mattt	release: update release workflow to store XCFramework...
2025-12-22	Aaron Teo	convert: rework ftype heuristics (#18214)
2025-12-22	Xuan-Son Nguyen	server: (docs) remove mention about extra_args (#18262)
2025-12-22	Johannes Gäßler	tool/ex/tests: consistently free ctx, then model (...
2025-12-21	Jeff Bolz	vulkan: Implement set_tensor_async and the event interf...
2025-12-21	Johannes Gäßler	llama: fix RPC for -fit on (#18233)
2025-12-21	Xuan-Son Nguyen	move copilot instructions to AGENTS.md (#18259)
2025-12-21	Jeff Bolz	vulkan: fix im2col overflowing maxworkgroupcount (...
2025-12-21	Jeff Bolz	vulkan/cuda: fix topk_moe with exp_probs_b (#18071)
2025-12-21	Jeff Bolz	vulkan: support GGML_UNARY_OP_XIELU (#18062)
2025-12-21	Jeff Bolz	vulkan: in graph_optimize, try to group ADD operations...
2025-12-21	lovedheart	Vulkan: some improvement on mul_mat_iq2_xs (#18031)
2025-12-21	Daniel Bevenius	docs : fix links in parsing.md (#18245)
2025-12-21	Aldehir Rojas	common : reorganize includes to prioritize vendored...
2025-12-21	Xuan-Son Nguyen	server: add auto-sleep after N seconds of idle (#18228)
2025-12-20	Jeff Bolz	tests: Avoid floating point precision false positives...
2025-12-20	Jeff Bolz	test-backend-ops: improve msvc build time (#18209)
2025-12-20	Aadeshveer...	Added comments explaining thread block size selection...
2025-12-20	Oleksandr Kuvshynov	server : [easy] fix per round speculative decode loggin...
2025-12-20	Xuan-Son Nguyen	server: support load model on startup, support preset...
2025-12-19	Sigbjørn Skjæret	ci : remove non-windows zip artifacts (#18201)
2025-12-19	Sigbjørn Skjæret	ci : only save ccache on master (#18207)
2025-12-19	Alfred	ggml-hexagon: Implement true Q8_0 quantization on Hexag...
2025-12-19	Pascal	arg: fix order to use short form before long form ...
2025-12-19	Julius Tischbein	llama : Changing off_t to size_t for Windows (#18204)
2025-12-19	Aman Gupta	server: friendlier error msg when ctx < input (#18174)
2025-12-19	Xuan-Son Nguyen	presets: refactor, allow cascade presets from different...
2025-12-19	Aleksander...	webui: Add editing attachments in user messages (#18147)
2025-12-19	Daniel Bevenius	model-conversion : add verbose flag in run-org-model...
2025-12-19	Naco Siren	android: fix missing screenshots for Android.md (#18156)
2025-12-19	Jeff Bolz	vulkan: Add perf logger mode with concurrency (#17944)
2025-12-18	Xuan-Son Nguyen	model : add ASR support for LFM2-Audio-1.5B (conformer...
2025-12-18	Pascal	webui: display prompt processing stats (#18146)
2025-12-18	Taimur Ahmad	ggml-cpu: extend support for RVV floating-point kernels...
2025-12-18	Xuan-Son Nguyen	arg: fix ASAN error on sampler_type_names empty (#18167)
Packaging of ggml-org/llama.cpp
