git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2026-01-11	Xuan-Son Nguyen	model: fix qwen3next broken due to #18683 (#18762)	commit \| commitdiff \| tree
2026-01-11	Ruben Ortlam	Vulkan: Optimize Matmul parameters for AMD GPUs with...	commit \| commitdiff \| tree
2026-01-11	Xuan-Son Nguyen	security: make it clear about subtopics in server ...	commit \| commitdiff \| tree
2026-01-11	Daniel Bevenius	debug : include LLAMA_POOLING_TYPE_UNSPECIFIED in pooli...	commit \| commitdiff \| tree
2026-01-11	Georgi Gerganov	tests : refactor test-backend-sampler (#18753)	commit \| commitdiff \| tree
2026-01-11	Xuan-Son Nguyen	model: try to improve Qwen3 Next (#18683)	commit \| commitdiff \| tree
2026-01-11	thom-dev-fr	readme : update UIs (#18751)	commit \| commitdiff \| tree
2026-01-11	Xuan-Son Nguyen	security: narrow down the scope of what we consider...	commit \| commitdiff \| tree
2026-01-11	shaofeiqi	opencl: add SOFTPLUS op support (#18726)	commit \| commitdiff \| tree
2026-01-10	Aman Gupta	test-backend-ops: fix mxfp4 tests on blackwell (#18736)	commit \| commitdiff \| tree
2026-01-10	Johannes Gäßler	HIP: adjust RDNA3.5 MMQ kernel selction logic (#18666)	commit \| commitdiff \| tree
2026-01-10	Perry Naseck	cmake : update blas logic (#18205)	commit \| commitdiff \| tree
2026-01-10	Georgi Gerganov	server : adjust unified KV cache tests (#18716)	commit \| commitdiff \| tree
2026-01-10	Sigbjørn Skjæret	scripts : follow api redirects in pr2wt.sh (#18739)	commit \| commitdiff \| tree
2026-01-10	Xuan-Son Nguyen	preset: allow named remote preset (#18728)	commit \| commitdiff \| tree
2026-01-10	Aaron Teo	docs(ggml): update backend ops (#18734)	commit \| commitdiff \| tree
2026-01-10	Michael Wand	Corrected: changed s13 = src1->nb[3] instead of nb...	commit \| commitdiff \| tree
2026-01-10	Adrien Gallouët	common : add --license to display embedded licenses...	commit \| commitdiff \| tree
2026-01-09	Xuan-Son Nguyen	server: fix n_cmpl not skipping processing prompt ...	commit \| commitdiff \| tree
2026-01-09	Simranjeet...	mtmd: Add Gemma3n multimodal support with MobileNetV5...	commit \| commitdiff \| tree
2026-01-09	shaofeiqi	opencl: add EXPM1 op (#18704)	commit \| commitdiff \| tree
2026-01-09	Reese Levine	Updates to webgpu get_memory (#18707)	commit \| commitdiff \| tree
2026-01-09	Pascal	Webui/file upload (#18694)	commit \| commitdiff \| tree
2026-01-09	Asbjørn Olling	cmake: only build cli when server is enabled (#18670)	commit \| commitdiff \| tree
2026-01-09	Georgi Gerganov	server : fix timing of prompt/generation (#18713)	commit \| commitdiff \| tree
2026-01-09	Georgi Gerganov	scripts : pr2wt.sh reset to remote head (#18695)	commit \| commitdiff \| tree
2026-01-09	Georgi Gerganov	server : use different seeds for child completions...	commit \| commitdiff \| tree
2026-01-08	Xuan-Son Nguyen	common: support remote preset (#18520)	commit \| commitdiff \| tree
2026-01-08	Aaron Teo	llama: use host memory if device reports 0 memory ...	commit \| commitdiff \| tree
2026-01-08	Masashi Yoshimura	ggml-webgpu: Fix GGML_MEM_ALIGN to 8 for emscripten...	commit \| commitdiff \| tree
2026-01-08	Reese Levine	ggml webgpu: initial flashattention implementation...	commit \| commitdiff \| tree
2026-01-08	Jeff Bolz	vulkan: fix push constant size for quantize_q8_1 (...	commit \| commitdiff \| tree
2026-01-08	Jeff Bolz	vulkan: optimize ssm_scan (#18630)	commit \| commitdiff \| tree
2026-01-08	Adrien Gallouët	vendor : update cpp-httplib to 0.30.0 (#18660)	commit \| commitdiff \| tree
2026-01-08	Georgi Gerganov	scripts : support chaining commands in pr2wt.sh (#18671)	commit \| commitdiff \| tree
2026-01-08	도로로도로또	metal : add MoE kernel specialization for ne20=5 (...	commit \| commitdiff \| tree
2026-01-08	Johannes Gäßler	llama-fit-params: free memory target per device (#18679)	commit \| commitdiff \| tree
2026-01-08	Doctor Shotgun	ggml: add env var GGML_OP_OFFLOAD_MIN_BATCH (#18535)	commit \| commitdiff \| tree
2026-01-08	Daniel Bevenius	model-conversion : add warn about transformers mismatch...	commit \| commitdiff \| tree
2026-01-08	Daniel Bevenius	model-conversion : remove -st targets for converted...	commit \| commitdiff \| tree
2026-01-08	Julius Tischbein	llama : add `use_direct_io` flag for model loading...	commit \| commitdiff \| tree
2026-01-08	shaofeiqi	opencl: add FILL op support (#18682)	commit \| commitdiff \| tree
2026-01-07	Sigbjørn Skjæret	scripts : fix repos cloned with .git extension (#18669)	commit \| commitdiff \| tree
2026-01-07	Sigbjørn Skjæret	convert : more variants of rope_theta config entries...	commit \| commitdiff \| tree
2026-01-07	Oliver Walsh	cuda : fix build on cuda 12.8 (#18672)	commit \| commitdiff \| tree
2026-01-07	R	fix(docker): add missing libglvnd libraries to Vulkan...	commit \| commitdiff \| tree
2026-01-07	Adrien Gallouët	tools : remove llama-run (#18661)	commit \| commitdiff \| tree
2026-01-07	Georgi Gerganov	scripts : add pr2wt.sh (#18644)	commit \| commitdiff \| tree
2026-01-07	Daniel Bevenius	convert : clarify sentence-transformers-dense-modules...	commit \| commitdiff \| tree
2026-01-07	Sigbjørn Skjæret	ci : run cann build unconditionally [no ci] (#18659)	commit \| commitdiff \| tree
2026-01-07	Jeff Bolz	vulkan: reject ops when a tensor is too large to alloca...	commit \| commitdiff \| tree
2026-01-07	virajwad	vulkan: Warptile tuning for Intel Xe2/Xe3 (#18178)	commit \| commitdiff \| tree
2026-01-07	Eve	vulkan: more mul mat optimizations (#18533)	commit \| commitdiff \| tree
2026-01-07	Daniel Bevenius	examples : add debug utility/example (#18464)	commit \| commitdiff \| tree
2026-01-07	hipudding	CANN: Fix rename for get_env (#18652)	commit \| commitdiff \| tree
2026-01-07	Raul Torres	CANN: Rename `get_env` to `get_env_as_lowercase` (...	commit \| commitdiff \| tree
2026-01-07	Max Krasnyansky	Hexagon add support for f16/f32 flash attention, scale...	commit \| commitdiff \| tree
2026-01-06	Tarek Dakhran	mtmd: mtmd_audio_streaming_istft (#18645)	commit \| commitdiff \| tree
2026-01-06	Johannes Gäßler	llama-params-fit: fix last devices with low VRAM (...	commit \| commitdiff \| tree
2026-01-06	Aadeshveer...	ggml : optimize cuda ssm_scan using warp-level reductio...	commit \| commitdiff \| tree
2026-01-06	Xuan-Son Nguyen	arg: use CSV escape style for multiple-value args ...	commit \| commitdiff \| tree
2026-01-06	Jeff Bolz	vulkan: support buffer_from_host_ptr (#18467)	commit \| commitdiff \| tree
2026-01-06	Aman Gupta	ggml-cuda: refactor cuda graph usage (#18637)	commit \| commitdiff \| tree
2026-01-06	Beinsezii	mmq.cu: tune mmq/rocblas switching for RDNA (#18537)	commit \| commitdiff \| tree
2026-01-06	R	server : add thinking content blocks to Anthropic Messa...	commit \| commitdiff \| tree
2026-01-06	Christian Kastner	gguf-py : add requests to dependencies (#18629)	commit \| commitdiff \| tree
2026-01-06	Adrien Gallouët	ggml : fix avx512bf16 build (#18623)	commit \| commitdiff \| tree
2026-01-06	Raul Torres	CANN: Make `valid_values` variable `static const` ...	commit \| commitdiff \| tree
2026-01-05	nwyin	ggml webgpu: add CEIL operation support (#18605)	commit \| commitdiff \| tree
2026-01-05	Tarek Dakhran	model : add LFM2-ColBert-350M (#18607)	commit \| commitdiff \| tree
2026-01-05	Johannes Gäßler	CUDA: fix FA FP16 accumulator overflow for Granite...	commit \| commitdiff \| tree
2026-01-05	tt	add YoutuVLForConditionalGeneration architectures ...	commit \| commitdiff \| tree
2026-01-05	Aman Gupta	ggml-cuda: check for srcs outside the cgraph (#18583)	commit \| commitdiff \| tree
2026-01-05	Vladislav Sayapin	server : fix router child env in containerized environm...	commit \| commitdiff \| tree
2026-01-05	Jeff Bolz	vulkan: fix topk_moe_sigmoid_norm_bias failures in...	commit \| commitdiff \| tree
2026-01-05	Georgi Gerganov	models : fix backend assignment for Granite/Nemotron...	commit \| commitdiff \| tree
2026-01-05	Jeff Bolz	vulkan: handle quantize_q8_1 overflowing the max workgr...	commit \| commitdiff \| tree
2026-01-05	Sigbjørn Skjæret	llama : refactor rope_freq_base/scale_swa conversion...	commit \| commitdiff \| tree
2026-01-05	Chenguang Li	CANN: add operator fusion support for ADD + RMS_NORM...	commit \| commitdiff \| tree
2026-01-05	Francisco Herrera	doc: clarify that steps also apply to linux for opencl...	commit \| commitdiff \| tree
2026-01-05	Ali Tariq	ci : init git lfs in every build for RISC-V (#18590)	commit \| commitdiff \| tree
2026-01-04	Daniel Bevenius	sampling : add support for backend sampling (#17004)	commit \| commitdiff \| tree
2026-01-04	Tarek Dakhran	model : mtmd : make input norm optional in LFM2-VL...	commit \| commitdiff \| tree
2026-01-04	Aman Gupta	CUDA: disable cuda graph when using n-cpu-moe (#18593)	commit \| commitdiff \| tree
2026-01-04	Aman Gupta	ggml-cuda: remove unused params in ggml_cuda_graph...	commit \| commitdiff \| tree
2026-01-03	Aldehir Rojas	common/grammar : replace problematic backtracking regex...	commit \| commitdiff \| tree
2026-01-03	Georgi Gerganov	graph : fix graph reuse logic when `n_pos_per_embd...	commit \| commitdiff \| tree
2026-01-03	Aman Gupta	ggml-cuda: fixes for concurrent streams (#18496)	commit \| commitdiff \| tree
2026-01-03	Georgi Gerganov	context : fix reserve token padding to n_seqs (#18536)	commit \| commitdiff \| tree
2026-01-03	Johannes Gäßler	CUDA: only allocate FA tmp buffer if needed (#18564)	commit \| commitdiff \| tree
2026-01-03	pl752	(Bugfix, ggml-cuda) Pool alloc count fix + small size...	commit \| commitdiff \| tree
2026-01-03	Shouyu	ggml-hexagon: optimize activation function (#18393)	commit \| commitdiff \| tree
2026-01-02	Jeff Bolz	vulkan: Optimize GGML_OP_CUMSUM (#18417)	commit \| commitdiff \| tree
2026-01-02	Jeff Bolz	vulkan: Implement mmvq for iq1_s/iq1_m (#18450)	commit \| commitdiff \| tree
2026-01-02	Prabod	model : Maincoder-1B support (#18534)	commit \| commitdiff \| tree
2026-01-02	Georgi Gerganov	metal : adjust extra size for FA buffer to avoid reallo...	commit \| commitdiff \| tree
2026-01-02	Georgi Gerganov	graph : reduce topology branching (#18548)	commit \| commitdiff \| tree
2026-01-02	Georgi Gerganov	vocab : reduce debug logs about non-EOG control tokens...	commit \| commitdiff \| tree
2026-01-02	Chris Rohlf	rpc : use unordered_map::reserve and emplace (#18513)	commit \| commitdiff \| tree
2026-01-01	MeeMin	cuda : fix copy of large tensors (ggml_nbytes <= INT_MA...	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom