git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2025-08-14	Georgi Gerganov	eval-callback : stop on first NaN (#15320)	commit \| commitdiff \| tree
2025-08-14	Diego Devesa	chat : include kwargs in template example (#15309)	commit \| commitdiff \| tree
2025-08-14	Daniel Bevenius	llama : add 18-layer model type for Gemma 3-270m (...	commit \| commitdiff \| tree
2025-08-14	simevo	devops : fix compile bug when the BASE_CUDA_DEV_CONTAIN...	commit \| commitdiff \| tree
2025-08-14	uvos	HIP: Cleanup hipification header (#15285)	commit \| commitdiff \| tree
2025-08-14	Aldehir Rojas	gpt-oss: implement harmony parsing (#15181) upstream/0.0.6164	commit \| commitdiff \| tree
2025-08-14	Christian Kastner	docker : Enable GGML_CPU_ALL_VARIANTS for ARM (#15267)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	readme : update hot topics (#15315)	commit \| commitdiff \| tree
2025-08-14	Jeff Bolz	vulkan: perf_logger improvements (#15246)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	server : add SWA checkpoints (#15293)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-08-14	Jason Ni	ggml: fix ggml_conv_1d_dw bug (ggml/1323)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	tests : remove unused includes (ggml/0)	commit \| commitdiff \| tree
2025-08-14	kallewoof	perplexity : provide a helpful hint for has_cpl case...	commit \| commitdiff \| tree
2025-08-14	Sigbjørn Skjæret	cuda : fix GGML_CUDA_GRAPHS=OFF (#15300)	commit \| commitdiff \| tree
2025-08-14	Jonathan Graehl	finetune: SGD optimizer, more CLI args (#13873)	commit \| commitdiff \| tree
2025-08-14	kallewoof	perplexity: give more information about constraints...	commit \| commitdiff \| tree
2025-08-13	uvos	HIP: bump requirement to rocm 6.1 (#15296)	commit \| commitdiff \| tree
2025-08-13	Bas Nijholt	fix(nix): remove non-functional llama-cpp cachix cache...	commit \| commitdiff \| tree
2025-08-13	Sigbjørn Skjæret	server : enable -td and -tbd parameters (#15172)	commit \| commitdiff \| tree
2025-08-13	Judd	ggml : update `ggml_rope_multi` (#12665)	commit \| commitdiff \| tree
2025-08-13	Copilot	common : add --override-tensor-draft, --cpu-moe-draft...	commit \| commitdiff \| tree
2025-08-13	Aldehir Rojas	server : filter out harmony thought messages (#15278)	commit \| commitdiff \| tree
2025-08-13	Ali Tariq	ci : Added CI with RISC-V RVV1.0 Hardware (#14439)	commit \| commitdiff \| tree
2025-08-13	Sigbjørn Skjæret	ci : add more python requirements to copilot-setup...	commit \| commitdiff \| tree
2025-08-13	Georgi Gerganov	ggml : repack block_iq4_nlx8 (#14904)	commit \| commitdiff \| tree
2025-08-13	Oliver Simons	CUDA: Optimize `reduce_rows_f32` kernel, leading up...	commit \| commitdiff \| tree
2025-08-13	Sigbjørn Skjæret	ci : add copilot-setup-steps.yml (#15214)	commit \| commitdiff \| tree
2025-08-13	Tak-RS	ggml-rpc: chunk send()/recv() to avoid EINVAL for very...	commit \| commitdiff \| tree
2025-08-12	uvos	HIP: disable sync warp shuffel operators from clr amd_w...	commit \| commitdiff \| tree
2025-08-12	Romain Biessy	sycl: Fix and disable more configurations of mul_mat...	commit \| commitdiff \| tree
2025-08-12	rmatif	opencl: allow mixed f16/f32 `add` (#15140)	commit \| commitdiff \| tree
2025-08-12	Aman Gupta	CUDA cmake: add `-lineinfo` for easier debug (#15260)	commit \| commitdiff \| tree
2025-08-12	Chenguang Li	CANN: GGML_OP_CPY optimization (#15070)	commit \| commitdiff \| tree
2025-08-12	R0CKSTAR	musa: fix failures in test-backend-ops for mul_mat_id...	commit \| commitdiff \| tree
2025-08-11	hipudding	CANN: Add broadcast for softmax and FA (#15208)	commit \| commitdiff \| tree
2025-08-11	rainred	mtmd : Fix MinicpmV model converter and clip to avoid...	commit \| commitdiff \| tree
2025-08-11	Xuan-Son Nguyen	chat : hotfix gpt-oss jinja raising an exception (...	commit \| commitdiff \| tree
2025-08-11	Xuan-Son Nguyen	server : allow specifying reasoning_format in HTTP...	commit \| commitdiff \| tree
2025-08-11	Zagaj	readme : update infra list (#15234)	commit \| commitdiff \| tree
2025-08-11	Georgi Gerganov	kv-cache : fix seq_rm with seq_id == -1 (#15226)	commit \| commitdiff \| tree
2025-08-11	Daniel Bevenius	kv-cache : log (debug) all streams in find_slot (#15176)	commit \| commitdiff \| tree
2025-08-11	Sigbjørn Skjæret	convert : fix merge conflicts (#15229)	commit \| commitdiff \| tree
2025-08-11	Daniel Bevenius	perplexity : update comments/error msg to use decode...	commit \| commitdiff \| tree
2025-08-11	Julien Denize	convert : improve Mistral models integration (#14737)	commit \| commitdiff \| tree
2025-08-11	Charles Xu	kleidiai: fix unsigned overflow bug (#15150)	commit \| commitdiff \| tree
2025-08-09	David Zhao	cuda: refactored ssm_scan and use CUB (#13291)	commit \| commitdiff \| tree
2025-08-09	Aman Gupta	CUDA: add attention sinks for tile and wmma (#15178)	commit \| commitdiff \| tree
2025-08-08	compilade	gguf-py : add Numpy MXFP4 de/quantization support ...	commit \| commitdiff \| tree
2025-08-08	Johannes Gäßler	server-bench: external OAI servers, sqlite (#15179)	commit \| commitdiff \| tree
2025-08-08	AN Long	ggml : fix field name when new ggml_backend (#14944)	commit \| commitdiff \| tree
2025-08-08	Olivier Chafik	vendor: sync minja (#15161)	commit \| commitdiff \| tree
2025-08-08	Johannes Gäßler	CUDA: attention sinks for mma FlashAttention (#15157)	commit \| commitdiff \| tree
2025-08-08	lhez	opencl: support sink in `soft_max` (attn sinks) (#15152)	commit \| commitdiff \| tree
2025-08-07	Xuan-Son Nguyen	convert : support non-mxfp4 HF model (#15153)	commit \| commitdiff \| tree
2025-08-07	Jeff Bolz	vulkan: support fattn sinks (#15126)	commit \| commitdiff \| tree
2025-08-07	Jeff Bolz	vulkan: Add env var to disable host visible vidmem...	commit \| commitdiff \| tree
2025-08-07	RunningLeon	llama : Support intern-s1 (#14875)	commit \| commitdiff \| tree
2025-08-07	uvos	HIP: add cmake option to enable compiler output of...	commit \| commitdiff \| tree
2025-08-07	Christian Kastner	ggml: Skip backend library linking code when GGML_BACKE...	commit \| commitdiff \| tree
2025-08-07	Johannes Gäßler	CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)	commit \| commitdiff \| tree
2025-08-07	Johannes Gäßler	scripts: fix crash when --tool is not set (#15133)	commit \| commitdiff \| tree
2025-08-07	Daniel Bevenius	requirements : fix PyTorch uint64 compatibility (#15134)	commit \| commitdiff \| tree
2025-08-06	Reese Levine	ggml: Add basic SET_ROWS support in WebGPU (#15137)	commit \| commitdiff \| tree
2025-08-06	rmatif	fix profiling crash (#15072)	commit \| commitdiff \| tree
2025-08-06	lhez	opencl: add `swiglu_oai` and `add_id` (#15121)	commit \| commitdiff \| tree
2025-08-06	Sachin Desai	chat : support Granite model reasoning and tool call...	commit \| commitdiff \| tree
2025-08-06	Juk Armstrong	Fixed name `-override-tensors` to `-override-tensor...	commit \| commitdiff \| tree
2025-08-06	Diego Devesa	ggml : fix fallback to CPU for ununsupported ops (...	commit \| commitdiff \| tree
2025-08-06	Sigbjørn Skjæret	chat : fix yandex chat template (#15116)	commit \| commitdiff \| tree
2025-08-06	stevenkuang	chat : fix hunyuan auto-detection (#15114)	commit \| commitdiff \| tree
2025-08-06	Chenguang Li	CANN: add support for ACL Graph (#15065)	commit \| commitdiff \| tree
2025-08-05	Reese Levine	ggml: WebGPU disable SET_ROWS for now (#15078)	commit \| commitdiff \| tree
2025-08-05	Georgi Gerganov	llama : add gpt-oss (#15091)	commit \| commitdiff \| tree
2025-08-05	Sigbjørn Skjæret	chat : only remove double bos/eos if added (#15086)	commit \| commitdiff \| tree
2025-08-05	Georgi Gerganov	readme : update hot topics (#15097)	commit \| commitdiff \| tree
2025-08-05	Romain Biessy	sycl: fix mul_mat selection (#15092)	commit \| commitdiff \| tree
2025-08-05	Juk Armstrong	Fix `glm4moe` bug (#15088)	commit \| commitdiff \| tree
2025-08-05	Alex Wu	webui: fix markdown table (#15081)	commit \| commitdiff \| tree
2025-08-05	compilade	context : fix index overflow on huge outputs (#15080)	commit \| commitdiff \| tree
2025-08-04	Diego Devesa	llama : add --n-cpu-moe option (#15077)	commit \| commitdiff \| tree
2025-08-04	compilade	imatrix : warn when GGUF imatrix is saved without ...	commit \| commitdiff \| tree
2025-08-04	Christian Kastner	cmake: Add GGML_BACKEND_DIR option (#15074)	commit \| commitdiff \| tree
2025-08-04	Sigbjørn Skjæret	gguf-py : add --chat-template-file to gguf_new_metadata...	commit \| commitdiff \| tree
2025-08-04	Sam	model: support GLM 4.5 family of models (#14939)	commit \| commitdiff \| tree
2025-08-04	Sigbjørn Skjæret	quantize : fix confusing error message if ftype is...	commit \| commitdiff \| tree
2025-08-04	Reese Levine	ggml: WebGPU backend host improvements and style fixing...	commit \| commitdiff \| tree
2025-08-04	Jeff Bolz	vulkan: fix build when using glslang that does not...	commit \| commitdiff \| tree
2025-08-03	compilade	imatrix : use GGUF by default (#14842)	commit \| commitdiff \| tree
2025-08-03	compilade	imatrix : fix 3d activation handling for hybrid and...	commit \| commitdiff \| tree
2025-08-03	compilade	memory : handle kv_unified for hybrid models (#15050)	commit \| commitdiff \| tree
2025-08-03	Csaba Kecskemeti	vocab : JetBrains Mellum pre-tokenizer (#15045)	commit \| commitdiff \| tree
2025-08-03	Gabriel Larson	model : add text-only support for Kimi-VL (and find...	commit \| commitdiff \| tree
2025-08-03	Jeff Bolz	vulkan: Use coopmat2 for conv2d (#14982)	commit \| commitdiff \| tree
2025-08-02	lhez	opencl: fix adreno compiler detection logic (#15029)	commit \| commitdiff \| tree
2025-08-02	Johannes Gäßler	CUDA: use mma FA kernel for gqa > 4 on RTX 4000 (#15035)	commit \| commitdiff \| tree
2025-08-02	leejet	cuda: make im2col a little faster (#15025) upstream/0.0.6073	commit \| commitdiff \| tree
2025-08-02	Daniel Bevenius	kv-cache : skip alignment of n_stream in kv-cache log...	commit \| commitdiff \| tree
2025-08-02	Georgi Gerganov	llama : enable LLAMA_SET_ROWS=1 by default (#14959)	commit \| commitdiff \| tree
2025-08-02	Georgi Gerganov	cuda, sycl : fix batched gemm when ne02 == 1 && ne03...	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom