git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2025-08-18	Georgi Gerganov	scripts : update sync scripts	commit \| commitdiff \| tree
2025-08-18	Sigbjørn Skjæret	llama : merge conts and reshapes and remove unnecessary...	commit \| commitdiff \| tree
2025-08-18	Georgi Gerganov	readme : update hot topics (#15397)	commit \| commitdiff \| tree
2025-08-18	davidef	server : fix incoming tasks not process in order (...	commit \| commitdiff \| tree
2025-08-18	Dobri Danchev	Fix broken build: require updated pip to support -...	commit \| commitdiff \| tree
2025-08-18	compilade	ggml-quants : fix make_qp_quants NANs and IQ1 assertion...	commit \| commitdiff \| tree
2025-08-18	Jeff Bolz	vulkan: disable spirv-opt for bfloat16 shaders (#15352)	commit \| commitdiff \| tree
2025-08-17	Oleksandr Kuvshynov	server : export max observed n_past value (#15361)	commit \| commitdiff \| tree
2025-08-17	Jeff Bolz	vulkan: Use larger workgroups for mul_mat_vec when...	commit \| commitdiff \| tree
2025-08-17	Dong Won Kim	vulkan: support sqrt (#15370)	commit \| commitdiff \| tree
2025-08-17	Sigbjørn Skjæret	convert : force patch_embd weights to F16 or F32 to...	commit \| commitdiff \| tree
2025-08-17	Sigbjørn Skjæret	ci : fix hang in windows-hip build/release (#15365)	commit \| commitdiff \| tree
2025-08-17	Jeff Bolz	vulkan: Optimize argsort (#15354)	commit \| commitdiff \| tree
2025-08-16	Tarek Dakhran	model : support vision LiquidAI LFM2-VL family (#15347)	commit \| commitdiff \| tree
2025-08-16	Jeff Bolz	vulkan: fuse adds (#15252)	commit \| commitdiff \| tree
2025-08-16	Jeff Bolz	vulkan: Support mul_mat_id with f32 accumulators (...	commit \| commitdiff \| tree
2025-08-16	Jeff Bolz	vulkan: Add missing bounds checking to scalar/coopmat1...	commit \| commitdiff \| tree
2025-08-16	rmatif	OpenCL: add initial FA support (#14987)	commit \| commitdiff \| tree
2025-08-15	Daniel Bevenius	common : fix double bos, use common_chat_templates...	commit \| commitdiff \| tree
2025-08-15	lhez	opencl: add initial mxfp4 support via mv (#15270)	commit \| commitdiff \| tree
2025-08-15	Georgi Gerganov	vulkan : fix out-of-bounds access in argmax kernel...	commit \| commitdiff \| tree
2025-08-15	Georgi Gerganov	vulkan : fix compile warnings on macos (#15340)	commit \| commitdiff \| tree
2025-08-15	Aaron Teo	ggml: initial IBM zDNN backend (#14975)	commit \| commitdiff \| tree
2025-08-15	Sigbjørn Skjæret	ci : fix ios-xcode-build (#15324)	commit \| commitdiff \| tree
2025-08-15	Diego Devesa	ci : move ccache action to ggml-org fork (#15328)	commit \| commitdiff \| tree
2025-08-15	Johannes Gäßler	test-opt: fix backend support check (#15317)	commit \| commitdiff \| tree
2025-08-14	Johannes Gäßler	CUDA: fix negative KV_max values in FA (#15321)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	eval-callback : stop on first NaN (#15320)	commit \| commitdiff \| tree
2025-08-14	Diego Devesa	chat : include kwargs in template example (#15309)	commit \| commitdiff \| tree
2025-08-14	Daniel Bevenius	llama : add 18-layer model type for Gemma 3-270m (...	commit \| commitdiff \| tree
2025-08-14	simevo	devops : fix compile bug when the BASE_CUDA_DEV_CONTAIN...	commit \| commitdiff \| tree
2025-08-14	uvos	HIP: Cleanup hipification header (#15285)	commit \| commitdiff \| tree
2025-08-14	Aldehir Rojas	gpt-oss: implement harmony parsing (#15181) upstream/0.0.6164	commit \| commitdiff \| tree
2025-08-14	Christian Kastner	docker : Enable GGML_CPU_ALL_VARIANTS for ARM (#15267)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	readme : update hot topics (#15315)	commit \| commitdiff \| tree
2025-08-14	Jeff Bolz	vulkan: perf_logger improvements (#15246)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	server : add SWA checkpoints (#15293)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-08-14	Jason Ni	ggml: fix ggml_conv_1d_dw bug (ggml/1323)	commit \| commitdiff \| tree
2025-08-14	Georgi Gerganov	tests : remove unused includes (ggml/0)	commit \| commitdiff \| tree
2025-08-14	kallewoof	perplexity : provide a helpful hint for has_cpl case...	commit \| commitdiff \| tree
2025-08-14	Sigbjørn Skjæret	cuda : fix GGML_CUDA_GRAPHS=OFF (#15300)	commit \| commitdiff \| tree
2025-08-14	Jonathan Graehl	finetune: SGD optimizer, more CLI args (#13873)	commit \| commitdiff \| tree
2025-08-14	kallewoof	perplexity: give more information about constraints...	commit \| commitdiff \| tree
2025-08-13	uvos	HIP: bump requirement to rocm 6.1 (#15296)	commit \| commitdiff \| tree
2025-08-13	Bas Nijholt	fix(nix): remove non-functional llama-cpp cachix cache...	commit \| commitdiff \| tree
2025-08-13	Sigbjørn Skjæret	server : enable -td and -tbd parameters (#15172)	commit \| commitdiff \| tree
2025-08-13	Judd	ggml : update `ggml_rope_multi` (#12665)	commit \| commitdiff \| tree
2025-08-13	Copilot	common : add --override-tensor-draft, --cpu-moe-draft...	commit \| commitdiff \| tree
2025-08-13	Aldehir Rojas	server : filter out harmony thought messages (#15278)	commit \| commitdiff \| tree
2025-08-13	Ali Tariq	ci : Added CI with RISC-V RVV1.0 Hardware (#14439)	commit \| commitdiff \| tree
2025-08-13	Sigbjørn Skjæret	ci : add more python requirements to copilot-setup...	commit \| commitdiff \| tree
2025-08-13	Georgi Gerganov	ggml : repack block_iq4_nlx8 (#14904)	commit \| commitdiff \| tree
2025-08-13	Oliver Simons	CUDA: Optimize `reduce_rows_f32` kernel, leading up...	commit \| commitdiff \| tree
2025-08-13	Sigbjørn Skjæret	ci : add copilot-setup-steps.yml (#15214)	commit \| commitdiff \| tree
2025-08-13	Tak-RS	ggml-rpc: chunk send()/recv() to avoid EINVAL for very...	commit \| commitdiff \| tree
2025-08-12	uvos	HIP: disable sync warp shuffel operators from clr amd_w...	commit \| commitdiff \| tree
2025-08-12	Romain Biessy	sycl: Fix and disable more configurations of mul_mat...	commit \| commitdiff \| tree
2025-08-12	rmatif	opencl: allow mixed f16/f32 `add` (#15140)	commit \| commitdiff \| tree
2025-08-12	Aman Gupta	CUDA cmake: add `-lineinfo` for easier debug (#15260)	commit \| commitdiff \| tree
2025-08-12	Chenguang Li	CANN: GGML_OP_CPY optimization (#15070)	commit \| commitdiff \| tree
2025-08-12	R0CKSTAR	musa: fix failures in test-backend-ops for mul_mat_id...	commit \| commitdiff \| tree
2025-08-11	hipudding	CANN: Add broadcast for softmax and FA (#15208)	commit \| commitdiff \| tree
2025-08-11	rainred	mtmd : Fix MinicpmV model converter and clip to avoid...	commit \| commitdiff \| tree
2025-08-11	Xuan-Son Nguyen	chat : hotfix gpt-oss jinja raising an exception (...	commit \| commitdiff \| tree
2025-08-11	Xuan-Son Nguyen	server : allow specifying reasoning_format in HTTP...	commit \| commitdiff \| tree
2025-08-11	Zagaj	readme : update infra list (#15234)	commit \| commitdiff \| tree
2025-08-11	Georgi Gerganov	kv-cache : fix seq_rm with seq_id == -1 (#15226)	commit \| commitdiff \| tree
2025-08-11	Daniel Bevenius	kv-cache : log (debug) all streams in find_slot (#15176)	commit \| commitdiff \| tree
2025-08-11	Sigbjørn Skjæret	convert : fix merge conflicts (#15229)	commit \| commitdiff \| tree
2025-08-11	Daniel Bevenius	perplexity : update comments/error msg to use decode...	commit \| commitdiff \| tree
2025-08-11	Julien Denize	convert : improve Mistral models integration (#14737)	commit \| commitdiff \| tree
2025-08-11	Charles Xu	kleidiai: fix unsigned overflow bug (#15150)	commit \| commitdiff \| tree
2025-08-09	David Zhao	cuda: refactored ssm_scan and use CUB (#13291)	commit \| commitdiff \| tree
2025-08-09	Aman Gupta	CUDA: add attention sinks for tile and wmma (#15178)	commit \| commitdiff \| tree
2025-08-08	compilade	gguf-py : add Numpy MXFP4 de/quantization support ...	commit \| commitdiff \| tree
2025-08-08	Johannes Gäßler	server-bench: external OAI servers, sqlite (#15179)	commit \| commitdiff \| tree
2025-08-08	AN Long	ggml : fix field name when new ggml_backend (#14944)	commit \| commitdiff \| tree
2025-08-08	Olivier Chafik	vendor: sync minja (#15161)	commit \| commitdiff \| tree
2025-08-08	Johannes Gäßler	CUDA: attention sinks for mma FlashAttention (#15157)	commit \| commitdiff \| tree
2025-08-08	lhez	opencl: support sink in `soft_max` (attn sinks) (#15152)	commit \| commitdiff \| tree
2025-08-07	Xuan-Son Nguyen	convert : support non-mxfp4 HF model (#15153)	commit \| commitdiff \| tree
2025-08-07	Jeff Bolz	vulkan: support fattn sinks (#15126)	commit \| commitdiff \| tree
2025-08-07	Jeff Bolz	vulkan: Add env var to disable host visible vidmem...	commit \| commitdiff \| tree
2025-08-07	RunningLeon	llama : Support intern-s1 (#14875)	commit \| commitdiff \| tree
2025-08-07	uvos	HIP: add cmake option to enable compiler output of...	commit \| commitdiff \| tree
2025-08-07	Christian Kastner	ggml: Skip backend library linking code when GGML_BACKE...	commit \| commitdiff \| tree
2025-08-07	Johannes Gäßler	CUDA: GEMM for FP32/FP16/BF16 and ne11 <= 16 (#15131)	commit \| commitdiff \| tree
2025-08-07	Johannes Gäßler	scripts: fix crash when --tool is not set (#15133)	commit \| commitdiff \| tree
2025-08-07	Daniel Bevenius	requirements : fix PyTorch uint64 compatibility (#15134)	commit \| commitdiff \| tree
2025-08-06	Reese Levine	ggml: Add basic SET_ROWS support in WebGPU (#15137)	commit \| commitdiff \| tree
2025-08-06	rmatif	fix profiling crash (#15072)	commit \| commitdiff \| tree
2025-08-06	lhez	opencl: add `swiglu_oai` and `add_id` (#15121)	commit \| commitdiff \| tree
2025-08-06	Sachin Desai	chat : support Granite model reasoning and tool call...	commit \| commitdiff \| tree
2025-08-06	Juk Armstrong	Fixed name `-override-tensors` to `-override-tensor...	commit \| commitdiff \| tree
2025-08-06	Diego Devesa	ggml : fix fallback to CPU for ununsupported ops (...	commit \| commitdiff \| tree
2025-08-06	Sigbjørn Skjæret	chat : fix yandex chat template (#15116)	commit \| commitdiff \| tree
2025-08-06	stevenkuang	chat : fix hunyuan auto-detection (#15114)	commit \| commitdiff \| tree
2025-08-06	Chenguang Li	CANN: add support for ACL Graph (#15065)	commit \| commitdiff \| tree
2025-08-05	Reese Levine	ggml: WebGPU disable SET_ROWS for now (#15078)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom