git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog

2026-04-01	uvos	CUDA/HIP: Fix kernel slection for mmvq mmid kernel...
2026-04-01	Georgi Gerganov	ggml : fix RWKV ops thread assignment (llama/21226)
2026-04-01	Taimur Ahmad	ggml-cpu: fix fallback for RVV kernels without zvfh...
2026-04-01	Anav Prasad	CUDA: Add Flash Attention Support for Head Dimension...
2026-04-01	Reese Levine	ggml webgpu: quantized buffers to u32 + wider browser...
2026-04-01	Georgi Gerganov	sync : llama.cpp
2026-04-01	Abhijit Ramesh	ggml-webgpu: port all AOT operators to JIT (llama/20728)
2026-04-01	Georgi Gerganov	sync : llama.cpp
2026-04-01	hipudding	CANN: fix multi-thread set_tensor race conditions ...
2026-04-01	Neo Zhang	sycl : enhance fattn perf (llama/21185)
2026-04-01	shaofeiqi	opencl: add q4_K gemm and gemv kernels for Adreno ...
2026-04-01	Oliver Simons	CUDA : Fix CUB's argsort when nrows % block_size =...
2026-04-01	Radoslav Gerganov	rpc : fix misleading error log (llama/21184)
2026-04-01	Gaurav Garg	Optimize MOE GEMV kernel for BS > 1. (llama/20905)
2026-04-01	Max Krasnyansky	hexagon: dma optimizations (mostly fixing regressions...
2026-03-30	Georgi Gerganov	ggml : bump version to 0.9.9 (#1449) v0.9.9
2026-03-30	Georgi Gerganov	sync : whisper.cpp
2026-03-28	Georgi Gerganov	sync : llama.cpp
2026-03-28	Ruben Ortlam	vulkan: add noncontiguous GLU support (llama/21081)
2026-03-28	Yiwei Shao	hexagon: support for IQ4_NL and MXFP4 (llama/21018)
2026-03-28	Radoslav Gerganov	rpc : proper handling of data pointers to CPU buffers...
2026-03-28	ren	metal : Fix dimension constraint violation in matmul2d...
2026-03-28	uvos	hip: use fnuz fp8 for conversion on CDNA3 (llama/21040)
2026-03-28	lhez	opencl: allow large buffer for adreno (llama/20997)
2026-03-28	ihb2032	fix(ggml): correct RISC-V ISA string canonical ordering...
2026-03-28	Michael Wand	ggml-cuda: Add NVFP4 dp4a kernel (llama/20644)
2026-03-28	Yihao Wang	CUDA & CPU: support F32 kernel type for `CONV_TRANSPOSE...
2026-03-28	Saba Fallah	mtmd: Add DeepSeekOCR Support (llama/17400)
2026-03-28	Johannes Gäßler	llama: fix llama-model-saver (llama/20503)
2026-03-28	Neo Zhang	sycl : fix wrong variable check by assert (llama/20903)
2026-03-28	nuri	metal : add FLOOR, CEIL, ROUND, TRUNC unary ops (llama...
2026-03-28	Georgi Gerganov	metal : add FA instantiations for HSK=512, HSV=512...
2026-03-28	Max Krasnyansky	hexagon: general DMA and Binary Op fixes for large...
2026-03-28	lhez	opencl: add q6_K gemm and gemv kernels for Adreno ...
2026-03-28	las7	rpc : RCE patch (llama/20908)
2026-03-28	Rashid Ul Islam	metal: add CONV_3D (llama/19927)
2026-03-28	Chenguang Li	CANN: add RoPE cache preload before ACL graph capture...
2026-03-28	Dan Hoffman	fix(openvino): explicit memset in buffer_context alloca...
2026-03-28	shaofeiqi	opencl: add flattened Q4_K mv and general Q4_K mm ...
2026-03-28	Johannes Gäßler	CUDA: fix BF16 FA compilation (llama/20865)
2026-03-28	Neo Zhang	support bf16 and quantized type (llama/20803)
2026-03-28	Patrick Buckley	ggml-cuda: native bf16 flash attention for vec kernel...
2026-03-28	Gaurav Garg	Increase number of output elements per-thread block...
2026-03-28	y198	fix(rpc): prevent division by zero in deserialize_tenso...
2026-03-28	Matt Corallo	Add shader count for Intel Arc Pro B60 (llama/20818)
2026-03-28	shalinib-ibm	ggml-cpu: add always_inline to tinyBLAS_PPC accumulator...
2026-03-28	Jeff Bolz	vulkan: change gated_delta_net to shard a column across...
2026-03-28	hipudding	CANN: add BF16 support for core operators (llama/20152)
2026-03-28	Sundaram krishnan	ggml: guard KleidiAI DOWNLOAD_EXTRACT_TIMESTAMP for...
2026-03-28	Rail Chabdarov	hip: Avoid compiler bug in RDNA code generation during...
2026-03-28	Yiwei Shao	hexagon: add Matrix Extensions (HMX) for Hexagon NPU...
2026-03-28	uvos	ci : add hip quality check (llama/20430)
2026-03-28	Reese Levine	ggml webgpu: ops support for qwen3.5 (SET, TRI_SOLVE...
2026-03-28	Eve	vulkan: dequantize iq4_xs 4 at a time (llama/20657)
2026-03-28	Charles Xu	cmake : fix build warning when kleidiai is enabled...
2026-03-28	Chenguang Li	CANN: handle in-place ROPE on non-contiguous f32 tensor...
2026-03-28	Georgi Gerganov	sync : llama.cpp
2026-03-28	Masashi Yoshimura	ggml-webgpu: Update the `RMS_NORM` preprocessor and...
2026-03-28	Georgi Gerganov	sync : llama.cpp
2026-03-28	Masashi Yoshimura	ggml-webgpu: Add supports for `DIAG` and `TRI` (llama...
2026-03-28	Chenguang Li	CANN: support flash attention for head dim not multiple...
2026-03-28	Reese Levine	Move to no timeout for WaitAny in graph submission...
2026-03-28	Shaw Nguyen	ggml-cpu/x86: fix unused changemask warning in repack...
2026-03-28	uvos	HIP : ignore return of hipMemAdvise [no ci] (llama...
2026-03-28	Krishna Sridhar	hexagon: add neg, exp, sigmoid, softplus ops, cont...
2026-03-28	Ruben Ortlam	vulkan: disable mmvq on Intel Windows driver (llama...
2026-03-28	Kevin Hannon	ggml-blas: set mkl threads from thread context (llama...
2026-03-28	Taimur Ahmad	ggml-cpu: fix RVV checks in quants and repacking (llama...
2026-03-28	Ruben Ortlam	vulkan: async and event fixes (llama/20518)
2026-03-28	Justin Bradford	kleidiai : fix MUL_MAT support for batched (3D) inputs...
2026-03-28	Ruben Ortlam	vulkan: allow graphics queue only through env var ...
2026-03-28	Neo Zhang	ehance UPSCALE to support all UT cases (llama/20637)
2026-03-28	Martin Klacer	kleidiai: add data type check to get_tensor_traits...
2026-03-28	Ruben Ortlam	vulkan: fix flash attention dot product precision ...
2026-03-28	Aman Gupta	CUDA: GDN hide memory latency (llama/20537)
2026-03-28	Sigbjørn Skjæret	sycl : fix for untransposed GDA recurrent state (llama...
2026-03-16	Georgi Gerganov	ci : disable AMX jobs
2026-03-16	Georgi Gerganov	ggml : bump version to 0.9.8 (#1442) v0.9.8
2026-03-16	Georgi Gerganov	ggml : restore ggml_type_sizef() to aboid major version...
2026-03-16	Georgi Gerganov	readme : simplify
2026-03-16	Georgi Gerganov	sync : whisper.cpp
2026-03-16	Georgi Gerganov	ggml : try fix arm build (whisper/0)
2026-03-15	David366AI	ggml : extend im2col f16 (#1434)
2026-03-15	Georgi Gerganov	common : add nvfp4 (#0)
2026-03-15	Georgi Gerganov	sync : llama.cpp
2026-03-15	Johannes Gäßler	CUDA: limit number of FA stream-k CUDA blocks (llama...
2026-03-15	Pascal	ggml: avoid creating CUDA context during device init...
2026-03-15	MoonShadow	ggml/hip: fix APU compatibility - soft error handling...
2026-03-15	Bartowski	ggml : guard against sumq2 being 0 in IQ4_NL (llama...
2026-03-15	PikaPikachu	cuda : add RDNA4-specific MMVQ parameter table for...
2026-03-15	Ruben Ortlam	vulkan: use graphics queue on AMD (llama/20551)
2026-03-15	Georgi Gerganov	metal : add FA specialization for HSK = 320, HSV =...
2026-03-15	Max Krasnyansky	hexagon: Q4_0 and MXFP4 repack fixes (llama/20527)
2026-03-15	Neo Zhang	add op gated_delta_net (llama/20455)
2026-03-15	Adrien Gallouët	ggml : add native AVX512-FP16 support for F16 operation...
2026-03-15	Wallentri	Use fp32 in cuBLAS V100 to avoid overflows, env variabl...
2026-03-15	Zijun Yu	ggml : add OpenVINO backend (llama/15307)
2026-03-15	Rail Chabdarov	Fix data race in CUDA's "cpy" kernel (influences GGML...
2026-03-15	lhez	opencl: fix l2_norm (llama/20480)
2026-03-15	Georgi Gerganov	graph : remove redundant GDN state transposes (llama...

Packaging of ggml-org/ggml
