git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog



2025-09-05	Jeff Bolz	vulkan: Skip syncing for prealloc_y when it is reused...
2025-09-05	Chenguang Li	CANN: FIx compiler warnings (llama/15661)
2025-09-05	Aman Gupta	CUDA: fix bug in rms_norm fusion (llama/15660)
2025-09-05	Aman Gupta	CUDA: fuse adds, fuse add with rms norm (llama/15631)
2025-09-05	mnehete32	CUDA: add conv2d (llama/15635)
2025-09-05	Aaron Teo	ggml-cpu: fix invalid hsum build in debug s390x (llama...
2025-09-05	compilade	ggml : fix SSM_SCAN for n_groups > 1 (llama/15625)
2025-09-05	Georgi Gerganov	kv-cache : remove LLAMA_SET_ROWS checks (llama/15505)
2025-09-05	matiaslin	cuda: Add cublasLt_static linking when GGML_STATIC...
2025-09-05	uvos	HIP: Enable support for ggml_backend_cuda_register_host...
2025-09-05	Chenguang Li	CANN: refactor mask handling and improve performance...
2025-09-05	xctan	ggml-cpu : add basic RVV support for vector f32 ops...
2025-09-05	rmatif	OpenCL: add fused group_norm/norm, mul, add (llama...
2025-09-05	Diego Devesa	tests : fix test-opt with GGML_BACKEND_DL (llama/15599)
2025-09-05	Akarshan Biswas	SYCL: fix rms_norm_mul_add for tensor dim not a multipl...
2025-09-05	Eve	tests: add performance test for mul mat id (llama/15543)
2025-09-05	shalinib-ibm	llamafile: PowerPC Sgemm Optimization (llama/15558)
2025-09-05	Johannes Gäßler	CUDA: return -1 for nonexistent compiled arch (llama...
2025-09-05	Georgi Gerganov	metal : optimize FA vec for large sequences and BS...
2025-09-05	Georgi Gerganov	metal : improve `MUL_MAT_ID` (llama/15541)
2025-09-05	Sigbjørn Skjæret	metal : remove contiguous assertion for src0 in IM2COL...
2025-09-05	Yoshi_likes_e4	Add a warning for special devices (llama/15563)
2025-09-05	Jeff Bolz	vulkan: Remove splitting for mul_mat_id (llama/15568)
2025-09-05	Qeeweew	CUDA: Accelerate MXFP4 table lookup using `__byte_perm...
2025-09-05	lhez	opencl: fix support ops condition for `rms_norm` (llama...
2025-09-05	Ruben Ortlam	vulkan: fix min subgroup 16 condition for mmid subgroup...
2025-09-05	Jeff Bolz	tests: Generate unique input values for count_equal...
2025-09-05	Ihar Hrachyshka	metal: fix regression when no metal devices are present...
2025-09-05	Johannes Gäßler	CUDA: MoE helper in device code, better tile sizes...
2025-09-05	Georgi Gerganov	metal : add FA kernels for HS=40 (llama/15559)
2025-09-05	Chenguang Li	CANN: ROPE cache sin/cos repeat (llama/15501)
2025-09-05	Ruben Ortlam	vulkan: apply MUL_MAT_ID subgroup optimization to non...
2025-09-05	Jeff Bolz	vulkan: Support FA with any multiple of 8 head sizes...
2025-09-05	Ruben Ortlam	vulkan: enable Conv2D for Apple after MoltenVK fixed...
2025-09-05	Jeff Bolz	vulkan: workaround MoltenVK compile failure in multi_ad...
2025-09-05	Johannes Gäßler	CUDA: fix half2 -> half conversion for HIP (llama/15529)
2025-09-05	Jeff Bolz	vulkan: optimize rms_norm, and allow the work to spread...
2025-09-05	Jeff Bolz	vulkan: Rewrite synchronization to allow some overlap...
2025-09-05	Acly	vulkan : support ggml_mean (llama/15393)
2025-09-05	Jeff Bolz	vulkan: optimize mul_mat_id loading row ids into shared...
2025-09-05	Johannes Gäßler	test-opt: allow slight inprecision (llama/15503)
2025-09-05	Reese Levine	ggml WebGPU: add support for quantization types (llama...
2025-09-05	rmatif	ggml: add `conv3d` op (llama/15182)
2025-09-05	Yavor Ivanov	cuda : add Pad Reflect 1D support (llama/14659)
2025-09-05	Aaron Teo	ggml-cpu: Support Q5_0 and Q5_1 on s390x (llama/15486)
2025-09-05	Chenguang Li	CANN: Optimize RMS_NORM using cache (llama/15419)
2025-09-05	Diego Devesa	sched : fix possible use of wrong ids tensor when offlo...
2025-09-05	Acly	vulkan : support conv_2d_dw with f16 weights (llama...
2025-09-05	Dong Won Kim	vulkan: add exp operation (llama/15456)
2025-09-05	Jeff Bolz	vulkan: Reuse conversion results in prealloc_y (llama...
2025-09-05	Xuan-Son Nguyen	ggml : fix condition of im2col on Metal backend (llama...
2025-09-05	R0CKSTAR	musa: add GGML_UNUSED_VARS (llama/15446)
2025-09-05	Diego Devesa	sched : copy only the used experts when offloading...
2025-09-05	Johannes Gäßler	CUDA: refactor FA support/selection code (llama/15454)
2025-09-05	Johannes Gäßler	CUDA: replace GGML_CUDA_F16 with CUDA arch checks ...
2025-09-05	Jeff Bolz	vulkan: shorten pipeline name strings (llama/15431)
2025-09-05	R0CKSTAR	musa: fix build warnings (llama/15258)
2025-09-05	lhez	opencl: mark `argsort` unsupported if cols exceed workg...
2025-09-05	SHUAI YANG	CANN: optimize rope operator (llama/15335)
2025-09-05	R0CKSTAR	musa: handle __hgt2_mask, available starting from MUSA...
2025-09-05	Marvin Gießing	ggml-cpu: add mxfp4 VSX intrinsics for Power9+ (ppc64le...
2025-08-28	Daniel Bevenius	ci : add github release job (#1334)
2025-08-18	Georgi Gerganov	cuda : remove obsolete sources (#1332) upstream/0.0.2471
2025-08-18	Georgi Gerganov	sync : whisper.cpp
2025-08-18	Georgi Gerganov	scripts : update sync scripts
2025-08-18	Reese Levine	ggml: Add initial WebGPU backend (llama/14521)
2025-08-18	Aaron Teo	ggml : initial zDNN backend (llama/14975)
2025-08-18	Georgi Gerganov	scripts : update sync scripts
2025-08-18	Georgi Gerganov	common : handle mxfp4 enum
2025-08-18	Georgi Gerganov	sync : llama.cpp
2025-08-18	compilade	ggml-quants : fix make_qp_quants NANs and IQ1 assertion...
2025-08-18	Jeff Bolz	vulkan: disable spirv-opt for bfloat16 shaders (llama...
2025-08-18	Jeff Bolz	vulkan: Use larger workgroups for mul_mat_vec when...
2025-08-18	Dong Won Kim	vulkan: support sqrt (llama/15370)
2025-08-18	Jeff Bolz	vulkan: Optimize argsort (llama/15354)
2025-08-18	Jeff Bolz	vulkan: fuse adds (llama/15252)
2025-08-18	Jeff Bolz	vulkan: Support mul_mat_id with f32 accumulators (llama...
2025-08-18	Jeff Bolz	vulkan: Add missing bounds checking to scalar/coopmat1...
2025-08-18	rmatif	OpenCL: add initial FA support (llama/14987)
2025-08-18	lhez	opencl: add initial mxfp4 support via mv (llama/15270)
2025-08-18	Georgi Gerganov	vulkan : fix out-of-bounds access in argmax kernel...
2025-08-18	Georgi Gerganov	vulkan : fix compile warnings on macos (llama/15340)
2025-08-18	Aaron Teo	ggml: initial IBM zDNN backend (llama/14975)
2025-08-18	Johannes Gäßler	test-opt: fix backend support check (llama/15317)
2025-08-18	Johannes Gäßler	CUDA: fix negative KV_max values in FA (llama/15321)
2025-08-18	uvos	HIP: Cleanup hipification header (llama/15285)
2025-08-18	Jeff Bolz	vulkan: perf_logger improvements (llama/15246)
2025-08-14	Jason Ni	ggml: fix ggml_conv_1d_dw bug (#1323) upstream/0.0.2446
2025-08-14	Georgi Gerganov	mnist : adapt to opt changes
2025-08-14	Georgi Gerganov	tests : remove unused includes (#0)
2025-08-14	Georgi Gerganov	sync : llama.cpp
2025-08-14	Sigbjørn Skjæret	cuda : fix GGML_CUDA_GRAPHS=OFF (llama/15300)
2025-08-14	Jonathan Graehl	finetune: SGD optimizer, more CLI args (llama/13873)
2025-08-14	uvos	HIP: bump requirement to rocm 6.1 (llama/15296)
2025-08-14	Georgi Gerganov	sync : llama.cpp
2025-08-14	Judd	ggml : update `ggml_rope_multi` (llama/12665)
2025-08-14	Georgi Gerganov	ggml : repack block_iq4_nlx8 (llama/14904)
2025-08-14	Oliver Simons	CUDA: Optimize `reduce_rows_f32` kernel, leading up...
2025-08-14	Tak-RS	ggml-rpc: chunk send()/recv() to avoid EINVAL for very...
2025-08-14	uvos	HIP: disable sync warp shuffel operators from clr amd_w...

Packaging of ggml-org/ggml
