git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog

overview / pkg / ggml / sources / ggml / shortlog

2026-02-14	Oliver Simons	CUDA: Do not mutate cgraph for fused ADDs (llama/19566)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : improve concurrency (llama/19555)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : support GGML_OP_SET (llama/19548)	commit \| commitdiff \| tree
2026-02-14	Shupei Fan	hexagon: fix typo in vtcm_needs_release (llama/19545)	commit \| commitdiff \| tree
2026-02-14	lhez	opencl: add basic support for q4_1 (llama/19534)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : update sum_rows kernel to support float4 (llama...	commit \| commitdiff \| tree
2026-02-14	Mario Limonciello	Add a workaround for compilation with ROCWMMA_FATTN...	commit \| commitdiff \| tree
2026-02-14	Max Krasnyansky	hexagon: further optimization and tuning of matmul...	commit \| commitdiff \| tree
2026-02-14	lhez	opencl: add general Q6_K mm and Q4_K mv (llama/19347)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	ggml : unary ops support non-cont src0 + metal F16...	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : extend l2_norm support for non-cont src0 (llama...	commit \| commitdiff \| tree
2026-02-14	Max Krasnyansky	hexagon: Add ARGSORT, DIV, SQR, SQRT, SUM_ROWS, GEGLU...	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	ggml : extend bin bcast for permuted src1 (llama/19484)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : consolidate unary ops (llama/19490)	commit \| commitdiff \| tree
2026-02-14	Oliver Simons	CUDA : Update CCCL-tag for 3.2 to final release from...	commit \| commitdiff \| tree
2026-02-14	Nikhil Jain	Plug memory leaks and free resources on shutdown (llama...	commit \| commitdiff \| tree
2026-02-14	Xuan-Son Nguyen	test: fix IMROPE perf test case (llama/19465)	commit \| commitdiff \| tree
2026-02-14	Alberto Cabrera...	ggml-cpu: arm64: q6_K repack gemm and gemv (and generic...	commit \| commitdiff \| tree
2026-02-14	k4ss4n	ggml : use noexcept overload for is_regular_file in...	commit \| commitdiff \| tree
2026-02-14	Raul Torres	CANN: Remove unnecessary wrapper for `gml_backend_buft_...	commit \| commitdiff \| tree
2026-02-14	hipudding	CANN: implement quantized MUL_MAT_ID for MoE models...	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	cuda : extend GGML_OP_PAD to work with non-cont src0...	commit \| commitdiff \| tree
2026-02-14	Oliver Simons	CUDA: Fix non-contig rope (llama/19338)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : consolidate bin kernels (llama/19390)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : fix event synchronization in cpy_tensor_async...	commit \| commitdiff \| tree
2026-02-07	Abhijit Ramesh	ggml-webgpu: JIT compile binary operators and handle...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-02-07	Nechama Krashinski	sycl: add F16 support for GGML_OP_CEIL (llama/19306)	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	tests: reduce number of FA test permutations (llama...	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: For coopmat2 FA, use fp16 accumulators for...	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: make FA mask/softcap enables spec constants...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : skip loading all-zero mask (llama/19337)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	cuda : cuda graphs now compare all node params (llama...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : adaptive CPU/GPU interleave based on number...	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: Preprocess FA mask to detect all-neg-inf and...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : add diag (llama/19330)	commit \| commitdiff \| tree
2026-02-07	Oleksandr Kuvshynov	vulkan: fix GPU deduplication logic. (llama/19222)	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: Set k_load_shmem to false when K is too large...	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: fix non-contig rope (llama/19299)	commit \| commitdiff \| tree
2026-02-07	will-lms	metal : add missing includes (llama/19348)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	tests : add non-cont, inplace rope tests (llama/19296)	commit \| commitdiff \| tree
2026-02-07	Kevin Pouget	ggml-virtgpu: make the code thread safe (llama/19204)	commit \| commitdiff \| tree
2026-02-07	Aman Gupta	ggml-cpu: use LUT for converting e8->f32 scales on...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : add solve_tri (llama/19302)	commit \| commitdiff \| tree
2026-02-07	Ruben Ortlam	vulkan: disable coopmat1 fa on Nvidia Turing (llama...	commit \| commitdiff \| tree
2026-02-07	Aman Gupta	CUDA: use mmvq for mul-mat-id for small batch sizes...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : minor cleanup (llama/19251)	commit \| commitdiff \| tree
2026-02-07	Oliver Simons	CUDA: Fix loop unrolling for BW in mul_mat_q_stream_k_f...	commit \| commitdiff \| tree
2026-02-07	George	ggml: added cleanups in ggml_quantize_free (llama/19278)	commit \| commitdiff \| tree
2026-02-07	Gaurav Garg	cuda : revert CUDA_SCALE_LAUNCH_QUEUES override until...	commit \| commitdiff \| tree
2026-02-07	lhez	opencl: refactor some ops, concat, repeat, tanh and...	commit \| commitdiff \| tree
2026-02-07	Aman Gupta	ggml-cpu: FA split across kv for faster TG (llama/19209)	commit \| commitdiff \| tree
2026-02-07	Neo Zhang	Remove support for Nvidia & AMD GPU, because the oneAPI...	commit \| commitdiff \| tree
2026-02-07	Tamar	sycl: implement GGML_OP_TOP_K (llama/19242)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : support virtual devices (llama/18919)	commit \| commitdiff \| tree
2026-02-07	Johannes Gäßler	ggml-backend: fix async set/get fallback sync (llama...	commit \| commitdiff \| tree
2026-02-07	Christian Kastner	docs : Minor cleanups (llama/19252)	commit \| commitdiff \| tree
2026-02-07	Nikhil Jain	Remove pipeline cache mutexes (llama/19195)	commit \| commitdiff \| tree
2026-02-07	Max Krasnyansky	Bump cmake max version (needed for Windows on Snapdrago...	commit \| commitdiff \| tree
2026-02-07	nullname	ggml-hexagon: flash-attention and reduce-sum optimizati...	commit \| commitdiff \| tree
2026-02-07	shaofeiqi	opencl: add optimized q8_0 mm kernel for adreno (llama...	commit \| commitdiff \| tree
2026-02-07	Simon Redman	Correctly fetch q8_1 quantize pipeline in test as neede...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	tests : add GQA=20 FA test (llama/19095)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	ci : remove "Release" word from the title of the release	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	ggml : bump version to 0.9.6 (#1423) v0.9.6	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	cmake : remove unused file (#1419)	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	sync : whisper.cpp	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	cuda : fix compile warnings (whisper/0)	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-01-30	bssrdf	add tensor type checking as part of cuda graph properti...	commit \| commitdiff \| tree
2026-01-30	s8322	sycl: implement GGML_UNARY_OP_SOFTPLUS (llama/19114)	commit \| commitdiff \| tree
2026-01-30	RachelMantel	sycl: implement GGML_OP_TRI (llama/19089)	commit \| commitdiff \| tree
2026-01-30	Zheyuan Chen	ggml-webgpu: improve flastAttention performance by...	commit \| commitdiff \| tree
2026-01-30	Todor Boinovski	hexagon: enable offloading to Hexagon on Windows on...	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	cuda : fix nkvo, offload and cuda graph node properties...	commit \| commitdiff \| tree
2026-01-30	yulo	HIP: add mmf for CDNA (llama/18896)	commit \| commitdiff \| tree
2026-01-30	Vishal Singh	ggml-zendnn : resolve ZenDNN backend cross-module symbo...	commit \| commitdiff \| tree
2026-01-30	Aman Gupta	CUDA: refactor topk-moe to enable more models (GLM...	commit \| commitdiff \| tree
2026-01-30	Neo Zhang	sycl: fix norm kernels: l2_norm, group_norm, rms_norm...	commit \| commitdiff \| tree
2026-01-30	Ruben Ortlam	Vulkan Flash Attention Coopmat1 Refactor (llama/19075)	commit \| commitdiff \| tree
2026-01-30	Patryk Kaminski	ggml-sycl: remove unused syclcompat header (llama/19140)	commit \| commitdiff \| tree
2026-01-30	Oleksandr Kuvshynov	vulkan: handle device dedup on MacOS + Vega II Duo...	commit \| commitdiff \| tree
2026-01-30	Kevin Pouget	ggml: new backend for Virglrenderer API Remoting accele...	commit \| commitdiff \| tree
2026-01-30	Alberto Cabrera...	ggml-cpu: arm64: Q4_K scale unroll and vectorization...	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	cuda : fix "V is K view" check for non-unified KV cache...	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	CUDA: tune GLM 4.7 Flash FA kernel selection logic...	commit \| commitdiff \| tree
2026-01-30	Nikhil Jain	ggml webgpu: Split shared state (webgpu_context) into...	commit \| commitdiff \| tree
2026-01-30	Vishal Singh	ggml-zendnn : update ZenDNN git tag to main branch...	commit \| commitdiff \| tree
2026-01-30	Johannes Gäßler	CUDA: tune GLM 4.7 Flash FA kernel selection logic...	commit \| commitdiff \| tree
2026-01-30	Alberto Cabrera...	ggml-cpu: aarm64: q6_K repack gemm and gemv (and generi...	commit \| commitdiff \| tree
2026-01-30	Gaurav Garg	Reduce CPU-side stalls due to the CUDA command buffer...	commit \| commitdiff \| tree
2026-01-30	shalinib-ibm	ggml-cpu: Enable FP16 MMA kernels on PPC (llama/19060)	commit \| commitdiff \| tree
2026-01-30	lhez	opencl: add flattened q6_K mv (llama/19054)	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-01-30	Johannes Gäßler	CUDA: fix padding of GQA to power of 2 in FA (llama...	commit \| commitdiff \| tree
2026-01-30	Johannes Gäßler	CUDA: faster FA for GQA > 1 but not power of 2 (llama...	commit \| commitdiff \| tree
2026-01-30	ccbinn	metal : fix recommendedMaxWorkingSetSize availability...	commit \| commitdiff \| tree
2026-01-30	Aman Gupta	ggml-cpu: Use tiled FA for prompt-processing (llama...	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	kv-cache : support V-less cache (llama/19067)	commit \| commitdiff \| tree
next

Packaging of ggml-org/ggml

RSS Atom