git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
2025-09-25 hebangwen  examples : fix typo mismatch in gpt (#1349)
2025-09-25 Daniel Bevenius  ggml : bump version to 0.9.3 (#1353) v0.9.3
2025-09-25 Daniel Bevenius  scripts : refactor release script into prepare and...
2025-09-25 Daniel Bevenius  scripts : fix next dev version calculation [no ci]...
2025-09-25 Georgi Gerganov  sync : llama.cpp
2025-09-25 Georgi Gerganov  metal : fuse NORM + MUL + ADD, support non-multiples...
2025-09-25 Georgi Gerganov  metal : relax reorder conditions (llama/16216)
2025-09-25 Georgi Gerganov  metal : restore im2col perf (llama/16219)
2025-09-25 Georgi Gerganov  sync : llama.cpp
2025-09-25 Radoslav Gerganov  rpc : use ggml logging facilities
2025-09-25 Eve  ci: run the x64 and arm ci on the github machines inste...
2025-09-25 Johannes Gäßler  llama: print memory breakdown on exit (llama/15860)
2025-09-25 Acly  ggml : split graph allocations according to backend...
2025-09-25 Xiangyan Sun  ggml-cpu: Respect cpumask settings (llama/16164)
2025-09-25 Sigbjørn Skjæret  ggml : fix uninitialized is_on_grid in quantize_row_iq3...
2025-09-25 Aaron Teo  zdnn: refactor codebase + add docs (llama/16178)
2025-09-25 Daniel Bevenius  ggml-cpu : fix typo in gemm comments [no ci] (llama...
2025-09-25 Sigbjørn Skjæret  ggml : implement set_rows with i32 index (llama/16159)
2025-09-25 Georgi Gerganov  ggml : extend ggml_can_fuse to work with non-sequential...
2025-09-25 Georgi Gerganov  ggml : add ggml_op_is_empty (llama/16122)
2025-09-25 Shin-myoung...  Vulkan: add conv_transpose_2d operation (llama/16022)
2025-09-25 Jeff Bolz  vulkan: add RTE variants of exp shader (llama/16165)
2025-09-25 Ruben Ortlam  vulkan: vec dot matrix multiplication fix (llama/16151)
2025-09-25 lhez  opencl: fix concat crash on win arm64 with Adreno ...
2025-09-25 lhez  opencl: initial `q8_0` mv support (llama/15732)
2025-09-25 Giuseppe Scrivano  vulkan: optimize UMA buffer operations and fix driver...
2025-09-25 Jeff Bolz  vulkan: fix validation error about VK_PIPELINE_CREATE_C...
2025-09-20 Georgi Gerganov  ggml : prepare for development of 0.9.2-dev
2025-09-20 Georgi Gerganov  ggml : bump version to 0.9.1
2025-09-20 Georgi Gerganov  scripts : fix sed usage to work on Mac (#1345)
2025-09-20 Georgi Gerganov  tests : adjust to new timestep_embedding operator
2025-09-20 Georgi Gerganov  sync : llama.cpp
2025-09-20 Ruben Ortlam  vulkan: use vec dot for matrix matrix multiplications...
2025-09-20 Xuan-Son Nguyen  ggml : refactor forward_dup for cpu backend (llama...
2025-09-20 Adrien Gallouët  ggml-amx : fix ggml_amx_init() on generic Linux (llama...
2025-09-20 Adrien Gallouët  cmake : fix static linking for OpenMP on Unix-like...
2025-09-20 Shawn Gu  opencl: optimize mxfp4 kernels (llama/16037)
2025-09-20 Jeff Bolz  rename optimize_graph to graph_optimize (llama/16082)
2025-09-20 Bowen Han  CUDA: Optimize PAD_REFLECT_1D (llama/15957)
2025-09-20 Johannes Gäßler  CUDA: fix compilation on CC 6.0 (llama/16091)
2025-09-20 Georgi Gerganov  metal : use function constants for mul_mv_ext kernels...
2025-09-20 Sigbjørn Skjæret  cuda : add missing F32<->I32 entries in ggml_cuda_cpy_f...
2025-09-20 Georgi Gerganov  metal : improve F32, F16 and BF16 mat-vec multiplicatio...
2025-09-20 Jhen-Jie Hong  metal : avoid call free for non-owned buffer (llama...
2025-09-20 Georgi Gerganov  metal : handle nil cv during pipeline creation (llama...
2025-09-20 Chenguang Li  CANN: Remove print (llama/16044)
2025-09-20 Reese Levine  GGML WebGPU: Support for ADD, MUL, RMS_NORM, GET_ROWS...
2025-09-20 Georgi Gerganov  metal : refactor + optimize v2 (llama/15995)
2025-09-20 Georgi Gerganov  sync : llama.cpp
2025-09-20 Johannes Gäßler  CUDA: fix FA occupancy, optimize tile kernel (llama...
2025-09-20 Eve  vulkan: automatically remove unsupported devices (llama...
2025-09-20 Chenguang Li  CANN: Optimize ggml_cann_set_device (llama/15935)
2025-09-20 Daniel Bevenius  ggml : fix padding in timestep embedding kernels (llama...
2025-09-20 Jake Karnes  CUDA: fix im2col_3d to respect non-contiguous inputs...
2025-09-20 yael-works  SYCL: Add COUNT_EQUAL operator support (llama/15991)
2025-09-20 Aman Gupta  CUDA: some micro-optimizations in mmf.cuh for mul_mat_i...
2025-09-20 Georgi Gerganov  metal : remove memory pools (llama/15966)
2025-09-20 Ruben Ortlam  Vulkan: Clean up mul_mm shader (llama/15987)
2025-09-20 Georgi Gerganov  metal : fix kernel requirements (llama/15983)
2025-09-20 Aaron Teo  ggml-zdnn: rm user mapped buffers (llama/15965)
2025-09-20 Jeff Bolz  vulkan: fix failing dequant shaders (llama/15862)
2025-09-20 Jeff Bolz  vulkan: initialize vulkan-hpp to allow using extension...
2025-09-20 Georgi Gerganov  metal : refactor kernel loading (llama/15964)
2025-09-20 Georgi Gerganov  metal : allow ops to run concurrently (llama/15929)
2025-09-20 Georgi Gerganov  metal : fix memory leaks (llama/15962)
2025-09-20 Aaron Teo  ggml-zdnn: fix #15414, activate FP16 and BF16 accelerat...
2025-09-20 Ruben Ortlam  Vulkan iGPU device selection overhaul and PCI ID API...
2025-09-20 Mathieu Baudier  vulkan: Make device memory check more portable (llama...
2025-09-20 Neo Zhang Jianyu  Revert "sycl: add usage of enqueue_functions extension...
2025-09-20 Diego Devesa  ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device...
2025-09-20 Johannes Gäßler  CUDA: larger SRAM reads for tile FA, AMD FP16 dot ...
2025-09-20 Daniel Bevenius  ggml-cpu : add check for ARM MATMUL_INT8/i8mm support...
2025-09-20 Charles Xu  kleidiai: fix GGML_ASSERT(*cur_backend_id != -1) failed...
2025-09-20 hipudding  CANN: Disable acl_graph for prefill stage (llama/15933)
2025-09-20 Oliver Simons  CUDA: Add `fastdiv` to `k_bin_bcast*`, giving 1-3%...
2025-09-20 Daniel Bevenius  ggml-cpu : fix padding in ggml_timestep_embedding ...
2025-09-20 Georgi Gerganov  sync : llama.cpp
2025-09-20 Georgi Gerganov  metal : make the backend async (llama/15906)
2025-09-20 Georgi Gerganov  sync : llama.cpp
2025-09-20 Daniel Bevenius  tests : filter out no-ops from coverage report (llama...
2025-09-20 Chenguang Li  CANN: Add ROPE sin/cos cache for reuse (llama/15912)
2025-09-20 Chenguang Li  CANN: implement LRU cache for ACL graphs (llama/15814)
2025-09-20 Ruben Ortlam  vulkan: throw the oom error instead of no memory type...
2025-09-20 Jeff Bolz  vulkan: Fix OOB accesses in soft_max_back (llama/15861)
2025-09-20 Johannes Gäßler  HIP: use v_dot2_f32_f16 instruction for FA (llama/15884)
2025-09-20 lksj92hs  Workaround for subgroup arithmetic failing on MoltenVK...
2025-09-20 Aman Gupta  CUDA: Add mul_mat_id support for the mmf kernel (llama...
2025-09-20 Johannes Gäßler  CUDA: fix GET_ROWS for large tensors (llama/15882)
2025-09-20 Jeff Bolz  vulkan: sort graph to allow more parallel execution...
2025-09-20 Aman Gupta  CUDA: generate_cu_files.py - add missing mxfp4 (llama...
2025-09-20 Georgi Gerganov  cuda : fix supports_op condition for get_rows when...
2025-09-20 Georgi Gerganov  sync : llama.cpp
2025-09-20 Georgi Gerganov  metal : refactor + optimize (llama/15857)
2025-09-20 Georgi Gerganov  sync : llama.cpp
2025-09-20 Xuan-Son Nguyen  ggml: allow casting between f32 and i32 (llama/15783)
2025-09-20 Sigbjørn Skjæret  CUDA: non-contiguous src0 not supported for PAD (llama...
2025-09-20 Jeff Bolz  tests: large sizes for get_rows (llama/15687)
2025-09-20 Chenguang Li  CANN: Stream sync between devices for acl_graph (llama...
2025-09-20 Jeff Bolz  vulkan: support im2col_3d (llama/15795)
2025-09-20 Aaron Teo  ggml-cpu: clean up s390x SIMD (llama/15855)