git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog

overview / pkg / ggml / sources / whisper.cpp / shortlog

2025-09-20	Aman Gupta	CUDA: some micro-optimizations in mmf.cuh for mul_mat_i...	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	metal : remove memory pools (llama/15966)	commit \| commitdiff \| tree
2025-09-20	Ruben Ortlam	Vulkan: Clean up mul_mm shader (llama/15987)	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	metal : fix kernel requirements (llama/15983)	commit \| commitdiff \| tree
2025-09-20	Aaron Teo	ggml-zdnn: rm user mapped buffers (llama/15965)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: fix failing dequant shaders (llama/15862)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: initialize vulkan-hpp to allow using extension...	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	metal : refactor kernel loading (llama/15964)	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	metal : allow ops to run concurrently (llama/15929)	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	metal : fix memory leaks (llama/15962)	commit \| commitdiff \| tree
2025-09-20	Aaron Teo	ggml-zdnn: fix #15414, activate FP16 and BF16 accelerat...	commit \| commitdiff \| tree
2025-09-20	Ruben Ortlam	Vulkan iGPU device selection overhaul and PCI ID API...	commit \| commitdiff \| tree
2025-09-20	Mathieu Baudier	vulkan: Make device memory check more portable (llama...	commit \| commitdiff \| tree
2025-09-20	Neo Zhang Jianyu	Revert "sycl: add usage of enqueue_functions extension...	commit \| commitdiff \| tree
2025-09-20	Diego Devesa	ggml-backend : add GGML_BACKEND_DEVICE_TYPE_IGPU device...	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	CUDA: larger SRAM reads for tile FA, AMD FP16 dot ...	commit \| commitdiff \| tree
2025-09-20	Daniel Bevenius	ggml-cpu : add check for ARM MATMUL_INT8/i8mm support...	commit \| commitdiff \| tree
2025-09-20	Charles Xu	kleidiai: fix GGML_ASSERT(*cur_backend_id != -1) failed...	commit \| commitdiff \| tree
2025-09-20	hipudding	CANN: Disable acl_graph for prefill stage (llama/15933)	commit \| commitdiff \| tree
2025-09-20	Oliver Simons	CUDA: Add `fastdiv` to `k_bin_bcast*`, giving 1-3%...	commit \| commitdiff \| tree
2025-09-20	Daniel Bevenius	ggml-cpu : fix padding in ggml_timestep_embedding ...	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	metal : make the backend async (llama/15906)	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-09-20	Chenguang Li	CANN: Add ROPE sin/cos cache for reuse (llama/15912)	commit \| commitdiff \| tree
2025-09-20	Chenguang Li	CANN: implement LRU cache for ACL graphs (llama/15814)	commit \| commitdiff \| tree
2025-09-20	Ruben Ortlam	vulkan: throw the oom error instead of no memory type...	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: Fix OOB accesses in soft_max_back (llama/15861)	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	HIP: use v_dot2_f32_f16 instruction for FA (llama/15884)	commit \| commitdiff \| tree
2025-09-20	lksj92hs	Workaround for subgroup arithmetic failing on MoltenVK...	commit \| commitdiff \| tree
2025-09-20	Aman Gupta	CUDA: Add mul_mat_id support for the mmf kernel (llama...	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	CUDA: fix GET_ROWS for large tensors (llama/15882)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: sort graph to allow more parallel execution...	commit \| commitdiff \| tree
2025-09-20	Aman Gupta	CUDA: generate_cu_files.py - add missing mxfp4 (llama...	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	cuda : fix supports_op condition for get_rows when...	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	metal : refactor + optimize (llama/15857)	commit \| commitdiff \| tree
2025-09-20	Xuan-Son Nguyen	ggml: allow casting between f32 and i32 (llama/15783)	commit \| commitdiff \| tree
2025-09-20	Sigbjørn Skjæret	CUDA: non-contiguous src0 not supported for PAD (llama...	commit \| commitdiff \| tree
2025-09-20	Chenguang Li	CANN: Stream sync between devices for acl_graph (llama...	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: support im2col_3d (llama/15795)	commit \| commitdiff \| tree
2025-09-20	Aaron Teo	ggml-cpu: clean up s390x SIMD (llama/15855)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: Support pad_ext (llama/15794)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: Use larger loads in scalar/coopmat1 matmul...	commit \| commitdiff \| tree
2025-09-20	Daniel Bevenius	ggml WebGPU: remove userdata from request adapter callb...	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	CUDA: faster tile FA (Pascal/AMD), headsize 256 (llama...	commit \| commitdiff \| tree
2025-09-20	Charles Xu	kleidiai: generalize compute_forward_kv_cache to comput...	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	ggml-cpu: document use of "free" memory [no ci] (llama...	commit \| commitdiff \| tree
2025-09-20	Aaron Teo	ggml-cpu: drop support for nnpa intrinsics (llama/15821)	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	CUDA: fastdiv, launch bounds for mmvq + q8_1 quant...	commit \| commitdiff \| tree
2025-09-20	Daniel Bevenius	ggml : introduce semantic versioning (ggml/1336)	commit \| commitdiff \| tree
2025-09-20	Gregor Jasny	CUDA : conditionally add cuda architectures (ggml/1341)	commit \| commitdiff \| tree
2025-09-20	Gabe Goodhart	metal : Add template specialization for mul_mm_id w...	commit \| commitdiff \| tree
2025-09-20	Chenguang Li	CANN: Refactor ND to NZ workspace to be per-device...	commit \| commitdiff \| tree
2025-09-20	leejet	ggml: add ops for WAN video model (cuda && cpu) (llama...	commit \| commitdiff \| tree
2025-09-20	hipudding	CANN: Fix precision issue on 310I DUO multi-devices...	commit \| commitdiff \| tree
2025-09-20	rmatif	opencl: add hs=40 to FA (llama/15758)	commit \| commitdiff \| tree
2025-09-20	Chenguang Li	CANN: fix acl_rstd allocation size in ggml_cann_rms_nor...	commit \| commitdiff \| tree
2025-09-20	Ruben Ortlam	vulkan: fix mmv subgroup16 selection (llama/15775)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: don't use std::string in load_shaders, to impro...	commit \| commitdiff \| tree
2025-09-20	Daniel Bevenius	vulkan : update ggml_vk_instance_validation_ext_availab...	commit \| commitdiff \| tree
2025-09-20	Shin-myoung...	ggml vulkan: add hardsigmoid and hardswish operations...	commit \| commitdiff \| tree
2025-09-20	Oliver Simons	CUDA: Optimize `rms_norm_f32` kernel and its fused...	commit \| commitdiff \| tree
2025-09-20	hipudding	CANN: Add RoPE contiguous check for 310I DUP device...	commit \| commitdiff \| tree
2025-09-20	xctan	ggml-cpu : optimize RVV kernels (llama/15720)	commit \| commitdiff \| tree
2025-09-20	hipudding	CANN: Mask unsupported TRANSPOSE_1D operator (llama...	commit \| commitdiff \| tree
2025-09-20	Chenguang Li	CANN: Fix type float_t to float (llama/15736)	commit \| commitdiff \| tree
2025-09-20	Ruben Ortlam	vulkan: fix shaders gen when no integer dot is availabl...	commit \| commitdiff \| tree
2025-09-20	hipudding	CANN: Resolve soft_max precision issue (llama/15730)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: Fix macro parameter order for f32 matmul shader...	commit \| commitdiff \| tree
2025-09-20	rmatif	opencl: add attn sinks support for FA kernels (llama...	commit \| commitdiff \| tree
2025-09-20	Chenguang Li	CANN: Support eager execution mode under ACL graph...	commit \| commitdiff \| tree
2025-09-20	hipudding	CANN: Support ext_factor in rope (llama/15710)	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	ggml-backend: raise GGML_MAX_SPLIT_INPUTS (llama/15722)	commit \| commitdiff \| tree
2025-09-20	Gilad S	vulkan: use memory budget extension to read memory...	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: add missing clamps in new mul_mat_id paths...	commit \| commitdiff \| tree
2025-09-20	Ruben Ortlam	vulkan: disable large mmv subgroups on older Nvidia...	commit \| commitdiff \| tree
2025-09-20	s-goto-11	ggml: SVE support for exponential functions (llama...	commit \| commitdiff \| tree
2025-09-20	Prashant Vithule	ggml: aarch64: Implement SVE F16 kernels for vector...	commit \| commitdiff \| tree
2025-09-20	Ruben Ortlam	Vulkan: Add Integer Dot Product mul_mat_vec shader...	commit \| commitdiff \| tree
2025-09-20	Daniel Bevenius	ggml : WebGPU add TRANSPOSE and RESHAPE to supported...	commit \| commitdiff \| tree
2025-09-20	Akarshan Biswas	CUDA: fix build error from ambiguous __half conversions...	commit \| commitdiff \| tree
2025-09-20	hipudding	CANN: Optimize MUL_MAT_ID (llama/15658)	commit \| commitdiff \| tree
2025-09-20	hipudding	CANN: fix RoPE cache issue on multi-device (llama/15629)	commit \| commitdiff \| tree
2025-09-20	Georgi Gerganov	metal : fix checks for available FA kernels (llama...	commit \| commitdiff \| tree
2025-09-20	Diego Devesa	llama : separate compute buffer reserve from fattn...	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: handle large sizes for get_rows (llama/15686)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: mul_mat_id coopmat2 optimizations (llama/15546)	commit \| commitdiff \| tree
2025-09-20	Daniel Bevenius	vulkan : remove unused portability_enumeration_ext...	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: Allow fallback to sysmem memory when vidmem...	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: clamp matmul and FA results to the max finite...	commit \| commitdiff \| tree
2025-09-20	Charles Xu	ggml: update kleidiai to v1.13.0 (llama/15663)	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	llama: use FA + max. GPU layers by default (llama/15434)	commit \| commitdiff \| tree
2025-09-20	Johannes Gäßler	CUDA: use FP32 arithmetic for conv2d (llama/15683)	commit \| commitdiff \| tree
2025-09-20	Jeff Bolz	vulkan: Skip syncing for prealloc_y when it is reused...	commit \| commitdiff \| tree
2025-09-20	Chenguang Li	CANN: FIx compiler warnings (llama/15661)	commit \| commitdiff \| tree
2025-09-20	Aman Gupta	CUDA: fix bug in rms_norm fusion (llama/15660)	commit \| commitdiff \| tree
2025-09-20	Aman Gupta	CUDA: fuse adds, fuse add with rms norm (llama/15631)	commit \| commitdiff \| tree
2025-09-20	mnehete32	CUDA: add conv2d (llama/15635)	commit \| commitdiff \| tree
2025-09-20	Aaron Teo	ggml-cpu: fix invalid hsum build in debug s390x (llama...	commit \| commitdiff \| tree
2025-09-20	compilade	ggml : fix SSM_SCAN for n_groups > 1 (llama/15625)	commit \| commitdiff \| tree
next

Packaging of ggerganov/whisper.cpp

RSS Atom