git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/shortlog

overview / pkg / ggml / sources / whisper.cpp / shortlog

2026-01-30	Aman Gupta	ggml-cpu: Use tiled FA for prompt-processing (llama...	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	kv-cache : support V-less cache (llama/19067)	commit \| commitdiff \| tree
2026-01-30	Johannes Gäßler	CUDA: re-use MLA K data for V in MMA FA (llama/19057)	commit \| commitdiff \| tree
2026-01-30	Aman Gupta	ggml-cuda: enable cuda-graphs for `n-cpu-moe` (llama...	commit \| commitdiff \| tree
2026-01-30	nullname	ggml-hexagon: flash-attn opt (llama/19025)	commit \| commitdiff \| tree
2026-01-30	Neo Zhang	use malloc to support both iGPU and dGPU in same time...	commit \| commitdiff \| tree
2026-01-30	Alberto Cabrera...	ggml-cpu: aarm64: q5_K repack gemm and gemv (and generi...	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	mla : make the V tensor a view of K (llama/18986)	commit \| commitdiff \| tree
2026-01-30	Johannes Gäßler	CUDA: fix alignment check for FA (llama/19023)	commit \| commitdiff \| tree
2026-01-30	lhez	opencl: enable the general fp mm for non-cont input...	commit \| commitdiff \| tree
2026-01-30	Aman Gupta	CUDA: add gqa_ratio 4 for GLM 4.7 flash (llama/18953)	commit \| commitdiff \| tree
2026-01-30	shaofeiqi	opencl: add TRI op support (llama/18979)	commit \| commitdiff \| tree
2026-01-30	Aleksei Nikiforov	ggml-zdnn : mark zDNN buffers as non-host (llama/18967)	commit \| commitdiff \| tree
2026-01-30	Jeff Bolz	vulkan: Remove transfer_ctx, do everything in compute_c...	commit \| commitdiff \| tree
2026-01-30	Jeff Bolz	vulkan: support flash attention GQA/split_k with small...	commit \| commitdiff \| tree
2026-01-30	Masato Nakasaka	Revert "vulkan: force full subgroups for flash attentio...	commit \| commitdiff \| tree
2026-01-30	Jeff Bolz	vulkan: Use mul_mat_vec_id for small values of n (llama...	commit \| commitdiff \| tree
2026-01-30	Oliver Simons	CUDA: Fix builds for older CCCL versions by ifdefing...	commit \| commitdiff \| tree
2026-01-30	Oliver Simons	CUDA: Replace init_offsets kernel with iterators in...	commit \| commitdiff \| tree
2026-01-30	Adrien Gallouët	ggml : cleanup path_str() (llama/18928)	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	metal : enable FA for MLA heads (llama/18950)	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	ggml : add ggml_build_forward_select (llama/18550)	commit \| commitdiff \| tree
2026-01-30	lhez	opencl: fix q6_K mv for m=1 (llama/18893)	commit \| commitdiff \| tree
2026-01-30	Reese Levine	ggml webgpu: support for backend sampling (llama/18880)	commit \| commitdiff \| tree
2026-01-30	Thore Koritzius	ggml : extend ggml_pool_1d + metal (llama/16429)	commit \| commitdiff \| tree
2026-01-30	Perry Naseck	ggml-blas: hide warnings from included BLAS headers...	commit \| commitdiff \| tree
2026-01-30	Raul Torres	CANN: Remove unused `ggml_cann_get_device` function...	commit \| commitdiff \| tree
2026-01-30	Chenguang Li	CANN: fix an issue where get_env was not fully renamed...	commit \| commitdiff \| tree
2026-01-30	hipudding	CANN: support gated linear attn (llama/18653)	commit \| commitdiff \| tree
2026-01-30	shaofeiqi	OpenCL: add SOLVE_TRI op support (llama/18846)	commit \| commitdiff \| tree
2026-01-30	Georgi Gerganov	cuda : print less debug logs when disabling cuda graphs...	commit \| commitdiff \| tree
2026-01-30	Johannes Gäßler	CUDA: fix allignment on register spill for FA (llama...	commit \| commitdiff \| tree
2026-01-30	shalinib-ibm	ggml-cpu: optimize ggml_vec_dot_bf16 for Power9 (llama...	commit \| commitdiff \| tree
2026-01-30	Max Krasnyansky	hexagon: support for OP_CPY, host buffers now optional...	commit \| commitdiff \| tree
2026-01-30	Oliver Simons	CUDA: Factor out and re-use `block_reduce` function...	commit \| commitdiff \| tree
2026-01-30	Jeff Bolz	vulkan: Check maxStorageBufferRange in supports_op...	commit \| commitdiff \| tree
2026-01-30	Daniel Bevenius	CUDA : fix typo in clang pragma comment [no ci] (llama...	commit \| commitdiff \| tree
2026-01-30	Ruben Ortlam	vulkan: work around Intel fp16 bug in mmq (llama/18814)	commit \| commitdiff \| tree
2026-01-30	Perry Naseck	ggml-metal: do not copy headers for embedded, use curre...	commit \| commitdiff \| tree
2026-01-30	yulo	HIP: add fattn-mma-f16 for RDNA4 (llama/18481)	commit \| commitdiff \| tree
2026-01-21	Bráulio Oliveira	examples : use -dev/--device and WHISPER_ARG_DEVICE...	commit \| commitdiff \| tree
2026-01-16	Yshtola	whisper : Fix UTF-8 character boundary issue in segment...	commit \| commitdiff \| tree
2026-01-15	Georgi Gerganov	release : v1.8.3 upstream/1.8.3	commit \| commitdiff \| tree
2026-01-15	Georgi Gerganov	benches : update	commit \| commitdiff \| tree
2026-01-14	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2026-01-14	Georgi Gerganov	CUDA : fix unused argument when USE_CUDA_GRAPH=OFF...	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: change memory_logger to be controlled by an...	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: Use VK_EXT_shader_64bit_indexing to handle...	commit \| commitdiff \| tree
2026-01-14	Ruben Ortlam	vulkan: Disable large coopmat matmul configuration...	commit \| commitdiff \| tree
2026-01-14	Ruben Ortlam	Vulkan: Optimize Matmul parameters for AMD GPUs with...	commit \| commitdiff \| tree
2026-01-14	Georgi Gerganov	talk-llama : sync llama.cpp	commit \| commitdiff \| tree
2026-01-14	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2026-01-14	shaofeiqi	opencl: add SOFTPLUS op support (llama/18726)	commit \| commitdiff \| tree
2026-01-14	Johannes Gäßler	HIP: adjust RDNA3.5 MMQ kernel selction logic (llama...	commit \| commitdiff \| tree
2026-01-14	Perry Naseck	cmake : update blas logic (llama/18205)	commit \| commitdiff \| tree
2026-01-14	Michael Wand	Corrected: changed s13 = src1->nb[3] instead of nb...	commit \| commitdiff \| tree
2026-01-14	shaofeiqi	opencl: add EXPM1 op (llama/18704)	commit \| commitdiff \| tree
2026-01-14	Reese Levine	Updates to webgpu get_memory (llama/18707)	commit \| commitdiff \| tree
2026-01-14	Aaron Teo	llama: use host memory if device reports 0 memory ...	commit \| commitdiff \| tree
2026-01-14	Masashi Yoshimura	ggml-webgpu: Fix GGML_MEM_ALIGN to 8 for emscripten...	commit \| commitdiff \| tree
2026-01-14	Reese Levine	ggml webgpu: initial flashattention implementation...	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: fix push constant size for quantize_q8_1 (llama...	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: optimize ssm_scan (llama/18630)	commit \| commitdiff \| tree
2026-01-14	도로로도로또	metal : add MoE kernel specialization for ne20=5 (llama...	commit \| commitdiff \| tree
2026-01-14	Doctor Shotgun	ggml: add env var GGML_OP_OFFLOAD_MIN_BATCH (llama...	commit \| commitdiff \| tree
2026-01-14	shaofeiqi	opencl: add FILL op support (llama/18682)	commit \| commitdiff \| tree
2026-01-14	Oliver Walsh	cuda : fix build on cuda 12.8 (llama/18672)	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: reject ops when a tensor is too large to alloca...	commit \| commitdiff \| tree
2026-01-14	virajwad	vulkan: Warptile tuning for Intel Xe2/Xe3 (llama/18178)	commit \| commitdiff \| tree
2026-01-14	Eve	vulkan: more mul mat optimizations (llama/18533)	commit \| commitdiff \| tree
2026-01-14	hipudding	CANN: Fix rename for get_env (llama/18652)	commit \| commitdiff \| tree
2026-01-14	Raul Torres	CANN: Rename `get_env` to `get_env_as_lowercase` (llama...	commit \| commitdiff \| tree
2026-01-14	Max Krasnyansky	Hexagon add support for f16/f32 flash attention, scale...	commit \| commitdiff \| tree
2026-01-14	Aadeshveer...	ggml : optimize cuda ssm_scan using warp-level reductio...	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: support buffer_from_host_ptr (llama/18467)	commit \| commitdiff \| tree
2026-01-14	Aman Gupta	ggml-cuda: refactor cuda graph usage (llama/18637)	commit \| commitdiff \| tree
2026-01-14	Beinsezii	mmq.cu: tune mmq/rocblas switching for RDNA (llama...	commit \| commitdiff \| tree
2026-01-14	Adrien Gallouët	ggml : fix avx512bf16 build (llama/18623)	commit \| commitdiff \| tree
2026-01-14	Raul Torres	CANN: Make `valid_values` variable `static const` ...	commit \| commitdiff \| tree
2026-01-14	nwyin	ggml webgpu: add CEIL operation support (llama/18605)	commit \| commitdiff \| tree
2026-01-14	Johannes Gäßler	CUDA: fix FA FP16 accumulator overflow for Granite...	commit \| commitdiff \| tree
2026-01-14	Aman Gupta	ggml-cuda: check for srcs outside the cgraph (llama...	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: fix topk_moe_sigmoid_norm_bias failures in...	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: handle quantize_q8_1 overflowing the max workgr...	commit \| commitdiff \| tree
2026-01-14	Chenguang Li	CANN: add operator fusion support for ADD + RMS_NORM...	commit \| commitdiff \| tree
2026-01-14	Daniel Bevenius	sampling : add support for backend sampling (llama...	commit \| commitdiff \| tree
2026-01-14	Aman Gupta	CUDA: disable cuda graph when using n-cpu-moe (llama...	commit \| commitdiff \| tree
2026-01-14	Aman Gupta	ggml-cuda: remove unused params in ggml_cuda_graph...	commit \| commitdiff \| tree
2026-01-14	Aman Gupta	ggml-cuda: fixes for concurrent streams (llama/18496)	commit \| commitdiff \| tree
2026-01-14	Johannes Gäßler	CUDA: only allocate FA tmp buffer if needed (llama...	commit \| commitdiff \| tree
2026-01-14	pl752	(Bugfix, ggml-cuda) Pool alloc count fix + small size...	commit \| commitdiff \| tree
2026-01-14	Shouyu	ggml-hexagon: optimize activation function (llama/18393)	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: Optimize GGML_OP_CUMSUM (llama/18417)	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: Implement mmvq for iq1_s/iq1_m (llama/18450)	commit \| commitdiff \| tree
2026-01-14	Georgi Gerganov	metal : adjust extra size for FA buffer to avoid reallo...	commit \| commitdiff \| tree
2026-01-14	Chris Rohlf	rpc : use unordered_map::reserve and emplace (llama...	commit \| commitdiff \| tree
2026-01-14	MeeMin	cuda : fix copy of large tensors (ggml_nbytes <= INT_MA...	commit \| commitdiff \| tree
2026-01-14	Aman Gupta	ggml-cuda: remove unneccesary prints on ggml_cuda_init...	commit \| commitdiff \| tree
2026-01-14	Jeff Bolz	vulkan: extend topk_moe to handle sigmoid w/exp_probs_b...	commit \| commitdiff \| tree
2026-01-13	Peter A.	examples : fix executable example targets (#3600)	commit \| commitdiff \| tree
next

Packaging of ggerganov/whisper.cpp

RSS Atom