git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog

overview / pkg / ggml / sources / ggml / shortlog

2026-02-27	Kevin Pouget	ggml-virtgpu: improve the reliability of the code ...	commit \| commitdiff \| tree
2026-02-27	Neo Zhang	support permuted, remove check s0/s10 (llama/19889)	commit \| commitdiff \| tree
2026-02-27	Jeff Bolz	vulkan: check for memory overlap before doing fusion...	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	ggml/gguf : prevent integer overflows (llama/19856)	commit \| commitdiff \| tree
2026-02-25	Ruben Ortlam	Vulkan Scalar Flash Attention Refactor (llama/19625)	commit \| commitdiff \| tree
2026-02-25	Jeff Bolz	vulkan: fix coopmat1 without bf16 support (llama/19793)	commit \| commitdiff \| tree
2026-02-25	Jeff Bolz	vulkan: fix data race in mul_mat_id shader (llama/19790)	commit \| commitdiff \| tree
2026-02-25	Max Krasnyansky	hexagon refactor all Ops to use local context struct...	commit \| commitdiff \| tree
2026-02-25	Alberto Cabrera...	ggml-cpu: arm64: q5_K repack gemm and gemv (and generic...	commit \| commitdiff \| tree
2026-02-25	Gaurav Garg	Improve CUDA graph capture (llama/19754)	commit \| commitdiff \| tree
2026-02-25	Taimur Ahmad	ggml-cpu: add RVV vec dot kernels for quantization...	commit \| commitdiff \| tree
2026-02-25	Jeff Bolz	test: mul_mat tests with huge batch size (llama/19519)	commit \| commitdiff \| tree
2026-02-25	Masashi Yoshimura	ggml-webgpu: Add unary op (SQR, SQRT, SIN, COS) support...	commit \| commitdiff \| tree
2026-02-25	Ruben Ortlam	vulkan: fix MMQ shader push constants and multi-dispatc...	commit \| commitdiff \| tree
2026-02-25	Johannes Gäßler	CUDA: fix kernel selection logic for tile FA (llama...	commit \| commitdiff \| tree
2026-02-25	shalinib-ibm	llamafile: powerpc: add FP16 MMA path for Q4/Q8 matmul...	commit \| commitdiff \| tree
2026-02-25	Reese Levine	ggml webgpu: Fix bug in dispatching large matrix-vector...	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-02-25	Reese Levine	ggml webgpu: shader library organization (llama/19530)	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-02-25	Jeff Bolz	vulkan: split mul_mat into multiple dispatches to avoid...	commit \| commitdiff \| tree
2026-02-25	shaofeiqi	opencl: refactor expm1 and softplus (llama/19404)	commit \| commitdiff \| tree
2026-02-25	shaofeiqi	opencl: optimize mean and sum_row kernels (llama/19614)	commit \| commitdiff \| tree
2026-02-25	Talha Can Havadar	ggml: ggml-cpu: force-no-lto-for-cpu-feats (llama/19609)	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	cuda : enable CUDA graphs for MMID 1 <= BS <= 4 (llama...	commit \| commitdiff \| tree
2026-02-25	Judd	ggml : make `ggml_is_view` as API (llama/19539)	commit \| commitdiff \| tree
2026-02-25	Mario Limonciello	Adjust workaround for ROCWMMA_FATTN/GFX9 to only newer...	commit \| commitdiff \| tree
2026-02-25	abhijain1204fujitsu	ggml: aarch64: Implement SVE in Gemm q4_k 8x8 q8_k...	commit \| commitdiff \| tree
2026-02-25	David Friehs	cuda: optimize iq2xxs/iq2xs/iq3xxs dequantization ...	commit \| commitdiff \| tree
2026-02-25	Daniel Bevenius	cmake : check if KleidiAI API has been fetched (llama...	commit \| commitdiff \| tree
2026-02-25	Georgi Gerganov	ggml : avoid UB in gemm ukernel (llama/19642)	commit \| commitdiff \| tree
2026-02-25	Aaron Teo	ggml-cpu: optimize ggml_vec_dot_bf16 for s390x (llama...	commit \| commitdiff \| tree
2026-02-25	Aman Gupta	ggml-cpu: FA add GEMM microkernel (llama/19422)	commit \| commitdiff \| tree
2026-02-25	SamareshSingh	cmake : fix KleidiAI install target failure with EXCLUD...	commit \| commitdiff \| tree
2026-02-25	Salman Chishti	ci : Upgrade GitHub Actions for Node 24 compatibility...	commit \| commitdiff \| tree
2026-02-16	Mathieu Baudier	Build for armv8.4-a	commit \| commitdiff \| tree
2026-02-16	Mathieu Baudier	Build CUDA only on amd64	commit \| commitdiff \| tree
2026-02-16	Mathieu Baudier	Fix arm64 build	commit \| commitdiff \| tree
2026-02-16	Mathieu Baudier	Better align with official Debian packages	commit \| commitdiff \| tree
2026-02-16	Mathieu Baudier	Upstream release	commit \| commitdiff \| tree
2026-02-16	Mathieu Baudier	Merge tag 'upstream/0.9.7' into debian/latest	commit \| commitdiff \| tree
2026-02-15	Georgi Gerganov	ggml : bump version to 0.9.7 (#1425) upstream/0.9.7 v0.9.7	commit \| commitdiff \| tree
2026-02-15	Georgi Gerganov	sync : whisper.cpp	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	models : optimize qwen3next graph (llama/19375)	commit \| commitdiff \| tree
2026-02-14	Adrien Gallouët	ggml : fix GGML_DEBUG with OpenMP (llama/19599)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : fix ACC op (llama/19427)	commit \| commitdiff \| tree
2026-02-14	Jeff Bolz	vulkan: support L2_NORM with contiguous rows (llama...	commit \| commitdiff \| tree
2026-02-14	Jeff Bolz	vulkan: support GGML_OP_SET (llama/19584)	commit \| commitdiff \| tree
2026-02-14	Sophon	vulkan: Add vendor id for Qualcomm drivers (llama/19569)	commit \| commitdiff \| tree
2026-02-14	Max Krasnyansky	hexagon: further optimizations and refactoring for...	commit \| commitdiff \| tree
2026-02-14	Jeff Bolz	vulkan: restore -inf check in FA shaders (llama/19582)	commit \| commitdiff \| tree
2026-02-14	Alberto Cabrera...	Fix wrong memcpy length for block_interleave == 4 ...	commit \| commitdiff \| tree
2026-02-14	ymcki	fix vulkan ggml_acc only works in 3d but not 4d (llama...	commit \| commitdiff \| tree
2026-02-14	Aman Gupta	CUDA: loop over ne2*ne3 in case it overflows (llama...	commit \| commitdiff \| tree
2026-02-14	Oliver Simons	CUDA: Do not mutate cgraph for fused ADDs (llama/19566)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : improve concurrency (llama/19555)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : support GGML_OP_SET (llama/19548)	commit \| commitdiff \| tree
2026-02-14	Shupei Fan	hexagon: fix typo in vtcm_needs_release (llama/19545)	commit \| commitdiff \| tree
2026-02-14	lhez	opencl: add basic support for q4_1 (llama/19534)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : update sum_rows kernel to support float4 (llama...	commit \| commitdiff \| tree
2026-02-14	Mario Limonciello	Add a workaround for compilation with ROCWMMA_FATTN...	commit \| commitdiff \| tree
2026-02-14	Max Krasnyansky	hexagon: further optimization and tuning of matmul...	commit \| commitdiff \| tree
2026-02-14	lhez	opencl: add general Q6_K mm and Q4_K mv (llama/19347)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	ggml : unary ops support non-cont src0 + metal F16...	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : extend l2_norm support for non-cont src0 (llama...	commit \| commitdiff \| tree
2026-02-14	Max Krasnyansky	hexagon: Add ARGSORT, DIV, SQR, SQRT, SUM_ROWS, GEGLU...	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	ggml : extend bin bcast for permuted src1 (llama/19484)	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	metal : consolidate unary ops (llama/19490)	commit \| commitdiff \| tree
2026-02-14	Oliver Simons	CUDA : Update CCCL-tag for 3.2 to final release from...	commit \| commitdiff \| tree
2026-02-14	Nikhil Jain	Plug memory leaks and free resources on shutdown (llama...	commit \| commitdiff \| tree
2026-02-14	Xuan-Son Nguyen	test: fix IMROPE perf test case (llama/19465)	commit \| commitdiff \| tree
2026-02-14	Alberto Cabrera...	ggml-cpu: arm64: q6_K repack gemm and gemv (and generic...	commit \| commitdiff \| tree
2026-02-14	k4ss4n	ggml : use noexcept overload for is_regular_file in...	commit \| commitdiff \| tree
2026-02-14	Raul Torres	CANN: Remove unnecessary wrapper for `gml_backend_buft_...	commit \| commitdiff \| tree
2026-02-14	hipudding	CANN: implement quantized MUL_MAT_ID for MoE models...	commit \| commitdiff \| tree
2026-02-14	Georgi Gerganov	cuda : extend GGML_OP_PAD to work with non-cont src0...	commit \| commitdiff \| tree
2026-02-14	Oliver Simons	CUDA: Fix non-contig rope (llama/19338)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : consolidate bin kernels (llama/19390)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : fix event synchronization in cpy_tensor_async...	commit \| commitdiff \| tree
2026-02-07	Abhijit Ramesh	ggml-webgpu: JIT compile binary operators and handle...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2026-02-07	Nechama Krashinski	sycl: add F16 support for GGML_OP_CEIL (llama/19306)	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	tests: reduce number of FA test permutations (llama...	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: For coopmat2 FA, use fp16 accumulators for...	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: make FA mask/softcap enables spec constants...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : skip loading all-zero mask (llama/19337)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	cuda : cuda graphs now compare all node params (llama...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : adaptive CPU/GPU interleave based on number...	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: Preprocess FA mask to detect all-neg-inf and...	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	metal : add diag (llama/19330)	commit \| commitdiff \| tree
2026-02-07	Oleksandr Kuvshynov	vulkan: fix GPU deduplication logic. (llama/19222)	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: Set k_load_shmem to false when K is too large...	commit \| commitdiff \| tree
2026-02-07	Jeff Bolz	vulkan: fix non-contig rope (llama/19299)	commit \| commitdiff \| tree
2026-02-07	will-lms	metal : add missing includes (llama/19348)	commit \| commitdiff \| tree
2026-02-07	Georgi Gerganov	tests : add non-cont, inplace rope tests (llama/19296)	commit \| commitdiff \| tree
2026-02-07	Kevin Pouget	ggml-virtgpu: make the code thread safe (llama/19204)	commit \| commitdiff \| tree
2026-02-07	Aman Gupta	ggml-cpu: use LUT for converting e8->f32 scales on...	commit \| commitdiff \| tree
next

Packaging of ggml-org/ggml

RSS Atom