git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog


2025-08-04	compilade	imatrix : warn when GGUF imatrix is saved without ...
2025-08-04	Christian Kastner	cmake: Add GGML_BACKEND_DIR option (#15074)
2025-08-04	Sigbjørn Skjæret	gguf-py : add --chat-template-file to gguf_new_metadata...
2025-08-04	Sam	model: support GLM 4.5 family of models (#14939)
2025-08-04	Sigbjørn Skjæret	quantize : fix confusing error message if ftype is...
2025-08-04	Reese Levine	ggml: WebGPU backend host improvements and style fixing...
2025-08-04	Jeff Bolz	vulkan: fix build when using glslang that does not...
2025-08-03	compilade	imatrix : use GGUF by default (#14842)
2025-08-03	compilade	imatrix : fix 3d activation handling for hybrid and...
2025-08-03	compilade	memory : handle kv_unified for hybrid models (#15050)
2025-08-03	Csaba Kecskemeti	vocab : JetBrains Mellum pre-tokenizer (#15045)
2025-08-03	Gabriel Larson	model : add text-only support for Kimi-VL (and find...
2025-08-03	Jeff Bolz	vulkan: Use coopmat2 for conv2d (#14982)
2025-08-02	lhez	opencl: fix adreno compiler detection logic (#15029)
2025-08-02	Johannes Gäßler	CUDA: use mma FA kernel for gqa > 4 on RTX 4000 (#15035)
2025-08-02	leejet	cuda: make im2col a little faster (#15025) upstream/0.0.6073
2025-08-02	Daniel Bevenius	kv-cache : skip alignment of n_stream in kv-cache log...
2025-08-02	Georgi Gerganov	llama : enable LLAMA_SET_ROWS=1 by default (#14959)
2025-08-02	Georgi Gerganov	cuda, sycl : fix batched gemm when ne02 == 1 && ne03...
2025-08-02	Sigbjørn Skjæret	ci : check that pre-tokenizer hashes are up-to-date...
2025-08-02	Douglas Hanley	convert : fix Qwen3-Embedding pre-tokenizer hash (...
2025-08-02	Jhen-Jie Hong	chat : fix multiple tool_calls on hermes-2-pro (#14962)
2025-08-02	Jeff Bolz	vulkan: coopmat2 mul_mat optimizations (#14934)
2025-08-02	R0CKSTAR	llama-bench: rename DB table name from test to llama_be...
2025-08-02	Jeff Bolz	vulkan: Support ne[3]>1 in noncontig matrix-vector...
2025-08-02	Douglas Hanley	model : support Qwen3-Embedding (#15023)
2025-08-02	Johannes Gäßler	server: enable token array inputs for OAI API (#15001)
2025-08-02	Jeff Bolz	vulkan: optimizations for direct convolution (#14933)
2025-08-01	Johannes Gäßler	CUDA: fix MMQ nwarps for AMD with warp_size==32 (#15014)
2025-08-01	l-austenfeld	vendor : update vendored copy of google/minja (#15011)
2025-08-01	stevenkuang	model : add hunyuan dense (#14878)
2025-08-01	lhez	opencl: add f16 for `add`, `sub`, `mul`, `div` (#14984)
2025-08-01	Srihari-mcw	ggml : Q2k interleaving implementation - x86/x64 SIMD...
2025-08-01	Georgi Gerganov	graph : fix equal_seq() check (#14986)
2025-08-01	diannao	docker : add cann build pipline (#14591)
2025-08-01	R0CKSTAR	compare-commits.sh: support both llama-bench and test...
2025-07-31	Ed Addario	quantize : skip tensor override when in fallback mode...
2025-07-31	Diego Devesa	llama : add simple option to enable CPU for MoE weights...
2025-07-31	Aman Gupta	Fix params bug in diffusion example (#14993)
2025-07-31	Diego Devesa	llama : allow other bufts when overriding to CPU, add...
2025-07-31	Ruben Ortlam	Vulkan: Fix minor debug mode issues (#14899)
2025-07-31	tc-mb	mtmd : support MiniCPM-V 4.0 (#14983)
2025-07-31	Csaba Kecskemeti	MODEL_TENSOR.SSM_DT_NORM has defined twice (#14991)
2025-07-31	g2mt	server : implement universal assisted decoding (#12635)
2025-07-31	Dongliang Wei	llama : merge build_moe_ffn_from_probs function into...
2025-07-31	Lukas Straub	server : add openai-style logit_bias support (#14946)
2025-07-31	Aman Gupta	Add LLaDA 8b Diffusion model (#14771)
2025-07-31	hipudding	CANN: Improve loading efficiency after converting weigh...
2025-07-31	compilade	graph : reduce splits for recurrent and hybrid models...
2025-07-30	lhez	opencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f3...
2025-07-30	Ed Addario	quantize : fix using combined imatrix GGUFs (multiple...
2025-07-30	Daniel Bevenius	server : add support for `embd_normalize` parameter...
2025-07-30	uvos	HIP: enable mfma mmq on gfx908 and gfx90a for select...
2025-07-30	Georgi Gerganov	sync : ggml
2025-07-30	Kai Pastor	cmake : Fix BLAS link interface (ggml/1316)
2025-07-30	Kai Pastor	vulkan : fix 32-bit builds (ggml/1313)
2025-07-30	Johannes Gäßler	CUDA: skip masked KV slices for all FA kernels (#14924)
2025-07-30	Georgi Gerganov	tests : update for LLAMA_SET_ROWS=1 (#14961)
2025-07-30	Georgi Gerganov	graph : fix stack-use-after-return (#14960)
2025-07-30	Douglas Hanley	embeddings: fix extraction of CLS pooling results ...
2025-07-30	Xinpeng Dou	CANN: update ops docs (#14935)
2025-07-29	uvos	HIP: remove the use of __HIP_PLATFORM_AMD__, explicitly...
2025-07-29	uvos	HIP: add GGML_HIP_MMQ_MFMA option to allow disableing...
2025-07-29	uvos	HIP: Ignore unsupported unroll transformation in fattn...
2025-07-29	kallewoof	common : avoid logging partial messages (which can...
2025-07-29	hipudding	CANN: Add ggml_set_rows (#14943)
2025-07-29	Sigbjørn Skjæret	cuda : add softcap fusion (#14907)
2025-07-29	Johannes Gäßler	server-bench: make seed choice configurable (#14929)
2025-07-29	Aman Gupta	CUDA: add roll (#14919)
2025-07-28	lhez	opencl : add ops docs (#14910)
2025-07-28	Leonard Mosescu	test-backend-ops : extend test case filtering (#14865)
2025-07-28	Radoslav Gerganov	llama-bench : use local GPUs along with RPC servers...
2025-07-28	xctan	ggml-cpu : deduplicate scalar implementations (#14897)
2025-07-28	Akarshan Biswas	SYCL: Add set_rows support for quantized types (#14883)
2025-07-28	Xuan-Son Nguyen	mtmd : add support for Voxtral (#14862)
2025-07-28	Johannes Gäßler	CUDA: fix pointer incrementation in FA (#14916)
2025-07-28	Dongliang Wei	model : add support for SmallThinker series (#14898)
2025-07-28	Alberto Cabrera...	sycl: refactor quantization to q8_1 (#14815)
2025-07-28	Georgi Gerganov	ops : update BLAS (#14914)
2025-07-28	Georgi Gerganov	ops : update Metal (#14912)
2025-07-28	Georgi Gerganov	sync : ggml
2025-07-28	Kai Pastor	cmake : Indent ggml-config.cmake (ggml/1310)
2025-07-27	Ed Addario	quantize : update README.md (#14905)
2025-07-27	Ruben Ortlam	vulkan: add ops docs (#14900)
2025-07-27	Akarshan Biswas	SYCL: add ops doc (#14901)
2025-07-27	Daniel Bevenius	llama : clarify comment about pp and tg graphs [no...
2025-07-27	Erik Scholz	vulkan : add fp16 support for the conv_2d kernel (...
2025-07-27	Jeff Bolz	vulkan: skip empty set_rows to avoid invalid API usage...
2025-07-27	Gabriel Larson	model : make rope_yarn_log_mul optional for deepseek2...
2025-07-27	Shunta Saito	llama : fix kq_scale for the attention layers of PLaMo2...
2025-07-27	Aman Gupta	Docs: add instructions for adding backends (#14889)
2025-07-26	deepsek	HIP: Enable Matrix cores for MMQ Kernels, Enable stream...
2025-07-26	hipudding	CANN: Implement GLU ops (#14884)
2025-07-26	R0CKSTAR	musa: fix build warnings (unused variable) (#14869)
2025-07-25	Aaron Teo	ggml-cpu : disable GGML_NNPA by default due to instabil...
2025-07-25	Gabe Goodhart	metal: SSM_SCAN performance (#14743)
2025-07-25	lhez	opencl: add fused `rms_norm_mul` (#14841)
2025-07-25	wooksong	docs : update HOWTO‑add‑model.md for ModelBase and...
2025-07-25	Oliver Simons	ggml : remove invalid portPos specifiers from dot files...
2025-07-25	Georgi Gerganov	context : restore preemptive sched reset when LLAMA_SET...

Packaging of ggml-org/llama.cpp
