git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

2025-07-31	hipudding	CANN: Improve loading efficiency after converting weigh...	commit \| commitdiff \| tree
2025-07-31	compilade	graph : reduce splits for recurrent and hybrid models...	commit \| commitdiff \| tree
2025-07-30	lhez	opencl: add `mul_mat_f32_f32_l4_lm` and `mul_mat_f16_f3...	commit \| commitdiff \| tree
2025-07-30	Ed Addario	quantize : fix using combined imatrix GGUFs (multiple...	commit \| commitdiff \| tree
2025-07-30	Daniel Bevenius	server : add support for `embd_normalize` parameter...	commit \| commitdiff \| tree
2025-07-30	uvos	HIP: enable mfma mmq on gfx908 and gfx90a for select...	commit \| commitdiff \| tree
2025-07-30	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-07-30	Kai Pastor	cmake : Fix BLAS link interface (ggml/1316)	commit \| commitdiff \| tree
2025-07-30	Kai Pastor	vulkan : fix 32-bit builds (ggml/1313)	commit \| commitdiff \| tree
2025-07-30	Johannes Gäßler	CUDA: skip masked KV slices for all FA kernels (#14924)	commit \| commitdiff \| tree
2025-07-30	Georgi Gerganov	tests : update for LLAMA_SET_ROWS=1 (#14961)	commit \| commitdiff \| tree
2025-07-30	Georgi Gerganov	graph : fix stack-use-after-return (#14960)	commit \| commitdiff \| tree
2025-07-30	Douglas Hanley	embeddings: fix extraction of CLS pooling results ...	commit \| commitdiff \| tree
2025-07-30	Xinpeng Dou	CANN: update ops docs (#14935)	commit \| commitdiff \| tree
2025-07-29	uvos	HIP: remove the use of __HIP_PLATFORM_AMD__, explicitly...	commit \| commitdiff \| tree
2025-07-29	uvos	HIP: add GGML_HIP_MMQ_MFMA option to allow disableing...	commit \| commitdiff \| tree
2025-07-29	uvos	HIP: Ignore unsupported unroll transformation in fattn...	commit \| commitdiff \| tree
2025-07-29	kallewoof	common : avoid logging partial messages (which can...	commit \| commitdiff \| tree
2025-07-29	hipudding	CANN: Add ggml_set_rows (#14943)	commit \| commitdiff \| tree
2025-07-29	Sigbjørn Skjæret	cuda : add softcap fusion (#14907)	commit \| commitdiff \| tree
2025-07-29	Johannes Gäßler	server-bench: make seed choice configurable (#14929)	commit \| commitdiff \| tree
2025-07-29	Aman Gupta	CUDA: add roll (#14919)	commit \| commitdiff \| tree
2025-07-28	lhez	opencl : add ops docs (#14910)	commit \| commitdiff \| tree
2025-07-28	Leonard Mosescu	test-backend-ops : extend test case filtering (#14865)	commit \| commitdiff \| tree
2025-07-28	Radoslav Gerganov	llama-bench : use local GPUs along with RPC servers...	commit \| commitdiff \| tree
2025-07-28	xctan	ggml-cpu : deduplicate scalar implementations (#14897)	commit \| commitdiff \| tree
2025-07-28	Akarshan Biswas	SYCL: Add set_rows support for quantized types (#14883)	commit \| commitdiff \| tree
2025-07-28	Xuan-Son Nguyen	mtmd : add support for Voxtral (#14862)	commit \| commitdiff \| tree
2025-07-28	Johannes Gäßler	CUDA: fix pointer incrementation in FA (#14916)	commit \| commitdiff \| tree
2025-07-28	Dongliang Wei	model : add support for SmallThinker series (#14898)	commit \| commitdiff \| tree
2025-07-28	Alberto Cabrera...	sycl: refactor quantization to q8_1 (#14815)	commit \| commitdiff \| tree
2025-07-28	Georgi Gerganov	ops : update BLAS (#14914)	commit \| commitdiff \| tree
2025-07-28	Georgi Gerganov	ops : update Metal (#14912)	commit \| commitdiff \| tree
2025-07-28	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-07-28	Kai Pastor	cmake : Indent ggml-config.cmake (ggml/1310)	commit \| commitdiff \| tree
2025-07-27	Ed Addario	quantize : update README.md (#14905)	commit \| commitdiff \| tree
2025-07-27	Ruben Ortlam	vulkan: add ops docs (#14900)	commit \| commitdiff \| tree
2025-07-27	Akarshan Biswas	SYCL: add ops doc (#14901)	commit \| commitdiff \| tree
2025-07-27	Daniel Bevenius	llama : clarify comment about pp and tg graphs [no...	commit \| commitdiff \| tree
2025-07-27	Erik Scholz	vulkan : add fp16 support for the conv_2d kernel (...	commit \| commitdiff \| tree
2025-07-27	Jeff Bolz	vulkan: skip empty set_rows to avoid invalid API usage...	commit \| commitdiff \| tree
2025-07-27	Gabriel Larson	model : make rope_yarn_log_mul optional for deepseek2...	commit \| commitdiff \| tree
2025-07-27	Shunta Saito	llama : fix kq_scale for the attention layers of PLaMo2...	commit \| commitdiff \| tree
2025-07-27	Aman Gupta	Docs: add instructions for adding backends (#14889)	commit \| commitdiff \| tree
2025-07-26	deepsek	HIP: Enable Matrix cores for MMQ Kernels, Enable stream...	commit \| commitdiff \| tree
2025-07-26	hipudding	CANN: Implement GLU ops (#14884)	commit \| commitdiff \| tree
2025-07-26	R0CKSTAR	musa: fix build warnings (unused variable) (#14869)	commit \| commitdiff \| tree
2025-07-25	Aaron Teo	ggml-cpu : disable GGML_NNPA by default due to instabil...	commit \| commitdiff \| tree
2025-07-25	Gabe Goodhart	metal: SSM_SCAN performance (#14743)	commit \| commitdiff \| tree
2025-07-25	lhez	opencl: add fused `rms_norm_mul` (#14841)	commit \| commitdiff \| tree
2025-07-25	wooksong	docs : update HOWTO‑add‑model.md for ModelBase and...	commit \| commitdiff \| tree
2025-07-25	Oliver Simons	ggml : remove invalid portPos specifiers from dot files...	commit \| commitdiff \| tree
2025-07-25	Georgi Gerganov	context : restore preemptive sched reset when LLAMA_SET...	commit \| commitdiff \| tree
2025-07-25	kiwi	mtmd : fix 32-bit narrowing issue in export-lora and...	commit \| commitdiff \| tree
2025-07-25	Chris Rohlf	rpc : check for null buffers in get/set/copy tensor...	commit \| commitdiff \| tree
2025-07-25	Diego Devesa	sched : fix multiple evaluations of the same graph...	commit \| commitdiff \| tree
2025-07-24	R0CKSTAR	musa: upgrade musa sdk to rc4.2.0 (#14498)	commit \| commitdiff \| tree
2025-07-24	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2025-07-24	Kai Pastor	cmake : fix usage issues (ggml/1257)	commit \| commitdiff \| tree
2025-07-24	Daniel Bevenius	ggml-cpu : remove stdlib include from repack.cpp (ggml...	commit \| commitdiff \| tree
2025-07-24	Georgi Gerganov	context : perform output reorder lazily upon access...	commit \| commitdiff \| tree
2025-07-24	Xuan-Son Nguyen	chat : fix kimi-k2 chat template (#14852)	commit \| commitdiff \| tree
2025-07-24	Alberto Cabrera...	sycl: fixed semantics of block offset calculation ...	commit \| commitdiff \| tree
2025-07-24	yummy	llama : fix MiniCPM inference after Granite Four change...	commit \| commitdiff \| tree
2025-07-24	Pouya	docs: add libcurl-dev install hint for Linux distros...	commit \| commitdiff \| tree
2025-07-24	Georgi Gerganov	metal : fix fusion across different encoders (#14849)	commit \| commitdiff \| tree
2025-07-24	Donghyeon Jeong	sycl: fix undefined variable in work group size check...	commit \| commitdiff \| tree
2025-07-23	jacekpoplawski	convert : text-only support for GLM-4.1V-9B-Thinking...	commit \| commitdiff \| tree
2025-07-23	Johannes Gäßler	CUDA: fix overflow in FA, tune performance (#14840)	commit \| commitdiff \| tree
2025-07-23	Johannes Gäßler	CUDA: fix compilation with GGML_CUDA_F16 (#14837)	commit \| commitdiff \| tree
2025-07-23	Sigbjørn Skjæret	ci : correct label refactor->refactoring (#14832)	commit \| commitdiff \| tree
2025-07-23	Johannes Gäßler	CUDA: fix quantized KV cache + multiple sequences ...	commit \| commitdiff \| tree
2025-07-23	Georgi Gerganov	tests : add non-cont K,V FA tests	commit \| commitdiff \| tree
2025-07-23	l3utterfly	memory : handle saving/loading null layers in recurrent...	commit \| commitdiff \| tree
2025-07-23	lixing-star	ggml: fix loongarch quantize_row_q8_1 error (#14827)	commit \| commitdiff \| tree
2025-07-23	chen fan	CANN: weight format to NZ for Ascend310P3 (#14407)	commit \| commitdiff \| tree
2025-07-23	Aman Gupta	CUDA: add fused rms norm (#14800)	commit \| commitdiff \| tree
2025-07-22	Csaba Kecskemeti	ggml : model card yaml tab->2xspace (#14819)	commit \| commitdiff \| tree
2025-07-22	Jeff Bolz	vulkan: fix rms_norm_mul to handle broadcasting dim0...	commit \| commitdiff \| tree
2025-07-22	Molly Sophia	llama : add model type detection for rwkv7 7B&14B ...	commit \| commitdiff \| tree
2025-07-22	Ed Addario	imatrix: add option to display importance score statist...	commit \| commitdiff \| tree
2025-07-22	stduhpf	Mtmd: add a way to select device for vision encoder...	commit \| commitdiff \| tree
2025-07-22	Sigbjørn Skjæret	cuda : implement bf16 cpy ops and enable bf16 cont...	commit \| commitdiff \| tree
2025-07-22	lhez	opencl: remove unreachable `return` (#14806)	commit \| commitdiff \| tree
2025-07-22	Molly Sophia	server : allow setting `--reverse-prompt` arg (#14799)	commit \| commitdiff \| tree
2025-07-21	R0CKSTAR	cuda: remove linking to cublasLt (#14790)	commit \| commitdiff \| tree
2025-07-21	Sigbjørn Skjæret	opencl: fix `im2col` when `KW!=KH` (#14803)	commit \| commitdiff \| tree
2025-07-21	rmatif	opencl: add conv2d kernel (#14403)	commit \| commitdiff \| tree
2025-07-21	Romain Biessy	sycl: Fix im2col (#14797)	commit \| commitdiff \| tree
2025-07-21	Charles Xu	kleidiai: add support for get_rows (#14676)	commit \| commitdiff \| tree
2025-07-21	Radoslav Gerganov	docs : fix backends table in README.md (#14796)	commit \| commitdiff \| tree
2025-07-21	Jeff Bolz	vulkan/cuda: Fix im2col when KW!=KH (#14789)	commit \| commitdiff \| tree
2025-07-21	Molly Sophia	llama : fix `--reverse-prompt` crashing issue (#14794)	commit \| commitdiff \| tree
2025-07-21	IsaacDynamo	server : add parse_special option to /tokenize endpoint...	commit \| commitdiff \| tree
2025-07-20	Aman Gupta	docs : fix link for tools/perplexity in README.md ...	commit \| commitdiff \| tree
2025-07-20	rspOverflow	Documentation: Further revisions to the Vulkan section...	commit \| commitdiff \| tree
2025-07-20	Aman Gupta	Clang-format: local files first + fix BinPacking (...	commit \| commitdiff \| tree
2025-07-19	0cc4m	Contrib: add 0cc4m as codeowner for Vulkan backend...	commit \| commitdiff \| tree
2025-07-19	Ervin Áron...	ggml: adds CONV_2D op and direct GEMM Vulkan implementa...	commit \| commitdiff \| tree
2025-07-19	compilade	imatrix : use GGUF to store importance matrices (#9400)	commit \| commitdiff \| tree

Packaging of ggml-org/llama.cpp
