git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2025-09-03	Ruben Ortlam	vulkan: fix mmv subgroup16 selection (#15775)	commit \| commitdiff \| tree
2025-09-03	Jeff Bolz	vulkan: don't use std::string in load_shaders, to impro...	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	vulkan : update ggml_vk_instance_validation_ext_availab...	commit \| commitdiff \| tree
2025-09-03	Shin-myoung...	ggml vulkan: add hardsigmoid and hardswish operations...	commit \| commitdiff \| tree
2025-09-03	Oliver Simons	CUDA: Optimize `rms_norm_f32` kernel and its fused...	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	model-conversion : fix pyright errors (#15770)	commit \| commitdiff \| tree
2025-09-03	Georgi Gerganov	sampling : optimize dist sampler (#15704)	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	llama : fix incorrect model type for Gemma 270M (...	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	model-conversion : remove hardcoded /bin/bash shebangs...	commit \| commitdiff \| tree
2025-09-03	hipudding	CANN: Add RoPE contiguous check for 310I DUP device...	commit \| commitdiff \| tree
2025-09-03	xctan	ggml-cpu : optimize RVV kernels (#15720)	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	model-conversion : add missing curl script [no ci]...	commit \| commitdiff \| tree
2025-09-03	hipudding	CANN: Mask unsupported TRANSPOSE_1D operator (#15733)	commit \| commitdiff \| tree
2025-09-03	Chenguang Li	CANN: Fix type float_t to float (#15736)	commit \| commitdiff \| tree
2025-09-02	SnA1lGo	fix: resolve unsigned int initialization warning for...	commit \| commitdiff \| tree
2025-09-02	Oliver Simons	chore: Update `.clang-format` to use `BinPackArguments...	commit \| commitdiff \| tree
2025-09-02	Johannes Gäßler	llama: -fa 1/0/-1 aliases for -fa on/off/auto (#15746)	commit \| commitdiff \| tree
2025-09-02	Ruben Ortlam	vulkan: fix shaders gen when no integer dot is availabl...	commit \| commitdiff \| tree
2025-09-02	hipudding	CANN: Resolve soft_max precision issue (#15730)	commit \| commitdiff \| tree
2025-09-02	Jeff Bolz	vulkan: Fix macro parameter order for f32 matmul shader...	commit \| commitdiff \| tree
2025-09-02	rmatif	opencl: add attn sinks support for FA kernels (#15706)	commit \| commitdiff \| tree
2025-09-02	Chenguang Li	CANN: Support eager execution mode under ACL graph...	commit \| commitdiff \| tree
2025-09-02	hipudding	CANN: Support ext_factor in rope (#15710)	commit \| commitdiff \| tree
2025-09-01	Johannes Gäßler	ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722)	commit \| commitdiff \| tree
2025-09-01	Gilad S.	vulkan: use memory budget extension to read memory...	commit \| commitdiff \| tree
2025-09-01	Jeff Bolz	vulkan: add missing clamps in new mul_mat_id paths...	commit \| commitdiff \| tree
2025-09-01	Ruben Ortlam	vulkan: disable large mmv subgroups on older Nvidia...	commit \| commitdiff \| tree
2025-09-01	s-goto-11	ggml: SVE support for exponential functions (#15145)	commit \| commitdiff \| tree
2025-09-01	Prashant Vithule	ggml: aarch64: Implement SVE F16 kernels for vector...	commit \| commitdiff \| tree
2025-09-01	Jie Fu (傅杰)	convert : remove redundant code (#15708)	commit \| commitdiff \| tree
2025-09-01	Ruben Ortlam	Vulkan: Add Integer Dot Product mul_mat_vec shader...	commit \| commitdiff \| tree
2025-09-01	Daniel Bevenius	ggml : WebGPU add TRANSPOSE and RESHAPE to supported...	commit \| commitdiff \| tree
2025-09-01	Jie Fu (傅杰)	docs : add Hunyuan to models section (#15707)	commit \| commitdiff \| tree
2025-09-01	Akarshan Biswas	CUDA: fix build error from ambiguous __half conversions...	commit \| commitdiff \| tree
2025-09-01	hipudding	CANN: Optimize MUL_MAT_ID (#15658)	commit \| commitdiff \| tree
2025-09-01	hipudding	CANN: fix RoPE cache issue on multi-device (#15629)	commit \| commitdiff \| tree
2025-08-31	Georgi Gerganov	sampling : optimize samplers by reusing bucket sort...	commit \| commitdiff \| tree
2025-08-31	Georgi Gerganov	server : enable /slots by default and make it secure...	commit \| commitdiff \| tree
2025-08-31	Georgi Gerganov	metal : fix checks for available FA kernels (#15700)	commit \| commitdiff \| tree
2025-08-31	Diego Devesa	llama : fix fattn reserve call n_seqs parameter (#15699)	commit \| commitdiff \| tree
2025-08-31	Diego Devesa	llama : separate compute buffer reserve from fattn...	commit \| commitdiff \| tree
2025-08-31	Sigbjørn Skjæret	ci : explicitly set fa off or on (#15692)	commit \| commitdiff \| tree
2025-08-31	Jeff Bolz	vulkan: handle large sizes for get_rows (#15686)	commit \| commitdiff \| tree
2025-08-31	Jeff Bolz	vulkan: mul_mat_id coopmat2 optimizations (#15546)	commit \| commitdiff \| tree
2025-08-31	Daniel Bevenius	vulkan : remove unused portability_enumeration_ext...	commit \| commitdiff \| tree
2025-08-31	Jeff Bolz	vulkan: Allow fallback to sysmem memory when vidmem...	commit \| commitdiff \| tree
2025-08-31	Jeff Bolz	vulkan: clamp matmul and FA results to the max finite...	commit \| commitdiff \| tree
2025-08-30	Charles Xu	ggml: update kleidiai to v1.13.0 (#15663)	commit \| commitdiff \| tree
2025-08-30	Diego Devesa	Update build.md to remove MSVC arm64 notes (#15684)	commit \| commitdiff \| tree
2025-08-30	Johannes Gäßler	llama: use FA + max. GPU layers by default (#15434)	commit \| commitdiff \| tree
2025-08-30	Johannes Gäßler	CUDA: use FP32 arithmetic for conv2d (#15683)	commit \| commitdiff \| tree
2025-08-30	Jeff Bolz	vulkan: Skip syncing for prealloc_y when it is reused...	commit \| commitdiff \| tree
2025-08-30	Chenguang Li	CANN: FIx compiler warnings (#15661)	commit \| commitdiff \| tree
2025-08-29	Sergey Alirzaev	server : removed obsolete doc (#15670)	commit \| commitdiff \| tree
2025-08-29	Johannes Gäßler	scripts: strip "AMD Instinct" from GPU name (#15668)	commit \| commitdiff \| tree
2025-08-29	ExtReMLapin	server : add documentation for `parallel_tool_calls...	commit \| commitdiff \| tree
2025-08-29	Aman Gupta	CUDA: fix bug in rms_norm fusion (#15660)	commit \| commitdiff \| tree
2025-08-29	Piotr Wilkin...	chat : Seed OSS thinking + tool call support (#15552)	commit \| commitdiff \| tree
2025-08-29	Aman Gupta	CUDA: fuse adds, fuse add with rms norm (#15631)	commit \| commitdiff \| tree
2025-08-29	Gabe Goodhart	nvidia nemotron nano v2 (nemotronh) (#15507)	commit \| commitdiff \| tree
2025-08-28	Gabe Goodhart	fix: Compute the full sum in llama-eval-callback, not...	commit \| commitdiff \| tree
2025-08-28	mnehete32	CUDA: add conv2d (#15635)	commit \| commitdiff \| tree
2025-08-28	Aaron Teo	ggml-cpu: fix invalid hsum build in debug s390x (#15634)	commit \| commitdiff \| tree
2025-08-28	compilade	ggml : fix SSM_SCAN for n_groups > 1 (#15625)	commit \| commitdiff \| tree
2025-08-28	Georgi Gerganov	kv-cache : fix find_slot to not search for continuous...	commit \| commitdiff \| tree
2025-08-28	Sigbjørn Skjæret	model : jina-embeddings-v3 support (#13693)	commit \| commitdiff \| tree
2025-08-28	Aman Gupta	scripts: add sqlite3 check for compare-commits.sh ...	commit \| commitdiff \| tree
2025-08-28	Georgi Gerganov	kv-cache : remove LLAMA_SET_ROWS checks (#15505)	commit \| commitdiff \| tree
2025-08-28	Aleksei Nikiforov	gguf-py: byteswapping improvements (#12851)	commit \| commitdiff \| tree
2025-08-28	Joshua Cogliati	cli : change log to warning to explain reason for stopp...	commit \| commitdiff \| tree
2025-08-28	Daniel Bevenius	model-conversion : add mmproj conversion target (#15628)	commit \| commitdiff \| tree
2025-08-28	matiaslin	cuda: Add cublasLt_static linking when GGML_STATIC...	commit \| commitdiff \| tree
2025-08-27	Johannes Gäßler	server: higher timeout for tests (#15621)	commit \| commitdiff \| tree
2025-08-27	Georgi Gerganov	presets : add qwen3-30B-a3b FIM (#15616)	commit \| commitdiff \| tree
2025-08-27	uvos	HIP: Enable support for ggml_backend_cuda_register_host...	commit \| commitdiff \| tree
2025-08-27	Georgi Gerganov	kv-cache : better estimate of n_kv for multi-sequence...	commit \| commitdiff \| tree
2025-08-27	Chenguang Li	CANN: refactor mask handling and improve performance...	commit \| commitdiff \| tree
2025-08-27	xctan	ggml-cpu : add basic RVV support for vector f32 ops...	commit \| commitdiff \| tree
2025-08-27	Daniel Bevenius	common : add -m to bash completion for --model [no...	commit \| commitdiff \| tree
2025-08-27	rmatif	OpenCL: add fused group_norm/norm, mul, add (#15314)	commit \| commitdiff \| tree
2025-08-26	Diego Devesa	tests : fix test-opt with GGML_BACKEND_DL (#15599)	commit \| commitdiff \| tree
2025-08-26	Akarshan Biswas	SYCL: fix rms_norm_mul_add for tensor dim not a multipl...	commit \| commitdiff \| tree
2025-08-26	fidoriel	mtmd : fix mtmd ios build (#15579)	commit \| commitdiff \| tree
2025-08-26	Eve	tests: add performance test for mul mat id (#15543)	commit \| commitdiff \| tree
2025-08-26	shalinib-ibm	llamafile: PowerPC Sgemm Optimization (#15558)	commit \| commitdiff \| tree
2025-08-26	Georgi Gerganov	graph : fix assert in memory-less build_attn (#15590)	commit \| commitdiff \| tree
2025-08-26	Daniel Bevenius	model-conversion : add qat-q4 quantization targets...	commit \| commitdiff \| tree
2025-08-26	Johannes Gäßler	CUDA: return -1 for nonexistent compiled arch (#15587)	commit \| commitdiff \| tree
2025-08-26	Georgi Gerganov	metal : optimize FA vec for large sequences and BS...	commit \| commitdiff \| tree
2025-08-26	Xuan-Son Nguyen	mtmd : support Kimi VL model (#15458)	commit \| commitdiff \| tree
2025-08-26	Georgi Gerganov	context : print graph stats for memory-less contexts...	commit \| commitdiff \| tree
2025-08-26	Georgi Gerganov	metal : improve `MUL_MAT_ID` (#15541)	commit \| commitdiff \| tree
2025-08-26	tc-mb	model : support MiniCPM-V 4.5 (#15575)	commit \| commitdiff \| tree
2025-08-26	Sigbjørn Skjæret	gguf-py : remove erroneous FFN_GATE entry (#15583)	commit \| commitdiff \| tree
2025-08-26	Sigbjørn Skjæret	metal : remove contiguous assertion for src0 in IM2COL...	commit \| commitdiff \| tree
2025-08-26	Yoshi_likes_e4	Add a warning for special devices (#15563)	commit \| commitdiff \| tree
2025-08-26	Jeff Bolz	vulkan: Remove splitting for mul_mat_id (#15568)	commit \| commitdiff \| tree
2025-08-25	Qeeweew	CUDA: Accelerate MXFP4 table lookup using `__byte_perm...	commit \| commitdiff \| tree
2025-08-25	lhez	opencl: fix support ops condition for `rms_norm` (...	commit \| commitdiff \| tree
2025-08-25	Ruben Ortlam	vulkan: fix min subgroup 16 condition for mmid subgroup...	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom