git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2025-09-11	Daniel Bevenius	ggml-cpu : add check for ARM MATMUL_INT8/i8mm support...	commit \| commitdiff \| tree
2025-09-11	Charles Xu	kleidiai: fix GGML_ASSERT(*cur_backend_id != -1) failed...	commit \| commitdiff \| tree
2025-09-11	hipudding	CANN: Disable acl_graph for prefill stage (#15933)	commit \| commitdiff \| tree
2025-09-10	Oliver Simons	CUDA: Add `fastdiv` to `k_bin_bcast*`, giving 1-3%...	commit \| commitdiff \| tree
2025-09-10	Jie Fu (傅杰)	llama : support T5 models with unequal number of encode...	commit \| commitdiff \| tree
2025-09-10	Sigbjørn Skjæret	graph : support non-contiguous Q in build_attn_mha...	commit \| commitdiff \| tree
2025-09-10	Daniel Bevenius	ggml-cpu : fix padding in ggml_timestep_embedding ...	commit \| commitdiff \| tree
2025-09-10	Georgi Gerganov	metal : make the backend async (#15906)	commit \| commitdiff \| tree
2025-09-10	Daniel Bevenius	ci : add caching for ROCm installation in release workf...	commit \| commitdiff \| tree
2025-09-10	Daniel Bevenius	tests : filter out no-ops from coverage report (#15900)	commit \| commitdiff \| tree
2025-09-10	j-k	media : add transparent icon svg and png [no ci] (...	commit \| commitdiff \| tree
2025-09-10	Jesse	gitignore : Ignore vim swap files in tests (#15901)	commit \| commitdiff \| tree
2025-09-10	Chenguang Li	CANN: Add ROPE sin/cos cache for reuse (#15912)	commit \| commitdiff \| tree
2025-09-10	Chenguang Li	CANN: implement LRU cache for ACL graphs (#15814)	commit \| commitdiff \| tree
2025-09-10	Daniel Bevenius	llama : check returned fn ptrs from ggml_backend_reg_ge...	commit \| commitdiff \| tree
2025-09-10	Daniel Bevenius	ci : cache ROCm installation in windows-latest-cmake...	commit \| commitdiff \| tree
2025-09-09	Ruben Ortlam	vulkan: throw the oom error instead of no memory type...	commit \| commitdiff \| tree
2025-09-09	Jeff Bolz	vulkan: Fix OOB accesses in soft_max_back (#15861)	commit \| commitdiff \| tree
2025-09-09	Johannes Gäßler	HIP: use v_dot2_f32_f16 instruction for FA (#15884)	commit \| commitdiff \| tree
2025-09-09	lksj92hs	Workaround for subgroup arithmetic failing on MoltenVK...	commit \| commitdiff \| tree
2025-09-09	Aman Gupta	CUDA: Add mul_mat_id support for the mmf kernel (#15767)	commit \| commitdiff \| tree
2025-09-09	Johannes Gäßler	CUDA: fix GET_ROWS for large tensors (#15882)	commit \| commitdiff \| tree
2025-09-09	Georgi Gerganov	contrib : add notes about merging PRs (#15881)	commit \| commitdiff \| tree
2025-09-09	Daniel Bevenius	requirements : update transformers/torch for Embedding...	commit \| commitdiff \| tree
2025-09-09	Piotr Wilkin...	model-conversion : add extra debugging support for...	commit \| commitdiff \| tree
2025-09-08	Aldehir Rojas	json : support `enum` values within `allOf` (#15830)	commit \| commitdiff \| tree
2025-09-08	j-k	media : add llama1 icon (#15878)	commit \| commitdiff \| tree
2025-09-08	Jeff Bolz	vulkan: sort graph to allow more parallel execution...	commit \| commitdiff \| tree
2025-09-08	Aman Gupta	CUDA: generate_cu_files.py - add missing mxfp4 (#15880)	commit \| commitdiff \| tree
2025-09-08	Jesse	chat : Deepseek V3.1 reasoning and tool calling support...	commit \| commitdiff \| tree
2025-09-08	Xuan-Son Nguyen	server : bring back timings_per_token (#15879)	commit \| commitdiff \| tree
2025-09-08	Georgi Gerganov	cuda : fix supports_op condition for get_rows when...	commit \| commitdiff \| tree
2025-09-08	Georgi Gerganov	metal : refactor + optimize (#15857)	commit \| commitdiff \| tree
2025-09-08	Xuan-Son Nguyen	ggml: allow casting between f32 and i32 (#15783)	commit \| commitdiff \| tree
2025-09-08	Sigbjørn Skjæret	CUDA: non-contiguous src0 not supported for PAD (#15869)	commit \| commitdiff \| tree
2025-09-08	Daniel Bevenius	convert : force setting sliding_window from original...	commit \| commitdiff \| tree
2025-09-08	Georgi Gerganov	batched-bench : fix llama_synchronize usage during...	commit \| commitdiff \| tree
2025-09-08	Georgi Gerganov	context : fix n_outputs during reserve (#15858)	commit \| commitdiff \| tree
2025-09-08	Georgi Gerganov	model : avoid ggml_cont_3d for fused QKV weights (...	commit \| commitdiff \| tree
2025-09-08	Jeff Bolz	tests: large sizes for get_rows (#15687)	commit \| commitdiff \| tree
2025-09-08	Chenguang Li	CANN: Stream sync between devices for acl_graph (#15809)	commit \| commitdiff \| tree
2025-09-07	Jeff Bolz	vulkan: support im2col_3d (#15795)	commit \| commitdiff \| tree
2025-09-07	Aaron Teo	ggml-cpu: clean up s390x SIMD (#15855)	commit \| commitdiff \| tree
2025-09-07	Jeff Bolz	vulkan: Support pad_ext (#15794)	commit \| commitdiff \| tree
2025-09-07	Jeff Bolz	vulkan: Use larger loads in scalar/coopmat1 matmul...	commit \| commitdiff \| tree
2025-09-07	Daniel Bevenius	ggml WebGPU: remove userdata from request adapter callb...	commit \| commitdiff \| tree
2025-09-06	Johannes Gäßler	CUDA: faster tile FA (Pascal/AMD), headsize 256 (#15769)	commit \| commitdiff \| tree
2025-09-06	Charles Xu	kleidiai: generalize compute_forward_kv_cache to comput...	commit \| commitdiff \| tree
2025-09-06	Xuan-Son Nguyen	server : speed up tests (#15836)	commit \| commitdiff \| tree
2025-09-06	Xuan-Son Nguyen	server : implement prompt processing progress report...	commit \| commitdiff \| tree
2025-09-06	Johannes Gäßler	ggml-cpu: document use of "free" memory [no ci] (#15834)	commit \| commitdiff \| tree
2025-09-06	Aaron Teo	ggml-cpu: drop support for nnpa intrinsics (#15821)	commit \| commitdiff \| tree
2025-09-05	Gabe Goodhart	aLoRA Support (#15327)	commit \| commitdiff \| tree
2025-09-05	Sigbjørn Skjæret	ci : exempt correct research label (#15825)	commit \| commitdiff \| tree
2025-09-05	Gabe Goodhart	Thinking model disabled assistant prefill (#15404)	commit \| commitdiff \| tree
2025-09-05	Eric Curtin	Implement --log-colors with always/never/auto (#15792)	commit \| commitdiff \| tree
2025-09-05	Johannes Gäßler	CUDA: fastdiv, launch bounds for mmvq + q8_1 quant...	commit \| commitdiff \| tree
2025-09-05	Daniel Bevenius	tests : add --list-ops and --show-coverage options...	commit \| commitdiff \| tree
2025-09-05	Erik Scholz	gguf: gguf_writer refactor (#15691)	commit \| commitdiff \| tree
2025-09-05	Georgi Gerganov	kv-cache : fix SWA checks + disable cacheless iSWA...	commit \| commitdiff \| tree
2025-09-05	Daniel Bevenius	model-conversion : add --embeddings flag to modelcard...	commit \| commitdiff \| tree
2025-09-04	ExtReMLapin	chat : fixed crash when Hermes 2 <tool_call> had a...	commit \| commitdiff \| tree
2025-09-04	Piotr Wilkin...	chat : nemotron thinking & toolcalling support (#15676)	commit \| commitdiff \| tree
2025-09-04	Piotr Wilkin...	scripts : add Jinja tester PySide6 simple app (#15756)	commit \| commitdiff \| tree
2025-09-04	Daniel Bevenius	llama : add support for EmbeddingGemma 300m (#15798)	commit \| commitdiff \| tree
2025-09-04	Gabe Goodhart	metal : Add template specialization for mul_mm_id w...	commit \| commitdiff \| tree
2025-09-04	Daniel Bevenius	llama : set n_outputs to 1 to avoid 0 outputs mean...	commit \| commitdiff \| tree
2025-09-04	Chenguang Li	CANN: Refactor ND to NZ workspace to be per-device...	commit \| commitdiff \| tree
2025-09-04	Xuan-Son Nguyen	server: add exceed_context_size_error type (#15780)	commit \| commitdiff \| tree
2025-09-04	Eric Curtin	Document the new max GPU layers default in help (#15771)	commit \| commitdiff \| tree
2025-09-04	leejet	ggml: add ops for WAN video model (cuda && cpu) (#15669)	commit \| commitdiff \| tree
2025-09-04	hipudding	CANN: Fix precision issue on 310I DUO multi-devices...	commit \| commitdiff \| tree
2025-09-04	rmatif	opencl: add hs=40 to FA (#15758)	commit \| commitdiff \| tree
2025-09-04	Chenguang Li	CANN: fix acl_rstd allocation size in ggml_cann_rms_nor...	commit \| commitdiff \| tree
2025-09-03	Ruben Ortlam	vulkan: fix mmv subgroup16 selection (#15775)	commit \| commitdiff \| tree
2025-09-03	Jeff Bolz	vulkan: don't use std::string in load_shaders, to impro...	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	vulkan : update ggml_vk_instance_validation_ext_availab...	commit \| commitdiff \| tree
2025-09-03	Shin-myoung...	ggml vulkan: add hardsigmoid and hardswish operations...	commit \| commitdiff \| tree
2025-09-03	Oliver Simons	CUDA: Optimize `rms_norm_f32` kernel and its fused...	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	model-conversion : fix pyright errors (#15770)	commit \| commitdiff \| tree
2025-09-03	Georgi Gerganov	sampling : optimize dist sampler (#15704)	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	llama : fix incorrect model type for Gemma 270M (...	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	model-conversion : remove hardcoded /bin/bash shebangs...	commit \| commitdiff \| tree
2025-09-03	hipudding	CANN: Add RoPE contiguous check for 310I DUP device...	commit \| commitdiff \| tree
2025-09-03	xctan	ggml-cpu : optimize RVV kernels (#15720)	commit \| commitdiff \| tree
2025-09-03	Daniel Bevenius	model-conversion : add missing curl script [no ci]...	commit \| commitdiff \| tree
2025-09-03	hipudding	CANN: Mask unsupported TRANSPOSE_1D operator (#15733)	commit \| commitdiff \| tree
2025-09-03	Chenguang Li	CANN: Fix type float_t to float (#15736)	commit \| commitdiff \| tree
2025-09-02	SnA1lGo	fix: resolve unsigned int initialization warning for...	commit \| commitdiff \| tree
2025-09-02	Oliver Simons	chore: Update `.clang-format` to use `BinPackArguments...	commit \| commitdiff \| tree
2025-09-02	Johannes Gäßler	llama: -fa 1/0/-1 aliases for -fa on/off/auto (#15746)	commit \| commitdiff \| tree
2025-09-02	Ruben Ortlam	vulkan: fix shaders gen when no integer dot is availabl...	commit \| commitdiff \| tree
2025-09-02	hipudding	CANN: Resolve soft_max precision issue (#15730)	commit \| commitdiff \| tree
2025-09-02	Jeff Bolz	vulkan: Fix macro parameter order for f32 matmul shader...	commit \| commitdiff \| tree
2025-09-02	rmatif	opencl: add attn sinks support for FA kernels (#15706)	commit \| commitdiff \| tree
2025-09-02	Chenguang Li	CANN: Support eager execution mode under ACL graph...	commit \| commitdiff \| tree
2025-09-02	hipudding	CANN: Support ext_factor in rope (#15710)	commit \| commitdiff \| tree
2025-09-01	Johannes Gäßler	ggml-backend: raise GGML_MAX_SPLIT_INPUTS (#15722)	commit \| commitdiff \| tree
2025-09-01	Gilad S.	vulkan: use memory budget extension to read memory...	commit \| commitdiff \| tree
2025-09-01	Jeff Bolz	vulkan: add missing clamps in new mul_mat_id paths...	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom