git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog


2025-04-09	Georgi Gerganov	readme : add rpc backend (#12842)
2025-04-09	Chenguang Li	CANN: Support Opt CONV_TRANSPOSE_1D and ELU (#12786)
2025-04-09	Jeff Bolz	vulkan: In coopmat2 mmq, load q4_k/q5_k scales through...
2025-04-09	Jeff Bolz	vulkan: Use fp16 for the flash attention P*V multiplica...
2025-04-08	Sigbjørn Skjæret	cuda : add f32 to bf16 copy op (#12806)
2025-04-08	Matt Clayton	llava: improve clip_ctx destructor to not memleak load_...
2025-04-08	Georgi Gerganov	llama : fix FA when KV cache is not used (i.e. embeddin...
2025-04-08	Xuan-Son Nguyen	server : fix thread.join() on exit (#12831)
2025-04-08	dm4	llava: add more helper functions to check projector...
2025-04-08	Prajwal B Mehendarkar	arg : Including limits file on AIX (#12822)
2025-04-08	characharm	server : webui : Improve Chat Input with Auto-Sizing...
2025-04-08	Neo Zhang Jianyu	Revert "sycl:remove redundant memcopy in function ggml_...
2025-04-08	compilade	gguf-py : support lazy tensor splitting (#12809)
2025-04-07	Xuan-Son Nguyen	llama : Support llama 4 text-only (#12791)
2025-04-07	lhez	opencl: better identify Adreno GPU (#12760)
2025-04-07	stduhpf	hellaswag: display estimated score confidence interval...
2025-04-07	Georgi Gerganov	cuda : fix HIP and MUSA BF16 (#0)
2025-04-07	Georgi Gerganov	sync : ggml
2025-04-07	Georgi Gerganov	ggml : simplify Arm fp16 CPU logic (ggml/1177)
2025-04-07	Sigbjørn Skjæret	CUDA: don't convert BF16 weights to FP32 (ggml/1174)
2025-04-07	cmdr2	cpu: move all the operators into a separate c++ file...
2025-04-07	zhouwg	sycl: remove redundant memcopy in function ggml_backend...
2025-04-07	Xuan-Son Nguyen	ci : no curl on ggml-ci (#12796)
2025-04-07	Xuan-Son Nguyen	cmake : enable curl by default (#12761)
2025-04-07	zhouwg	CANN: fix typo in ggml-cann (#12733)
2025-04-07	hipudding	CANN: Refactor to reduce duplicate code (#12731)
2025-04-06	R0CKSTAR	musa: fix compilation warnings in mp_22/31 (#12780)
2025-04-06	Jeff Bolz	vulkan: fix NaN issue in flash attention shader (#12776)
2025-04-06	Jeff Bolz	vulkan: Use unclamped loads for flash attention mask...
2025-04-05	0cc4m	Vulkan: Tune Vulkan mmq int dot shader for performance...
2025-04-05	Sergey Fedorov	common : fix includes in arg.cpp and gemma3-cli.cpp...
2025-04-05	Xuan-Son Nguyen	clip : refactor clip_init, add tests (#12757)
2025-04-05	エシュナヴァリシア	common: custom hf endpoint support (#12769)
2025-04-04	Olivier Chafik	sync: minja (#12739)
2025-04-04	Georgi Gerganov	kv-cache : simplify + fix warning for recurrent models...
2025-04-04	bandoti	ci: add Linux cross-compile build (#12428)
2025-04-04	Nauful Shaikh	server : webui : Upgrade daisyui, tailwindcss. (#12735)
2025-04-04	nick huang	gguf-split : --merge now respects --dry-run option...
2025-04-04	Nicolò Scipione	sycl: allow ggml-sycl configuration and compilation...
2025-04-04	Ronny Brendel	cmake: fix ggml-shaders-gen compiler paths containing...
2025-04-04	Daniel Bevenius	docs : add XCFramework section to README.md [no ci...
2025-04-04	Jeff Bolz	vulkan: Hybrid waitForFences/getFenceStatus to reduce...
2025-04-04	Jeff Bolz	vulkan: set cmake minimum and project name in vulkan...
2025-04-04	lhez	opencl: update doc for OpenCL (#12702)
2025-04-03	Gaurav Garg	CUDA: Prefer vector flash decoding kernel for Gemma...
2025-04-03	yumeyao	vocab : use string_view::find() to avoid unnecessary...
2025-04-03	Jeff Bolz	vulkan: Fix missing cmake logic for dot product extensi...
2025-04-03	Atharva Dubey	ci : add env variable in ggml-ci and document the same...
2025-04-03	R0CKSTAR	sync : minja (inclusionAI/Ling) and update tests (...
2025-04-03	a3sh	fix MUSA compiler warning (#12704)
2025-04-03	Chenguang Li	CANN: Support operator SIN COS ARGMAX (#12709)
2025-04-03	Alan Gray	Simplify and improve CUDA graphs through use of indirec...
2025-04-03	hipudding	CANN: Fix failed test cases (#12708)
2025-04-03	lhez	opencl: use `max_alloc_size` in backend ctx instead...
2025-04-02	Jeff Bolz	vulkan: Implement split_k for coopmat2 flash attention...
2025-04-02	bandoti	cmake: remove caching from vulkan coopmat checks (...
2025-04-02	Jeff Bolz	vulkan: Implement grouped query attention in the coopma...
2025-04-02	0cc4m	Vulkan: Fix mmq int dot float cache size (#12722)
2025-04-02	Georgi Gerganov	model : print tensor size during load (#12711)
2025-04-02	Diego Devesa	llama : add option to override model tensor buffers... upstream/0.0.5028
2025-04-02	Georgi Gerganov	llama : refactor kv cache guard (#12695)
2025-04-02	Sigbjørn Skjæret	vocab : BailingMoE : change possessive quantifiers...
2025-04-02	Xuan-Son Nguyen	common : remove json.hpp from common.cpp (#12697)
2025-04-02	Chenguang Li	[CANN] get_rows and dup optimization (#12671)
2025-04-01	Xuan-Son Nguyen	common : refactor downloading system, handle mmproj...
2025-04-01	Junil Kim	opencl : fix memory allocation size (#12649)
2025-04-01	jklincn	llama : use LLM_KV_GENERAL_FILE_TYPE instead of gguf_fi...
2025-04-01	Sigbjørn Skjæret	convert : BailingMoE : fix qkv split when head_dim...
2025-04-01	Georgi Gerganov	metal : use F32 prec in FA kernels (#12688)
2025-04-01	R0CKSTAR	Fix clang warning in gguf_check_reserved_keys (#12686)
2025-04-01	Wagner Bruna	vulkan: fix build when glslc doesn't support coopmat...
2025-04-01	Romain Biessy	SYCL: Rename oneMKL to oneMath (#12192)
2025-04-01	Akarshan Biswas	SYCL: switch to SYCL namespace (#12674)
2025-03-31	Sigbjørn Skjæret	convert : BailingMoE : avoid setting rope_dim to 0...
2025-03-31	Daniel Bevenius	vocab : add special infill tokens for CodeLlama (#11850)
2025-03-31	a3sh	ggml : faster ssm scan (#10558)
2025-03-31	Sigbjørn Skjæret	convert : Qwerky : use lora_rank_tokenshift and lora_ra...
2025-03-31	0cc4m	Vulkan: Add DP4A MMQ and Q8_1 quantization shader ...
2025-03-31	Georgi Gerganov	cmake : fix whitespace (#0)
2025-03-31	Georgi Gerganov	sync : ggml
2025-03-31	Sandro Hanea	cmake: improve Vulkan cooperative matrix support checks...
2025-03-31	Sigbjørn Skjæret	llava : proper description fix (#12668)
2025-03-31	Akarshan Biswas	SYCL: Remove misleading ggml_sycl_op_flatten function...
2025-03-31	Sigbjørn Skjæret	llava : fix clip loading GGUFs with missing description...
2025-03-31	marcoStocchi	tts : remove printfs (#12640)
2025-03-30	Sigbjørn Skjæret	llama : support BailingMoE (Ling) (#12634)
2025-03-30	Georgi Gerganov	metal : use constexpr in FA kernels + fix typedef ...
2025-03-30	Juyoung Suk	llama : add Trillion 7B model support (#12556)
2025-03-30	Sergei Vorobyov	llama-chat : Add Yandex instruct model template support...
2025-03-30	R0CKSTAR	musa: fix all warnings, re-enable `-DLLAMA_FATAL_WARNIN...
2025-03-30	Georgi Gerganov	sync : ggml
2025-03-30	Xuan-Son Nguyen	cpu : rm unused variable (ggml/1166)
2025-03-30	cmdr2	cpu: de-duplicate some of the operators and refactor...
2025-03-30	Daniel Bevenius	ggml : add logging for native build options/vars (whisp...
2025-03-30	Daniel Bevenius	examples : command.wasm updates (whisper/2904)
2025-03-29	Xuan-Son Nguyen	llama : fix non-causal mask for gemma 3 (#12615)
2025-03-29	Djip007	llama : change cpu_buft_list order: ACCEL -> GPU host...
2025-03-29	Jay	cmake : fix ccache conflict (#12522)
2025-03-29	hipudding	CANN : remove clang-format in ggml-cann (#12607)
2025-03-28	Sigbjørn Skjæret	llama : fix incorrect Qwen2Moe ffn_moe_out graph callba...

Packaging of ggml-org/llama.cpp
