git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog
2024-05-28  Masaya, Kato         ggml: aarch64: SVE kernels for q8_0_q8_0, q4_0_q8_0...
2024-05-28  Georgi Gerganov      ggml : silence UB sanitizer error during iq2_xxs quanti...
2024-05-28  Georgi Gerganov      ggml : remove ggml_flash_attn and ggml_flash_ff (llama...
2024-05-28  Georgi Gerganov      ggml : drop support for QK_K=64 (llama/7473)
2024-05-28  0cc4m                Update vulkan rope implementation to support frequency...
2024-05-28  Johannes Gäßler      CUDA: fix FA out-of-bounds reads (llama/7479)
2024-05-28  Johannes Gäßler      CUDA: fix FA out-of-bounds writes (llama/7465)
2024-05-28  Georgi Gerganov      cuda : fix compile warning (llama/7454)
2024-05-28  Johannes Gäßler      CUDA: remove incorrect precision check (llama/7454)
2024-05-28  Georgi Gerganov      cuda : fix rope + add tests (llama/7452)
2024-05-28  liuwei-git           llama : add phi3 128K model support (llama/7225)
2024-05-28  Georgi Gerganov      metal : handle F16 inf values, fix FA partial offload...
2024-05-28  Johannes Gäßler      CUDA: fix unused warning in mmq.cu (llama/7442)
2024-05-28  Johannes Gäßler      CUDA: deduplicate mmq code (llama/7397)
2024-05-28  Radoslav Gerganov    rpc : track allocated buffers (llama/7411)
2024-05-28  AidanBeltonS         Update SYCL upscale operation (llama/7321)
2024-05-28  Herman Semenov       ggml-opencl, llama: using reserve() if count already...
2024-05-28  junchao-loongson     ggml : add loongarch lsx and lasx support (llama/6454)
2024-05-28  Srihari-mcw          Add provisions for windows support for BF16 code includ...
2024-05-28  0cc4m                Vulkan Embedding Fix (llama/7360)
2024-05-28  slaren               ggml : fix another case of quants nans (llama/7387)
2024-05-28  Johannes Gäßler      ggml: implement quantized KV cache for FA (llama/7372)
2024-05-28  slaren               cuda : clear error after buffer allocation failure...
2024-05-28  fraxy-v              Capture CUDA logging output (llama/7298)
2024-05-28  Georgi Gerganov      android : use "ci-android" branch for CI (llama/7341)
2024-05-28  Johannes Gäßler      CUDA: deduplicate FlashAttention code (llama/7352)
2024-05-28  Engininja2           cuda : add half2 __shfl_xor() for ROCm 5.5 (llama/7263)
2024-05-28  0cc4m                Update and fix Vulkan soft_max and argsort implementati...
2024-05-28  slaren               ggml : fix quants nans when all the group weights are...
2024-05-28  Johannes Gäßler      CUDA: faster large batch FA without tensor cores (llama...
2024-05-28  Radoslav Gerganov    rpc : set SO_REUSEADDR for the server socket (llama...
2024-05-28  Herman Semenov       ggml-quants, llama : removed excess checks (llama/7274)
2024-05-28  Justine Tunney       ggml : rewrite silu and softmax for cpu (llama/7154)
2024-05-28  Radoslav Gerganov    rpc : add command line arg for specifying backend memory
2024-05-28  Max Krasnyansky      Add support for properly optimized Windows ARM64 builds...
2024-05-28  kunnis               ggml : use dynamic thread scheduling for matrix multipl...
2024-05-28  agray3               Avoid unnecessarily disabling CUDA graphs (llama/7302)
2024-05-28  slaren               ggml : tag ggml_tensor::backend as deprecated (llama...
2024-05-28  AidanBeltonS         Add missing " (llama/7303)
2024-05-25  Andrei               cmake : add Vulkan build (#730)
2024-05-24  compilade            gguf : use Qn_K for k-quants instead of KQn (#837)
2024-05-19  Brian                gguf.md: add sharding to naming convention (#826)
2024-05-17  Andrei               Add ggml rpc to cmake (#827)
2024-05-17  Brian                gguf.md: Add GGUF Naming Convention Section (#822)
2024-05-15  John Balis           ggml : add `ggml_upscale_ext` (#814)
2024-05-15  Georgi Gerganov      sync : whisper.cpp
2024-05-15  Georgi Gerganov      whisper : use flash attention (whisper/2152)
2024-05-14  Georgi Gerganov      sync : llama.cpp
2024-05-14  Georgi Gerganov      metal : support FA without mask + add asserts (llama...
2024-05-14  Radoslav Gerganov    ggml : add RPC backend (llama/6829)
2024-05-14  Neo Zhang            rm wait() (llama/7233)
2024-05-14  Johannes Gäßler      CUDA: add FP32 FlashAttention vector kernel (llama...
2024-05-14  Georgi Gerganov      scripts : sync ggml-rpc
2024-05-14  Georgi Gerganov      sync : whisper.cpp
2024-05-14  thewh1teagle         whisper : fix model path encoding in windows (whisper...
2024-05-14  Daniel Ziegenberg    main : dont print timings with --no-prints (whisper...
2024-05-14  Daniel Ziegenberg    main : add options for temperature control (whisper...
2024-05-14  Georgi Gerganov      whisper : switch back to F32 mask (whisper/0)
2024-05-14  mashizora            main : fix double quote escaping in csv output (whisper...
2024-05-14  Georgi Gerganov      metal : tune soft_max number of threads (whisper/0)
2024-05-14  Georgi Gerganov      whisper : remove old flash attn code (whisper/0)
2024-05-14  Georgi Gerganov      ggml : try fix ppc64 (whisper/0)
2024-05-14  Przemysław...        ggml : expose SSE3 and SSSE3 for MSVC when AVX is avail...
2024-05-14  goldwaving           Remove unnecessary memory reallocation in fft (whisper...
2024-05-14  Georgi Gerganov      whisper : more prominent log message for sub-1s audio...
2024-05-14  Georgi Gerganov      main : pass nullptr when regex is empty (whisper/2070)
2024-05-14  Ikko Eltociear...    whisper : update grammar-parser.cpp (whisper/2058)
2024-05-12  Hong Bo PENG         ggml : optimize for ppc64le using VSX intrinsics (...
2024-05-11  Georgi Gerganov      cuda : remove old alibi sources (#0)
2024-05-11  Georgi Gerganov      metal : fix indent (#0)
2024-05-11  Georgi Gerganov      ggml : restore sigmoid decl order (#0)
2024-05-11  Georgi Gerganov      tests : restore unary tests (#0)
2024-05-11  Georgi Gerganov      mnist : clean whitespace
2024-05-11  Georgi Gerganov      ggml : resolve merge (#0)
2024-05-11  Georgi Gerganov      sync : llama.cpp
2024-05-11  Georgi Gerganov      ggml : full ALiBi support (llama/7192)
2024-05-11  Georgi Gerganov      metal : fix flash attention kernel requirements (llama...
2024-05-11  Ouadie EL FAROUKI    Minor arithmetic improvement to mmvq wrapper kernel...
2024-05-11  0cc4m                Vulkan Bugfixes and Improvements (llama/7084)
2024-05-11  Johannes Gäßler      CUDA: generalize FP16 fattn vec kernel (llama/7061)
2024-05-11  Albert Jin           opencl : alignment size converted from bits to bytes...
2024-05-11  agray3               Introduction of CUDA Graphs to LLama.cpp (llama/6766)
2024-05-11  Gilad S              metal : use `vm_allocate` instead of `posix_memalign...
2024-05-11  Justine Tunney       ggml : introduce bfloat16 support (llama/6412)
2024-05-11  Georgi Gerganov      metal : fix unused warning
2024-05-11  William Tambellini   Add an option to build without CUDA VMM (llama/7067)
2024-05-11  Xuan Son Nguyen      gguf-split: add --no-tensor-first-split (llama/7072)
2024-05-11  Johannes Gäßler      CUDA: CUDART < 11.7 workaround for __hmax, __hmax2...
2024-05-11  Kevin Gibbons        switch to using localizedDescription (llama/7010)
2024-05-11  Georgi Gerganov      metal : remove deprecated error code (llama/7008)
2024-05-11  Kevin Gibbons        metal : log more info on error (llama/6987)
2024-05-11  Georgi Gerganov      ggml : add Flash Attention (llama/5021)
2024-05-11  Georgi Gerganov      ggml : fix __MSC_VER -> _MSC_VER (llama/6977)
2024-05-11  DAN™                 Fix more int overflow during quant (PPL/CUDA). (llama...
2024-05-11  Xuan Son Nguyen      gguf : enforce that tensor names are unique (llama...
2024-05-11  Neo Zhang            add device version in device list (llama/6959)
2024-05-11  agray3               Reset schedule earlier to allow overlap with ggml graph...
2024-05-11  slaren               add basic tensor data validation function (llama/6884)
2024-05-11  slaren               gguf : fix mismatch between alloc and free functions...
2024-05-11  Georgi Gerganov      Merge pull request from GHSA-p5mv-gjc5-mwqv