git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/ggml/shortlog

overview / pkg / ggml / sources / ggml / shortlog

2024-02-12	snadampal	ggml : add mmla kernels for quantized GEMM (llama/4966)	commit \| commitdiff \| tree
2024-02-12	Ian Bull	metal : use autoreleasepool to avoid memory leaks ...	commit \| commitdiff \| tree
2024-02-11	slaren	ggml-alloc : v3 (#727)	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	examples : remove old stuff (#728)	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	sync : whisper.cpp	commit \| commitdiff \| tree
2024-02-10	Didzis Gosko	whisper : expose CUDA device setting in public API...	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	sync : ggml (whisper/0)	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	src : relocate new backend sources	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	ci : fix mpt test	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	tests : fix im2col usage	commit \| commitdiff \| tree
2024-02-10	Michael Podvitskiy	ggml : fix `error C2078: too many initializers` for...	commit \| commitdiff \| tree
2024-02-10	0cc4m	Fix Vulkan crash on APUs with very little device memory...	commit \| commitdiff \| tree
2024-02-10	Johannes Gäßler	CUDA: more warps for mmvq on NVIDIA (llama/5394)	commit \| commitdiff \| tree
2024-02-10	Abhilash Majumder	Fix f16_sycl cpy call from Arc (llama/5411)	commit \| commitdiff \| tree
2024-02-10	Johannes Gäßler	CUDA: fixed mmvq kernel for bs 2,3,4 and -sm row (llama...	commit \| commitdiff \| tree
2024-02-10	0cc4m	Basic Vulkan Multi-GPU implementation (llama/5321)	commit \| commitdiff \| tree
2024-02-10	Johannes Gäßler	CUDA: mul_mat_vec_q max. batch size 8 -> 4 (llama/5370)	commit \| commitdiff \| tree
2024-02-10	Kawrakow	Slight quantization improvement for Q4_K and Q5_K ...	commit \| commitdiff \| tree
2024-02-10	Johannes Gäßler	CUDA: mul_mat_vec_q for batch sizes > 1 (llama/5351)	commit \| commitdiff \| tree
2024-02-10	Kawrakow	ggml : make use of ggml-quants.h possible in C++ code...	commit \| commitdiff \| tree
2024-02-10	Dr. Tom Murphy...	ggml : avoid duplicating function calls using MIN/MAX...	commit \| commitdiff \| tree
2024-02-10	Kawrakow	iq2_xxs: tune quantization (llama/5320)	commit \| commitdiff \| tree
2024-02-10	AidanBeltonS	Fix cpy with dims of 3 (llama/5289)	commit \| commitdiff \| tree
2024-02-10	0cc4m	Vulkan Intel Fixes, Optimizations and Debugging Flags...	commit \| commitdiff \| tree
2024-02-10	AidanBeltonS	Fix im2col with 32fp (llama/5286)	commit \| commitdiff \| tree
2024-02-10	AidanBeltonS	Tidy ggml-sycl (llama/5261)	commit \| commitdiff \| tree
2024-02-10	Meng, Hengyu	get MAX_MEM_ALLOC from device property (llama/5270)	commit \| commitdiff \| tree
2024-02-10	Neo Zhang Jianyu	add --no-mmap in llama-bench (llama/5257)	commit \| commitdiff \| tree
2024-02-10	0cc4m	Vulkan Phi Fix for AMD Proprietary Drivers (llama/5260)	commit \| commitdiff \| tree
2024-02-10	slaren	cuda : fix LLAMA_CUDA_F16 (llama/5262)	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	metal : add im2col F32 dst support (llama/5132)	commit \| commitdiff \| tree
2024-02-10	JidongZhang-THU	llava : add MobileVLM support (llama/5132)	commit \| commitdiff \| tree
2024-02-10	Neo Zhang Jianyu	format license text, restore apache license by legal...	commit \| commitdiff \| tree
2024-02-10	slaren	ggml : limit n_threads to the max n_tasks (llama/5238)	commit \| commitdiff \| tree
2024-02-10	0cc4m	Vulkan Fixes (llama/5223)	commit \| commitdiff \| tree
2024-02-10	Jared Van Bortel	kompute : llama-bench support and ggml_cpu_has_kompute...	commit \| commitdiff \| tree
2024-02-09	Michael Podvitskiy	ggml : add abort_callback for cpu backend (#725)	commit \| commitdiff \| tree
2024-01-30	Georgi Gerganov	sync : whisper.cpp	commit \| commitdiff \| tree
2024-01-30	JacobLinCool	common : fix wav buffer detection (whisper/1819)	commit \| commitdiff \| tree
2024-01-30	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2024-01-30	Kawrakow	ggml : fix IQ3_XXS on Metal (llama/5219)	commit \| commitdiff \| tree
2024-01-30	Georgi Gerganov	sync : ggml (llama/0)	commit \| commitdiff \| tree
2024-01-30	Kawrakow	Faster AVX2 dot product for IQ2_XS (llama/5187)	commit \| commitdiff \| tree
2024-01-30	Kawrakow	SOTA 3-bit quants (llama/5196)	commit \| commitdiff \| tree
2024-01-30	0cc4m	Vulkan Windows APU Memory Handling (llama/5199)	commit \| commitdiff \| tree
2024-01-30	Paul Tsochantaris	ggml alloc: Fix for null dereference on alloc failure...	commit \| commitdiff \| tree
2024-01-30	Jared Van Bortel	Nomic Vulkan backend (llama/4456)	commit \| commitdiff \| tree
2024-01-30	slaren	ggml : add max buffer sizes to opencl and metal backend...	commit \| commitdiff \| tree
2024-01-30	Paul Tsochantaris	metal : free metal objects (llama/5161)	commit \| commitdiff \| tree
2024-01-29	Georgi Gerganov	gguf : fix comparison (#715)	commit \| commitdiff \| tree
2024-01-29	John Balis	`ggml_cuda_cpy` support for 4d tensors and float16...	commit \| commitdiff \| tree
2024-01-29	Georgi Gerganov	gguf : add input validation, prevent integer overflows...	commit \| commitdiff \| tree
2024-01-29	Georgi Gerganov	ci : fix yolo URLs + fix metal capture (#712)	commit \| commitdiff \| tree
2024-01-29	Jack Mousseau	metal : add debug capture backend function (#694)	commit \| commitdiff \| tree
2024-01-28	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2024-01-28	Georgi Gerganov	sync : whisper.cpp	commit \| commitdiff \| tree
2024-01-28	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2024-01-28	0cc4m	ggml : add Vulkan backend (llama/2059)	commit \| commitdiff \| tree
2024-01-28	Abhilash Majumder	ggml : add unified SYCL backend for Intel GPUs (llama...	commit \| commitdiff \| tree
2024-01-28	Georgi Gerganov	ggml : minor type fix (int64_t -> size_t)	commit \| commitdiff \| tree
2024-01-27	Georgi Gerganov	sync : whisper.cpp	commit \| commitdiff \| tree
2024-01-27	Georgi Gerganov	common : fix input buffer check (whisper/1812)	commit \| commitdiff \| tree
2024-01-27	Ryan Hitchman	server : implement "verbose_json" format with token...	commit \| commitdiff \| tree
2024-01-27	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2024-01-27	Michael Klimenko	Remove unused data and add fixes (llama/5154)	commit \| commitdiff \| tree
2024-01-27	0cc4m	Add OpenCL add kernel (llama/5151)	commit \| commitdiff \| tree
2024-01-27	slaren	cuda : fix tensor size calculation for non-split buffer...	commit \| commitdiff \| tree
2024-01-27	slaren	ggml-alloc : add 10% margin to the buffer sizes (llama...	commit \| commitdiff \| tree
2024-01-27	snadampal	ggml : update softmax n_task calculation (llama/5126)	commit \| commitdiff \| tree
2024-01-27	Paul Tsochantaris	metal : remove unused `n_buffers` and `buffers` (llama...	commit \| commitdiff \| tree
2024-01-27	Georgi Gerganov	metal : show compile log messages	commit \| commitdiff \| tree
2024-01-27	Engininja2	cuda : fix 2-bit quants on amd hip (llama/5105)	commit \| commitdiff \| tree
2024-01-27	slaren	llama : pre-allocate input tensors in a separate buffer...	commit \| commitdiff \| tree
2024-01-27	Georgi Gerganov	metal : disable support for MUL_MAT F32 x F16	commit \| commitdiff \| tree
2024-01-27	Johannes Gäßler	CUDA: more info when no device code (llama/5088)	commit \| commitdiff \| tree
2024-01-27	Georgi Gerganov	minor : clean-up some warnings and style (llama/5094)	commit \| commitdiff \| tree
2024-01-27	Reinforce-II	ggml : parallelize FP32 conversion when using BLAS...	commit \| commitdiff \| tree
2024-01-27	XiaotaoChen	llava : MobileVLM support (llama/4954)	commit \| commitdiff \| tree
2024-01-27	slaren	llama : run all KQV ops on the CPU with no KV offload...	commit \| commitdiff \| tree
2024-01-27	Kylin	cuda : fix compile error in jetson platform (llama...	commit \| commitdiff \| tree
2024-01-26	Neuman Vong	gpt-2 : clarify instructions for CLBlast on Android...	commit \| commitdiff \| tree
2024-01-26	Judd	ggml : check ggml_add src1 type (#708)	commit \| commitdiff \| tree
2024-01-22	Jack Vial	mnist : add tensorflow and keras to requirements.txt...	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	sync : whisper.cpp	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2024-01-18	Paul Tsochantaris	metal : fix memory leak, dangling pointer and unused...	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	ggml : fix SPM package headers	commit \| commitdiff \| tree
2024-01-17	Judd	readme : add link (#699)	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	sync : llama.cpp	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	metal : update ggml-metal.m from llama.cpp	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	ggml : add IQ2 to test-backend-ops + refactoring (llama...	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	imatrix : offload to GPU support (llama/4957)	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	backend : add eval callback (llama/4935)	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	metal : create autorelease pool during library build...	commit \| commitdiff \| tree
2024-01-17	Kawrakow	ggml : importance matrix support for legacy quants...	commit \| commitdiff \| tree
2024-01-17	Alex Azarov	metal : log `recommendedMaxWorkingSetSize` on iOS 16...	commit \| commitdiff \| tree
2024-01-17	Justine Tunney	ggml : introduce GGML_CALL function annotation (llama...	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	cuda : fix dequantize kernel names (llama/4938)	commit \| commitdiff \| tree
2024-01-17	Kawrakow	CUDA: faster dequantize kernels for Q4_0 and Q4_1 ...	commit \| commitdiff \| tree
next

Packaging of ggml-org/ggml

RSS Atom