git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2024-02-19	Georgi Gerganov	ci : enable -Werror for CUDA builds (#5579)	commit \| commitdiff \| tree
2024-02-19	Georgi Gerganov	make : fix CUDA build (#5580)	commit \| commitdiff \| tree
2024-02-19	valiray	readme : fix typo in README-sycl.md (#5353)	commit \| commitdiff \| tree
2024-02-19	Abhilash Majumder	cmake : remove obsolete sycl compile flags (#5581)	commit \| commitdiff \| tree
2024-02-19	Georgi Gerganov	minor : fix trailing whitespace (#5538)	commit \| commitdiff \| tree
2024-02-19	Daniel Bevenius	llava : avoid changing the original BakLLaVA model...	commit \| commitdiff \| tree
2024-02-19	NawafAlansari	baby-llama : allocate graphs in ggml_context (#5573)	commit \| commitdiff \| tree
2024-02-19	Xuan Son Nguyen	llama : add llama_chat_apply_template() (#5538)	commit \| commitdiff \| tree
2024-02-19	slaren	cuda, metal : fix nans in soft_max (#5574)	commit \| commitdiff \| tree
2024-02-19	Mirko185	readme : update (#5572)	commit \| commitdiff \| tree
2024-02-19	bmwl	ggml : android and old glibc NUMA incompatibility bugfi...	commit \| commitdiff \| tree
2024-02-18	Jared Van Bortel	build : pass all warning flags to nvcc via -Xcompiler...	commit \| commitdiff \| tree
2024-02-18	Georgi Gerganov	ggml : restore vec dot stride arg names (#5453)	commit \| commitdiff \| tree
2024-02-18	Georgi Gerganov	ci : fix wikitext url + compile warnings (#5569)	commit \| commitdiff \| tree
2024-02-18	Georgi Gerganov	metal : fix unused warnings (#0)	commit \| commitdiff \| tree
2024-02-18	Robey Holderith	common, server : surface min_keep as its own parameter...	commit \| commitdiff \| tree
2024-02-18	Pierrick Hymbert	server : slots monitoring endpoint (#5550)	commit \| commitdiff \| tree
2024-02-18	Georgi Gerganov	sampling : do not set min_keep to n_probs (#5564)	commit \| commitdiff \| tree
2024-02-18	Georgi Gerganov	cmake : fix GGML_USE_SYCL typo (#5555)	commit \| commitdiff \| tree
2024-02-18	Pierrick Hymbert	server : enhanced health endpoint (#5548)	commit \| commitdiff \| tree
2024-02-18	Pierrick Hymbert	server : --n-predict option document and cap to max...	commit \| commitdiff \| tree
2024-02-18	Daniel Hiltgen	server : graceful server shutdown (#5244)	commit \| commitdiff \| tree
2024-02-18	Georgi Gerganov	common : fix ub (#5530)	commit \| commitdiff \| tree
2024-02-18	Herman Semenov	ggml, common, examples, tests : fixed type arguments...	commit \| commitdiff \| tree
2024-02-18	Daniel Bevenius	llava : update surgery script to not remove tensors...	commit \| commitdiff \| tree
2024-02-18	Kawrakow	1.5 bit quantization (#5453)	commit \| commitdiff \| tree
2024-02-18	github-actions...	flake.lock: Update	commit \| commitdiff \| tree
2024-02-17	Georgi Gerganov	ggml : add ALiBi support for ggml_soft_max_ext (#5488)	commit \| commitdiff \| tree
2024-02-17	Ananta Bastola	ci : add an option to fail on compile warning (#3952)	commit \| commitdiff \| tree
2024-02-17	clibdev	gitignore : update for CLion IDE (#5544)	commit \| commitdiff \| tree
2024-02-16	Georgi Gerganov	cmake : fix VULKAN and ROCm builds (#5525)	commit \| commitdiff \| tree
2024-02-16	Georgi Gerganov	scripts : add helpers script for bench comparing commit...	commit \| commitdiff \| tree
2024-02-16	Herman Semenov	llava : removed excess free(NULL) operation (#5531)	commit \| commitdiff \| tree
2024-02-16	Herman Semenov	llama : minor fixed return int value (#5529)	commit \| commitdiff \| tree
2024-02-16	Alexey Parfenov	server : add "samplers" param to control the samplers...	commit \| commitdiff \| tree
2024-02-16	Rőczey Barnabás	server : fix system prompt cli (#5516)	commit \| commitdiff \| tree
2024-02-16	bmwl	ggml : add numa options (#5377)	commit \| commitdiff \| tree
2024-02-16	Daniel Bevenius	llava : fix clip-model-is-vision flag in README.md...	commit \| commitdiff \| tree
2024-02-16	Georgi Gerganov	ci : fix BERT model download and convert	commit \| commitdiff \| tree
2024-02-15	Douglas Hanley	Use correct type of pooling for embedding models (...	commit \| commitdiff \| tree
2024-02-15	Georgi Gerganov	clip : fix wrong loop condition	commit \| commitdiff \| tree
2024-02-15	slaren	cuda : print message when initialization fails (#5512)	commit \| commitdiff \| tree
2024-02-15	Georgi Gerganov	scripts : add hf.sh helper script (#5501)	commit \| commitdiff \| tree
2024-02-15	Michaël de...	fix(gguf-py): special tokens are no longer skipped...	commit \| commitdiff \| tree
2024-02-15	Elbios	llava : fix memory management bug (#5491)	commit \| commitdiff \| tree
2024-02-15	John	llaba : hotfix for llava-1.6 image number (#5495)	commit \| commitdiff \| tree
2024-02-15	Neuman Vong	vulkan: Find optimal memory type but with fallback...	commit \| commitdiff \| tree
2024-02-14	Rune	readme : fix typo (#5490)	commit \| commitdiff \| tree
2024-02-14	John	llava : update README.md (#5489)	commit \| commitdiff \| tree
2024-02-14	Michael Podvitskiy	cmake : ARM intrinsics detection for MSVC (#5401)	commit \| commitdiff \| tree
2024-02-14	John	llava : support v1.6 (#5267)	commit \| commitdiff \| tree
2024-02-13	AT	Early return for zero size calls to get_tensor. (#5482)	commit \| commitdiff \| tree
2024-02-13	John	gguf : add python reader example (#5216)	commit \| commitdiff \| tree
2024-02-13	Jared Van Bortel	llama : add support for Nomic Embed (#5468)	commit \| commitdiff \| tree
2024-02-13	Aarni Koskela	llama : allow raw byte in SPM vocabs; don't crash on...	commit \| commitdiff \| tree
2024-02-13	Aarni Koskela	llama : make load error reporting more granular (#5477)	commit \| commitdiff \| tree
2024-02-13	Daniel Bevenius	finetune : rename feed-forward tensors (w1/w2/w3) ...	commit \| commitdiff \| tree
2024-02-13	Georgi Gerganov	tests : multi-thread the tokenizer tests (#5474)	commit \| commitdiff \| tree
2024-02-13	Douglas Hanley	llama : support batched embeddings (#5466)	commit \| commitdiff \| tree
2024-02-13	Johannes Gäßler	make: add error message for bad CUDA version (#5444)	commit \| commitdiff \| tree
2024-02-13	Georgi Gerganov	bert : add tests + fix quantization (#5475)	commit \| commitdiff \| tree
2024-02-13	Georgi Gerganov	tests : disable moe test (#5473)	commit \| commitdiff \| tree
2024-02-13	Kawrakow	ggml-quants : fix compiler warnings (shadow variable...	commit \| commitdiff \| tree
2024-02-12	Georgi Gerganov	llama : fix quantization when tensors are missing ...	commit \| commitdiff \| tree
2024-02-12	Georgi Gerganov	swift : package no longer use ggml dependency (#5465)	commit \| commitdiff \| tree
2024-02-12	Lee	py : fix persimmon `n_rot` conversion (#5460)	commit \| commitdiff \| tree
2024-02-12	Abhilash Majumder	ggml-sycl: Replace 3d ops with macro (#5458)	commit \| commitdiff \| tree
2024-02-12	Daniel Bevenius	llava : remove prog parameter from ArgumentParser ...	commit \| commitdiff \| tree
2024-02-12	Georgi Gerganov	sync : ggml (#5452)	commit \| commitdiff \| tree
2024-02-11	Johannes Gäßler	CUDA: mul_mat_vec_q tiling, refactor mul mat logic...	commit \| commitdiff \| tree
2024-02-11	Douglas Hanley	Add support for BERT embedding models (#5423)	commit \| commitdiff \| tree
2024-02-11	github-actions...	flake.lock: Update	commit \| commitdiff \| tree
2024-02-11	Sergio López	vulkan: only use M-sized matmul on Apple GPUs (#5412)	commit \| commitdiff \| tree
2024-02-11	Alexey Parfenov	common : use enums for sampler types (#5418)	commit \| commitdiff \| tree
2024-02-11	Alexey Parfenov	server : allow to specify tokens as strings in logit_bi...	commit \| commitdiff \| tree
2024-02-11	Georgi Gerganov	main : ctrl+C print timing in non-interactive mode...	commit \| commitdiff \| tree
2024-02-11	Georgi Gerganov	common : fix compile warning	commit \| commitdiff \| tree
2024-02-11	Georgi Gerganov	ggml : fix compile warnings (unused vars) (#4966)	commit \| commitdiff \| tree
2024-02-11	snadampal	ggml : add mmla kernels for quantized GEMM (#4966)	commit \| commitdiff \| tree
2024-02-11	Johannes Gäßler	lookup: add print for drafting performance (#5450)	commit \| commitdiff \| tree
2024-02-11	Xuan Son Nguyen	server : add llama2 chat template (#5425)	commit \| commitdiff \| tree
2024-02-10	Ian Bull	metal : use autoreleasepool to avoid memory leaks ...	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	scripts : update sync scripts with new backends	commit \| commitdiff \| tree
2024-02-10	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-02-10	Michael Podvitskiy	ggml : add abort_callback for cpu backend (ggml/725)	commit \| commitdiff \| tree
2024-02-09	Neuman Vong	vulkan: Set limit for task concurrency (#5427)	commit \| commitdiff \| tree
2024-02-09	Daniel Bevenius	llava : add requirements.txt and update README.md ...	commit \| commitdiff \| tree
2024-02-09	Riley Stewart	server : fix prompt caching for repeated prompts (...	commit \| commitdiff \| tree
2024-02-09	Paul Tsochantaris	llama : do not cap thread count when MoE on CPU (#5419)	commit \| commitdiff \| tree
2024-02-09	Marko Tasic	readme : add JavaScript/Wasm repo (#5415)	commit \| commitdiff \| tree
2024-02-09	Michael Podvitskiy	ggml : fix `error C2078: too many initializers` for...	commit \| commitdiff \| tree
2024-02-09	0cc4m	Fix Vulkan crash on APUs with very little device memory...	commit \| commitdiff \| tree
2024-02-08	Johannes Gäßler	CUDA: more warps for mmvq on NVIDIA (#5394)	commit \| commitdiff \| tree
2024-02-08	slaren	llama : do not print "offloading layers" message in...	commit \| commitdiff \| tree
2024-02-08	Abhilash Majumder	Fix f16_sycl cpy call from Arc (#5411)	commit \| commitdiff \| tree
2024-02-08	Daniel Bevenius	llava : add missing .py, and fix paths in README.md...	commit \| commitdiff \| tree
2024-02-08	Johannes Gäßler	fix trailing whitespace (#5407)	commit \| commitdiff \| tree
2024-02-08	runfuture	llama : fix MiniCPM (#5392)	commit \| commitdiff \| tree
2024-02-08	Daniel Bevenius	llava: fix typo/formatting in README.md (#5405)	commit \| commitdiff \| tree
2024-02-08	Johannes Gäßler	sampling: fix top_k <= 0 (#5388)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom