git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog
2024-03-22  Georgi Gerganov  common : default --hf-file to --model (#6234)
2024-03-22  fraxy-v  convert-llama2c-to-ggml : enable conversion of GQA...
2024-03-22  Kawrakow  quantize: options for output and token embedding tensor...
2024-03-22  Pierrick Hymbert  llama_model_loader: support multiple split/shard GGUFs...
2024-03-22  Minsoo Cheong  ci: apply concurrency limit for github workflows (...
2024-03-22  Georgi Gerganov  common : add HF arg helpers (#6234)
2024-03-22  Nexesenex  llama : correction of the attn.v.weight quantization...
2024-03-22  Olivier Chafik  tests : conditional python & node json schema tests...
2024-03-22  Olivier Chafik  json-schema-to-grammar : fix order of props + non-str...
2024-03-22  slaren  cuda : add LLAMA_CUDA_NO_PEER_COPY to workaround broken...
2024-03-22  Xiaoyi Chen  readme : add RecurseChat to the list of UIs (#6219)
2024-03-22  Jan Boon  server : fix n_keep always showing as 0 in response...
2024-03-22  Georgi Gerganov  server : enable continuous batching by default (#6231)
2024-03-22  Georgi Gerganov  metal : proper assert for mat-mat memory alignment...
2024-03-22  Vaibhav Srivastav  ci : add CURL flag for the mac builds (#6214)
2024-03-22  Georgi Gerganov  metal : pad n_ctx by 32 (#6177)
2024-03-22  Neo Zhang Jianyu  add blog link (#6222)
2024-03-22  DAN™  Fix params underscore convert to dash. (#6203)
2024-03-21  Jan Boon  server : update readme doc from `slot_id` to `id_slot...
2024-03-21  slaren  cuda : disable host register by default (#6206)
2024-03-21  semidark  Corrected typo to wrong file (#6199)
2024-03-21  Georgi Gerganov  tests : disable system() calls (#6198)
2024-03-21  slaren  cuda : fix LLAMA_CUDA_F16 build (#6197)
2024-03-21  Kawrakow  ggml : same IQ4_NL quantization for CPU/CUDA/Metal...
2024-03-21  Olivier Chafik  json-schema-to-grammar improvements (+ added to server...
2024-03-21  Vaibhav Srivastav  ci : fix indentation error (#6195)
2024-03-21  Vaibhav Srivastav  build : add mac pre-build binaries (#6182)
2024-03-21  Kawrakow  Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized...
2024-03-21  AidanBeltonS  Add nvidia and amd backends (#6157)
2024-03-21  slaren  cuda : fix conflict with std::swap (#6186)
2024-03-20  slaren  cuda : print the returned error when CUDA initializatio...
2024-03-20  Ziang Wu  llava : update MobileVLM-README.md (#6180)
2024-03-20  Ziang Wu  llava : add MobileVLM_V2 backup (#6175)
2024-03-20  slaren  cuda : refactor to remove global resources (#6170)
2024-03-20  Xuan Son Nguyen  Server: version bump for httplib and json (#6169)
2024-03-20  Georgi Gerganov  gitignore : ignore curl-related files
2024-03-20  Georgi Gerganov  server : allow to override -ngl in tests (#6170)
2024-03-20  Georgi Gerganov  Revert "llava : add a MobileVLM_V2-1.7B backup (#6152)"
2024-03-20  Ziang Wu  llava : add a MobileVLM_V2-1.7B backup (#6152)
2024-03-20  Karthick  Server: Handle n_keep parameter in the request (#6174)
2024-03-20  Jared Van Bortel  server tests : more pythonic process management; fix...
2024-03-20  Neo Zhang Jianyu  update readme sycl for new update (#6151)
2024-03-20  Abhilash Majumder  increase igpu cluster limit (#6159)
2024-03-19  DAN™  Remove undeed header file. (#6158)
2024-03-19  Pierrick Hymbert  gguf-split: split and merge gguf per batch of tensors...
2024-03-19  Georgi Gerganov  common : disable repeat penalties by default (#6127)
2024-03-19  slaren  ci : exempt some labels from being tagged as stale...
2024-03-19  DAN™  common : print usage on '-h' and '--help' (#6145)
2024-03-18  github-actions...  flake.lock: Update
2024-03-18  Jared Van Bortel  mpt : implement backwards compatiblity with duped outpu...
2024-03-18  Felix  clip : fix memory leak (#6138)
2024-03-18  slaren  backend : set max split inputs to GGML_MAX_SRC (#6137)
2024-03-18  Georgi Gerganov  ci : disable stale issue messages (#6126)
2024-03-18  Georgi Gerganov  ci : temporary disable sanitizer builds (#6128)
2024-03-18  slaren  backend : offload large batches to GPU (#6083)
2024-03-18  DAN™  common : tidy-up argument parsing (#6105)
2024-03-18  Thérence  convert : add support for CamembertModel architecture...
2024-03-18  Romain D  convert : use f32 outtype for bf16 tensors (#6106)
2024-03-17  Pierrick Hymbert  common: llama_load_model_from_url using --model-url...
2024-03-17  Georgi Gerganov  ci : close all stale issues at once (#6115)
2024-03-17  GainLee  ggml:fix finding transfer queue family index error...
2024-03-16  AmirAli Mirian  ggml : add AVX512F SIMD (#6088)
2024-03-16  Daniel Bevenius  gritlm : add initial README.md (#6086)
2024-03-16  Xuan Son Nguyen  readme : add wllama as a wasm binding (#6100)
2024-03-16  DAN™  common : refactor nested if causing error C1061 on...
2024-03-16  Pierrick Hymbert  ci : close inactive issue with workflow (#6053)
2024-03-15  slaren  llama : fix Baichuan2 13B (#6092)
2024-03-15  Theia Vogel  llama : add support for control vectors (#5970)
2024-03-15  Andrew Canis  llama : add Command-R support (#6033)
2024-03-15  Ting Lou  llava : change API to pure C style for Rust FFI bindgen...
2024-03-15  slaren  cuda : disable unused cudaLaunchHostFunc code (#6078)
2024-03-15  Neo Zhang Jianyu  fix set main gpu error (#6073)
2024-03-15  Georgi Gerganov  make : ggml-metal.o depends on ggml.h
2024-03-15  AidanBeltonS  [SYCL] Fix non-intel device selection (#6042)
2024-03-15  Ondřej Čertík  gguf : add support for I64 and F64 arrays (#6062)
2024-03-15  Xuan Son Nguyen  llama : add Orion chat template (#6066)
2024-03-15  slaren  llama-bench : use random tokens to improve accuracy...
2024-03-14  Georgi Gerganov  llama : fix integer overflow during quantization (...
2024-03-14  Steve Grubb  gguf : fix resource leaks (#6061)
2024-03-14  Ondřej Čertík  gguf-py : bump version to 0.8.0 (#6060)
2024-03-14  Michael Podvitskiy  llama : support models without vocabulary (#5798)
2024-03-14  Georgi Gerganov  embedding : add EOS token if not present (#899)
2024-03-14  Georgi Gerganov  gguf-py : fix dtype check (#6045)
2024-03-14  Jian Liao  readme : improve readme for Llava-1.6 example (#6044)
2024-03-14  Pierrick Hymbert  server: disable debug release type sanitizer, simplify...
2024-03-14  Georgi Gerganov  llama : fix typo
2024-03-14  Michael Podvitskiy  llama : optimize defrag moves + fix fragmentation calcu...
2024-03-14  Ondřej Čertík  gguf-py : add support for I8, I16 and I32 (#6045)
2024-03-14  Georgi Gerganov  ggml : designate enum vals for integer types (#6050)
2024-03-14  Georgi Gerganov  embedding : print all resulting embeddings (#899)
2024-03-14  Georgi Gerganov  metal : build metallib + fix embed path (#6015)
2024-03-14  Georgi Gerganov  embedding : print cosine similarity (#899)
2024-03-13  Linwei Wang  readme : update details about running llama in Termux...
2024-03-13  Georgi Gerganov  readme : update API changes and hot topics
2024-03-13  Clint Herron  grammar : handle missing "root" node (#6004)
2024-03-13  slaren  llama : add pipeline parallelism support (#6017)
2024-03-13  slaren  test-backend-ops : skip CPU backend by default (#6028)
2024-03-13  AidanBeltonS  Update get version (#6025)
2024-03-13  Xuan Son Nguyen  Server: Use multi-task for embeddings endpoint (#6001)
2024-03-12  slaren  ci : remove tidy-review (#6021)