git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2024-03-27	slaren	ggml : fix bounds checking of zero size views (#6347)	commit \| commitdiff \| tree
2024-03-27	Georgi Gerganov	make : whitespace	commit \| commitdiff \| tree
2024-03-27	howlger	embedding : show full embedding for single prompt ...	commit \| commitdiff \| tree
2024-03-27	AidanBeltonS	[SYCL] Fix batched impl for NVidia GPU (#6164)	commit \| commitdiff \| tree
2024-03-27	Kawrakow	Make IQ1_M work for QK_K = 64 (#6327)	commit \| commitdiff \| tree
2024-03-27	Sigbjørn Skjæret	common : change --no-penalize-nl to --penalize-nl ...	commit \| commitdiff \| tree
2024-03-27	Georgi Gerganov	llama2c : open file as binary (#6332)	commit \| commitdiff \| tree
2024-03-27	Mateusz Charytoniuk	readme : add php api bindings (#6326)	commit \| commitdiff \| tree
2024-03-27	Eric Zhang	server: public: use relative routes for static files...	commit \| commitdiff \| tree
2024-03-27	Neo Zhang Jianyu	[SYCL] fix no file in win rel (#6314)	commit \| commitdiff \| tree
2024-03-26	Jared Van Bortel	wpm : portable unicode tolower (#6305)	commit \| commitdiff \| tree
2024-03-26	compilade	llama : greatly reduce output buffer memory usage ...	commit \| commitdiff \| tree
2024-03-26	Kawrakow	IQ1_M: 1.75 bpw quantization (#6302)	commit \| commitdiff \| tree
2024-03-26	Pedro Cuenca	convert-hf : fix exception in sentencepiece with added...	commit \| commitdiff \| tree
2024-03-26	Kawrakow	quantize : be able to override metadata by key (#6321)	commit \| commitdiff \| tree
2024-03-26	Minsoo Cheong	embedding : adjust `n_ubatch` value (#6296)	commit \| commitdiff \| tree
2024-03-26	Jan Boon	server : add `n_discard` parameter (#6300)	commit \| commitdiff \| tree
2024-03-26	Joseph Stahl	nix: make `xcrun` visible in Nix sandbox for precompili...	commit \| commitdiff \| tree
2024-03-26	slaren	cuda : rename build flag to LLAMA_CUDA (#6299)	commit \| commitdiff \| tree
2024-03-25	Christian Kögler	nix: fix blas support (#6281)	commit \| commitdiff \| tree
2024-03-25	Kawrakow	tests : include IQ2_XXS and IQ2_XS in test-quantize...	commit \| commitdiff \| tree
2024-03-25	Georgi Gerganov	flake.lock: Update (#6266)	commit \| commitdiff \| tree
2024-03-25	slaren	cuda : fix LLAMA_CUDA_F16 build (#6298)	commit \| commitdiff \| tree
2024-03-25	slaren	cuda : refactor into multiple files (#6269)	commit \| commitdiff \| tree
2024-03-25	Xuan Son Nguyen	Server: clean up OAI params parsing function (#6284)	commit \| commitdiff \| tree
2024-03-25	Neo Zhang Jianyu	[SYCL] fix SYCL backend build on windows is break by...	commit \| commitdiff \| tree
2024-03-25	Minsoo Cheong	examples : add "retrieval" (#6193)	commit \| commitdiff \| tree
2024-03-25	Justine Tunney	ggml : support AVX512VNNI (#6280)	commit \| commitdiff \| tree
2024-03-24	Rick G	Fix heap corruption from wmode out-of-bound writes...	commit \| commitdiff \| tree
2024-03-24	Georgi Gerganov	imatrix : fix wname for mul_mat_id ops (#6271)	commit \| commitdiff \| tree
2024-03-24	Johannes Gäßler	Fixed lookup compilation issues on Windows (#6273)	commit \| commitdiff \| tree
2024-03-24	Pierrick Hymbert	ci : close inactive issue, increase operations per...	commit \| commitdiff \| tree
2024-03-24	Minsoo Cheong	sampling : deduplicated code for probability distributi...	commit \| commitdiff \| tree
2024-03-24	Meng, Hengyu	[SYCL] offload op (#6217)	commit \| commitdiff \| tree
2024-03-24	Neo Zhang Jianyu	Support build win release for SYCL (#6241)	commit \| commitdiff \| tree
2024-03-23	Jared Van Bortel	use _wfopen instead of fopen on Windows (#6248)	commit \| commitdiff \| tree
2024-03-23	Georgi Gerganov	gitignore : gguf-split	commit \| commitdiff \| tree
2024-03-23	Pierrick Hymbert	common: llama_load_model_from_url split support (...	commit \| commitdiff \| tree
2024-03-23	Pierrick Hymbert	server: docs: `--threads` and `--threads`, `--ubatch...	commit \| commitdiff \| tree
2024-03-23	Julius Arkenberg	llama : add grok-1 support (#6204)	commit \| commitdiff \| tree
2024-03-23	Pierrick Hymbert	split: add gguf-split in the make build target (#6262)	commit \| commitdiff \| tree
2024-03-23	Pierrick Hymbert	server: flush stdout after logging in both text and...	commit \| commitdiff \| tree
2024-03-23	Johannes Gäßler	lookup: complement data from context with general text...	commit \| commitdiff \| tree
2024-03-22	Georgi Gerganov	common : default --hf-file to --model (#6234)	commit \| commitdiff \| tree
2024-03-22	fraxy-v	convert-llama2c-to-ggml : enable conversion of GQA...	commit \| commitdiff \| tree
2024-03-22	Kawrakow	quantize: options for output and token embedding tensor...	commit \| commitdiff \| tree
2024-03-22	Pierrick Hymbert	llama_model_loader: support multiple split/shard GGUFs...	commit \| commitdiff \| tree
2024-03-22	Minsoo Cheong	ci: apply concurrency limit for github workflows (...	commit \| commitdiff \| tree
2024-03-22	Georgi Gerganov	common : add HF arg helpers (#6234)	commit \| commitdiff \| tree
2024-03-22	Nexesenex	llama : correction of the attn.v.weight quantization...	commit \| commitdiff \| tree
2024-03-22	Olivier Chafik	tests : conditional python & node json schema tests...	commit \| commitdiff \| tree
2024-03-22	Olivier Chafik	json-schema-to-grammar : fix order of props + non-str...	commit \| commitdiff \| tree
2024-03-22	slaren	cuda : add LLAMA_CUDA_NO_PEER_COPY to workaround broken...	commit \| commitdiff \| tree
2024-03-22	Xiaoyi Chen	readme : add RecurseChat to the list of UIs (#6219)	commit \| commitdiff \| tree
2024-03-22	Jan Boon	server : fix n_keep always showing as 0 in response...	commit \| commitdiff \| tree
2024-03-22	Georgi Gerganov	server : enable continuous batching by default (#6231)	commit \| commitdiff \| tree
2024-03-22	Georgi Gerganov	metal : proper assert for mat-mat memory alignment...	commit \| commitdiff \| tree
2024-03-22	Vaibhav Srivastav	ci : add CURL flag for the mac builds (#6214)	commit \| commitdiff \| tree
2024-03-22	Georgi Gerganov	metal : pad n_ctx by 32 (#6177)	commit \| commitdiff \| tree
2024-03-22	Neo Zhang Jianyu	add blog link (#6222)	commit \| commitdiff \| tree
2024-03-22	DAN™	Fix params underscore convert to dash. (#6203)	commit \| commitdiff \| tree
2024-03-21	Jan Boon	server : update readme doc from `slot_id` to `id_slot...	commit \| commitdiff \| tree
2024-03-21	slaren	cuda : disable host register by default (#6206)	commit \| commitdiff \| tree
2024-03-21	semidark	Corrected typo to wrong file (#6199)	commit \| commitdiff \| tree
2024-03-21	Georgi Gerganov	tests : disable system() calls (#6198)	commit \| commitdiff \| tree
2024-03-21	slaren	cuda : fix LLAMA_CUDA_F16 build (#6197)	commit \| commitdiff \| tree
2024-03-21	Kawrakow	ggml : same IQ4_NL quantization for CPU/CUDA/Metal...	commit \| commitdiff \| tree
2024-03-21	Olivier Chafik	json-schema-to-grammar improvements (+ added to server...	commit \| commitdiff \| tree
2024-03-21	Vaibhav Srivastav	ci : fix indentation error (#6195)	commit \| commitdiff \| tree
2024-03-21	Vaibhav Srivastav	build : add mac pre-build binaries (#6182)	commit \| commitdiff \| tree
2024-03-21	Kawrakow	Add ability to use Q5_0, Q5_1, and IQ4_NL for quantized...	commit \| commitdiff \| tree
2024-03-21	AidanBeltonS	Add nvidia and amd backends (#6157)	commit \| commitdiff \| tree
2024-03-21	slaren	cuda : fix conflict with std::swap (#6186)	commit \| commitdiff \| tree
2024-03-20	slaren	cuda : print the returned error when CUDA initializatio...	commit \| commitdiff \| tree
2024-03-20	Ziang Wu	llava : update MobileVLM-README.md (#6180)	commit \| commitdiff \| tree
2024-03-20	Ziang Wu	llava : add MobileVLM_V2 backup (#6175)	commit \| commitdiff \| tree
2024-03-20	slaren	cuda : refactor to remove global resources (#6170)	commit \| commitdiff \| tree
2024-03-20	Xuan Son Nguyen	Server: version bump for httplib and json (#6169)	commit \| commitdiff \| tree
2024-03-20	Georgi Gerganov	gitignore : ignore curl-related files	commit \| commitdiff \| tree
2024-03-20	Georgi Gerganov	server : allow to override -ngl in tests (#6170)	commit \| commitdiff \| tree
2024-03-20	Georgi Gerganov	Revert "llava : add a MobileVLM_V2-1.7B backup (#6152)"	commit \| commitdiff \| tree
2024-03-20	Ziang Wu	llava : add a MobileVLM_V2-1.7B backup (#6152)	commit \| commitdiff \| tree
2024-03-20	Karthick	Server: Handle n_keep parameter in the request (#6174)	commit \| commitdiff \| tree
2024-03-20	Jared Van Bortel	server tests : more pythonic process management; fix...	commit \| commitdiff \| tree
2024-03-20	Neo Zhang Jianyu	update readme sycl for new update (#6151)	commit \| commitdiff \| tree
2024-03-20	Abhilash Majumder	increase igpu cluster limit (#6159)	commit \| commitdiff \| tree
2024-03-19	DAN™	Remove undeed header file. (#6158)	commit \| commitdiff \| tree
2024-03-19	Pierrick Hymbert	gguf-split: split and merge gguf per batch of tensors...	commit \| commitdiff \| tree
2024-03-19	Georgi Gerganov	common : disable repeat penalties by default (#6127)	commit \| commitdiff \| tree
2024-03-19	slaren	ci : exempt some labels from being tagged as stale...	commit \| commitdiff \| tree
2024-03-19	DAN™	common : print usage on '-h' and '--help' (#6145)	commit \| commitdiff \| tree
2024-03-18	github-actions...	flake.lock: Update	commit \| commitdiff \| tree
2024-03-18	Jared Van Bortel	mpt : implement backwards compatiblity with duped outpu...	commit \| commitdiff \| tree
2024-03-18	Felix	clip : fix memory leak (#6138)	commit \| commitdiff \| tree
2024-03-18	slaren	backend : set max split inputs to GGML_MAX_SRC (#6137)	commit \| commitdiff \| tree
2024-03-18	Georgi Gerganov	ci : disable stale issue messages (#6126)	commit \| commitdiff \| tree
2024-03-18	Georgi Gerganov	ci : temporary disable sanitizer builds (#6128)	commit \| commitdiff \| tree
2024-03-18	slaren	backend : offload large batches to GPU (#6083)	commit \| commitdiff \| tree
2024-03-18	DAN™	common : tidy-up argument parsing (#6105)	commit \| commitdiff \| tree
2024-03-18	Thérence	convert : add support for CamembertModel architecture...	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom