git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2024-03-12	slaren	ci : remove tidy-review (#6021)	commit \| commitdiff \| tree
2024-03-12	Georgi Gerganov	ggml : reuse quantum structs across backends (#5943)	commit \| commitdiff \| tree
2024-03-12	Georgi Gerganov	ggml : fix UB in IQ2_S and IQ3_S (#6012)	commit \| commitdiff \| tree
2024-03-12	Georgi Gerganov	sycl : update IQ1_S kernels (WIP - not working!) (...	commit \| commitdiff \| tree
2024-03-11	gliptic	grammar : fix unnecessarily retained pointer to rules...	commit \| commitdiff \| tree
2024-03-11	Kawrakow	1.5 bit: we can do even better (#5999)	commit \| commitdiff \| tree
2024-03-11	Georgi Gerganov	llama : more consistent names of count variables (...	commit \| commitdiff \| tree
2024-03-11	Georgi Gerganov	llama : refactor unicode stuff (#5992)	commit \| commitdiff \| tree
2024-03-11	Jakub N	Update server docker image URLs (#5997)	commit \| commitdiff \| tree
2024-03-11	Xuan Son Nguyen	Server: format error to json (#5961)	commit \| commitdiff \| tree
2024-03-11	Michael Podvitskiy	ggml, ci : Windows ARM runner and build fixes (#5979)	commit \| commitdiff \| tree
2024-03-11	Minsoo Cheong	server : maintain chat completion id for streaming...	commit \| commitdiff \| tree
2024-03-11	Gilad S	cmake : fix subdir for `LLAMA_METAL_EMBED_LIBRARY`...	commit \| commitdiff \| tree
2024-03-11	Georgi Gerganov	llama : fix F16/F32 downcast + improve names (#5980)	commit \| commitdiff \| tree
2024-03-11	Kawrakow	Better 1.5 bit quantization (#5971)	commit \| commitdiff \| tree
2024-03-11	Abhilash Majumder	[SYCL] Add q3_s and q1_s (#5886)	commit \| commitdiff \| tree
2024-03-11	AidanBeltonS	[SYCL] Add support for SYCL Nvidia target (#5738)	commit \| commitdiff \| tree
2024-03-10	Georgi Gerganov	metal : move mm_id indices to shared mem (#5982)	commit \| commitdiff \| tree
2024-03-10	Dean	android : fix utf8 decoding error (#5935)	commit \| commitdiff \| tree
2024-03-10	Georgi Gerganov	readme : update hot topics	commit \| commitdiff \| tree
2024-03-10	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-03-10	Georgi Gerganov	ggml : try fix 32-bit arm compat (whisper/1938)	commit \| commitdiff \| tree
2024-03-10	Georgi Gerganov	ggml : remove __constant__ specifier for CUDA tables...	commit \| commitdiff \| tree
2024-03-10	Pierrick Hymbert	server: ci: windows build and tests (#5968)	commit \| commitdiff \| tree
2024-03-10	DAN™	llama : add support for GritLM (#5959)	commit \| commitdiff \| tree
2024-03-10	Clint Herron	grammar : verify parsed state (#5950)	commit \| commitdiff \| tree
2024-03-10	Georgi Gerganov	nix: update flake.lock (#5969)	commit \| commitdiff \| tree
2024-03-09	Pierrick Hymbert	server: benchmark: chat/completions scenario and other...	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	server : print chat template info	commit \| commitdiff \| tree
2024-03-09	slaren	perplexity : support using multiple sequences to allow...	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	readme : update hot topics	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	ggml : fix unnecessary f32 -> f16 -> f32 casts (mmla...	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	server : fix metrics init (#5964)	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	ggml : remove old quantization functions (#5942)	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	server : clarify some items in the readme (#5957)	commit \| commitdiff \| tree
2024-03-09	SeungWon Jeong	server : normalize embeddings (#5956)	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	tests : gitignore ggml-common.h	commit \| commitdiff \| tree
2024-03-09	Alexey Parfenov	server : fix passing prompt as tokens (#5955)	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	ggml : add ggml-common.h to deduplicate shared code...	commit \| commitdiff \| tree
2024-03-09	Georgi Gerganov	server : simplify logic for empty prompts (#5953)	commit \| commitdiff \| tree
2024-03-09	Xuan Son Nguyen	Server: reorganize some http logic (#5939)	commit \| commitdiff \| tree
2024-03-09	Gabe Goodhart	server : add SSL support (#5926)	commit \| commitdiff \| tree
2024-03-09	Pierrick Hymbert	server: tests: add truncated prompt tests, better kv...	commit \| commitdiff \| tree
2024-03-08	compilade	llama : support Mamba Selective State Space Models...	commit \| commitdiff \| tree
2024-03-08	compilade	llama : fix quantization of shared token_embd (#5944)	commit \| commitdiff \| tree
2024-03-08	Pierrick Hymbert	server: metrics: add llamacpp:prompt_seconds_total...	commit \| commitdiff \| tree
2024-03-08	Don Mahurin	llama : assume tied weights if lm_head/output weights...	commit \| commitdiff \| tree
2024-03-08	Georgi Gerganov	server : fix EOS token detection with disabled cache...	commit \| commitdiff \| tree
2024-03-08	UEXTM.com	log : fix MSVC compile errors (#5643)	commit \| commitdiff \| tree
2024-03-07	Georgi Gerganov	llama-bench : add embeddings option (#5924)	commit \| commitdiff \| tree
2024-03-07	Neo Zhang Jianyu	Revert "[SYCL] fix error when set main gpu to non-zero...	commit \| commitdiff \| tree
2024-03-07	Minsoo Cheong	server : add `/v1/completions` endpoint (#5914)	commit \| commitdiff \| tree
2024-03-07	Georgi Gerganov	server : refactor (#5882)	commit \| commitdiff \| tree
2024-03-07	Neo Zhang Jianyu	[SYCL] fix error when set main gpu to non-zero (#5901)	commit \| commitdiff \| tree
2024-03-06	Jared Van Bortel	ggml : use SYS_get_cpu if SYS_getcpu is not defined...	commit \| commitdiff \| tree
2024-03-06	bobqianic	ggml : use `uint8x16_t` return type for `ggml_vqtbl1q_u...	commit \| commitdiff \| tree
2024-03-06	Georgi Gerganov	convert : remove AWQ remnants (#5768)	commit \| commitdiff \| tree
2024-03-06	Neo Zhang Jianyu	add wait() to make code stable (#5895)	commit \| commitdiff \| tree
2024-03-05	slaren	compare-llama-bench.py : remove mul_mat_q (#5892)	commit \| commitdiff \| tree
2024-03-05	Jared Van Bortel	quants : use MM256_SET_M128I consistently to fix gcc...	commit \| commitdiff \| tree
2024-03-05	ExtReMLapin	grammars : blacklists character control set (#5888)	commit \| commitdiff \| tree
2024-03-05	Georgi Gerganov	Revert "grammars : don't allow to output unescaped...	commit \| commitdiff \| tree
2024-03-05	ExtReMLapin	grammars : don't allow to output unescaped new line...	commit \| commitdiff \| tree
2024-03-05	0cc4m	Vulkan Improvements (#5835)	commit \| commitdiff \| tree
2024-03-05	Neo Zhang Jianyu	[SYCL] fix mul_mat fault in CI/unit-test (#5862)	commit \| commitdiff \| tree
2024-03-05	Minsoo Cheong	fix editorconfig check break (#5879)	commit \| commitdiff \| tree
2024-03-05	Jeffrey Quesnelle	fix speculative decoding build on windows (#5874)	commit \| commitdiff \| tree
2024-03-05	hutli	nix: static build (#5814)	commit \| commitdiff \| tree
2024-03-04	Georgi Gerganov	llama : fix embeddings (#5796)	commit \| commitdiff \| tree
2024-03-04	Georgi Gerganov	flake : fix	commit \| commitdiff \| tree
2024-03-04	Georgi Gerganov	ggml : fix unknown status (#0)	commit \| commitdiff \| tree
2024-03-04	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-03-04	Michael Podvitskiy	ggml : introduce ggml_status (ggml/750)	commit \| commitdiff \| tree
2024-03-04	Dane Madsen	cmake : handle cases where git index is not found in...	commit \| commitdiff \| tree
2024-03-04	Minsoo Cheong	speculative : implement stochastic speculative sampling...	commit \| commitdiff \| tree
2024-03-04	Xuan Son Nguyen	add alias for chat template (#5858)	commit \| commitdiff \| tree
2024-03-04	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-03-04	leejet	add some new ops, fix some operators and add batch...	commit \| commitdiff \| tree
2024-03-04	DAN™	common : use LLAMA_DEFAULT_SEED (#5855)	commit \| commitdiff \| tree
2024-03-04	DAN™	main : support special tokens as reverse/anti prompt...	commit \| commitdiff \| tree
2024-03-03	slaren	cuda : fix data race in soft max (#5853)	commit \| commitdiff \| tree
2024-03-03	Georgi Gerganov	readme : add API changes section	commit \| commitdiff \| tree
2024-03-03	Douglas Hanley	llama : allow for user specified embedding pooling...	commit \| commitdiff \| tree
2024-03-03	Nindaleth	gguf-dump : support i-quants (#5841)	commit \| commitdiff \| tree
2024-03-03	compilade	llama : fix llama_copy_state_data with fragmented KV...	commit \| commitdiff \| tree
2024-03-03	Pierrick Hymbert	ci : schedule slow server tests only on Release or...	commit \| commitdiff \| tree
2024-03-03	Pierrick Hymbert	server : init http requests thread pool with --parallel...	commit \| commitdiff \| tree
2024-03-03	Georgi Gerganov	flake.lock: Update (#5842)	commit \| commitdiff \| tree
2024-03-02	Pierrick Hymbert	server: tests: passkey challenge / self-extend with...	commit \| commitdiff \| tree
2024-03-02	Michael Podvitskiy	llama : add abort_callback to interrupt computation...	commit \| commitdiff \| tree
2024-03-02	Georgi Gerganov	ggml : fix IQ3_S AVX implementation (#5834)	commit \| commitdiff \| tree
2024-03-02	Jared Van Bortel	convert : automatically fall back to HfVocab if tokeniz...	commit \| commitdiff \| tree
2024-03-02	Jared Van Bortel	convert-hf : make model class definitions self-containe...	commit \| commitdiff \| tree
2024-03-02	Kawrakow	ggml : IQ3_S improvements (#5829)	commit \| commitdiff \| tree
2024-03-02	Georgi Gerganov	scripts : add pod-llama.sh	commit \| commitdiff \| tree
2024-03-02	Xuan Son Nguyen	llama : refactor internal quantization functions (...	commit \| commitdiff \| tree
2024-03-02	compilade	llama : fix segfault from unknown model arch name ...	commit \| commitdiff \| tree
2024-03-02	Neo Zhang Jianyu	Support multiple GPUs (split mode) on SYCL backend...	commit \| commitdiff \| tree
2024-03-02	crasm	workflows : remove nocleanup arg for check-requirements...	commit \| commitdiff \| tree
2024-03-01	Tushar	build(nix): Introduce flake.formatter for `nix fmt...	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom