git.djapps.eu Git - pkg/ggml/sources/llama.cpp - shortlog
2024-01-15  Kawrakow  CUDA: faster dequantize kernels for Q4_0 and Q4_1 ...
2024-01-14  David Pflug  llama : fix missing quotes (#4937)
2024-01-14  Kawrakow  Add ability to use importance matrix for all k-quants...
2024-01-14  Georgi Gerganov  llama : check LLAMA_TRACE env for extra logging (#4929)
2024-01-14  Georgi Gerganov  scripts : sync-ggml-am.sh option to skip commits
2024-01-14  Georgi Gerganov  llama : use LLAMA_LOG_ macros for logging
2024-01-14  Kawrakow  Fix ffn_down quantization mix for MoE models (#4927)
2024-01-14  Alex Azarov  metal : correctly set SIMD support flags on iOS (#4923)
2024-01-14  Karthik Kumar...  llama : support WinXP build with MinGW 8.1.0 (#3419)
2024-01-14  Kawrakow  2-bit quantizations (#4897)
2024-01-14  Kawrakow  Make Q3_K_S be the same as olf Q3_K_L for Mixtral-8x7B...
2024-01-13  Georgi Gerganov  sync : ggml
2024-01-13  Johannes Gäßler  ggml: cache sin/cos for RoPE (#4908)
2024-01-13  Georgi Gerganov  metal : remove old API (#4919)
2024-01-13  Georgi Gerganov  server : fix prompt caching with system prompt (#4914)
2024-01-13  Georgi Gerganov  llama : fix detokenization of non-special added-tokens...
2024-01-13  Georgi Gerganov  metal : disable log for loaded kernels (#4794)
2024-01-13  David Friehs  llama : minimize size used for state save/load (#4820)
2024-01-13  Someone  workflows: unbreak nix-build-aarch64, and split it...
2024-01-13  Yann Follet  main : add parameter --no-display-prompt (#4541)
2024-01-13  texmex76  gguf : fix potential infinite for-loop (#4600)
2024-01-13  Georgi Gerganov  metal : refactor kernel loading code (#4794)
2024-01-13  Johannes Gäßler  compare-llama-bench: tweak output format (#4910)
2024-01-13  Ziad Ben Hadj...  server : fix deadlock that occurs in multi-prompt scena...
2024-01-13  makomk  server : fix crash with multimodal models without BOS...
2024-01-13  Georgi Gerganov  convert : update phi-2 to latest HF repo (#4903)
2024-01-12  Georgi Gerganov  sync : ggml
2024-01-12  Georgi Gerganov  ggml : fix 32-bit ARM compat for IQ2_XS (whisper/1758)
2024-01-12  slaren  backend_sched : fix assignments
2024-01-12  Maximilian...  examples : add pydantic models to GBNF grammar generato...
2024-01-12  Johannes Gäßler  CUDA: faster q8_0 -> f16 dequantization (#4895)
2024-01-12  slaren  llama : ggml-backend integration (#4766)
2024-01-12  Georgi Gerganov  llama : remove redundant assert for StableLM (#4901)
2024-01-12  Daniel Bevenius  export-lora : use LLAMA_FILE_MAGIC_GGLA (#4894)
2024-01-12  Zay  llama.swiftui : update models layout (#4826)
2024-01-12  Georgi Gerganov  gitignore : imatrix
2024-01-12  Johannes Gäßler  CUDA: fix softmax compile for old CUDA versions (#4862)
2024-01-12  Georgi Gerganov  llama : fix typo "imp_embd" -> "inp_embd"
2024-01-12  howlger  common : streamline the formatting of help (#4890)
2024-01-12  Georgi Gerganov  py : fix lint (#4889)
2024-01-12  Georgi Gerganov  llama : fix llm_build_k_shift to use correct n_rot...
2024-01-12  Kawrakow  Importance Matrix calculation (#4861)
2024-01-11  Georgi Gerganov  server : fix infill when prompt is empty (#4833)
2024-01-11  Georgi Gerganov  main : better name for variable n_print (#4874)
2024-01-11  Georgi Gerganov  main : disable token count by default (#4874)
2024-01-11  Georgi Gerganov  swift : track ggml release branch (#4867)
2024-01-11  Kawrakow  llama : restore intended k-quants mixes for MoE models...
2024-01-11  Kawrakow  ggml : SOTA 2-bit quants (add IQ2_XS) (#4856)
2024-01-11  Georgi Gerganov  swift : pin ggml commit + remove ggml.h from spm-header...
2024-01-11  Laura  server : implement credentialed CORS (#4514)
2024-01-11  Michael Coppola  server : support for multiple api keys (#4864)
2024-01-11  Behnam M  server : add `LOG_INFO` when model is successfully...
2024-01-11  Someone  ci: nix-flake-update: new token with pr permissions...
2024-01-11  pudepiedj  main : print total token count and tokens consumed...
2024-01-11  Isaac McFadyen  server : fix typo in model name (#4876)
2024-01-11  Paul Tsochantaris  metal : put encoder debug group behind a define (#4873)
2024-01-11  Georgi Gerganov  sync : ggml
2024-01-11  Georgi Gerganov  metal : fix deprecation warning (ggml/690)
2024-01-11  Timothy Cronin  ggml : remove ggml_cpy_inplace and ggml_cont_inplace...
2024-01-11  Jack Mousseau  metal : wrap each operation in debug group (ggml/690)
2024-01-11  leejet  ggml : change GGML_MAX_NAME at compile time (ggml/682)
2024-01-11  Halalaluyafail3  Fix execlp call (ggml/689)
2024-01-11  Erik Scholz  fix : cuda order of synchronization when setting a...
2024-01-11  Behnam M  server : update readme to document the new `/health...
2024-01-11  Georgi Gerganov  server : fix build + rename enums (#4870)
2024-01-10  Behnam M  server : add a `/health` endpoint (#4860)
2024-01-10  Brian  llama : add additional suffixes for model params (...
2024-01-10  Austin  llama : recognize 1B phi models (#4847)
2024-01-10  John  clip : support more quantization types (#4846)
2024-01-10  Johannes Gäßler  Python script to compare commits with llama-bench ...
2024-01-09  Austin  convert.py : fix vanilla LLaMA model conversion (#4818)
2024-01-09  Justine Tunney  llava-cli : don't crash if --image flag is invalid...
2024-01-09  Georgi Gerganov  metal : improve dequantize precision to match CPU ...
2024-01-09  Georgi Gerganov  scripts : improve get-pg.sh (#4838)
2024-01-09  iohub  readme : add 3rd party collama reference to UI list...
2024-01-09  Georgi Gerganov  scripts : script to get Paul Graham essays in txt forma...
2024-01-09  Behnam M  server : update readme about token probs (#4777)
2024-01-09  Zsapi  server : add api-key flag to documentation (#4832)
2024-01-09  Georgi Gerganov  ggml : fix vld1q_s8_x4 32-bit compat (#4828)
2024-01-09  Johannes Gäßler  CUDA: faster softmax via shared memory + fp16 math...
2024-01-08  howlger  common : fix the short form of `--grp-attn-w`, not...
2024-01-08  Georgi Gerganov  readme : add link to SOTA models
2024-01-08  Kawrakow  SOTA 2-bit quants (#4773)
2024-01-08  Georgi Gerganov  swift : exclude ggml-metal.metal from the package ...
2024-01-08  Georgi Gerganov  llama.swiftui : update readme
2024-01-08  Georgi Gerganov  main : add self-extend support (#4815)
2024-01-08  Georgi Gerganov  examples : add passkey test (#3856)
2024-01-07  Lars Grammel  readme : add lgrammel/modelfusion JS/TS client for...
2024-01-07  slaren  llama-bench : add no-kv-offload parameter (#4812)
2024-01-07  Johannes Gäßler  CUDA: fixed redundant value dequantization (#4809)
2024-01-07  Georgi Gerganov  llama : remove unused vars (#4796)
2024-01-07  Georgi Gerganov  llama : remove redundant GQA check (#4796)
2024-01-07  Alex Azarov  llama.swiftui : use llama.cpp as SPM package (#4804)
2024-01-07  Georgi Gerganov  llama : print tensor meta for debugging
2024-01-07  Alex Azarov  llama.swiftui : add visionOS target (#4805)
2024-01-07  Konstantin...  ggml : use __builtin_amdgcn_sudot4 in __dp4a for gfx11...
2024-01-07  Georgi Gerganov  server : fix n_predict check (#4798)
2024-01-06  Daniel Illescas...  llama.swiftui : use correct pointer for llama_token_eos...
2024-01-06  Georgi Gerganov  examples : improve base-translate.sh script (#4783)
2024-01-05  a-n-n-a-l-e-e  cmake : check for openblas64 (#4134)