git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2023-11-13	Georgi Gerganov	ggml : sync (im2col, GPU conv, 32-bit arm compat) ...	commit \| commitdiff \| tree
2023-11-13	Georgi Gerganov	readme : update hot topics	commit \| commitdiff \| tree
2023-11-13	Georgi Gerganov	sync : ggml (backend v2) (#3912)	commit \| commitdiff \| tree
2023-11-13	Kerfuffle	Add ReLU and SQR CUDA ops to (partially) fix Persimmon...	commit \| commitdiff \| tree
2023-11-12	Kerfuffle	gguf-py: gguf_writer: Use bytearray to build metadata...	commit \| commitdiff \| tree
2023-11-12	Richard Kiss	Fix some documentation typos/grammar mistakes (#4032)	commit \| commitdiff \| tree
2023-11-11	M. Yusuf Sarıgöz	Fix gguf-convert-endian script (#4037)	commit \| commitdiff \| tree
2023-11-11	Alexey Parfenov	server : fix crash when prompt exceeds context size...	commit \| commitdiff \| tree
2023-11-11	Kerfuffle	gguf-py: Refactor and allow reading/modifying existing...	commit \| commitdiff \| tree
2023-11-10	Jhen-Jie Hong	server : allow continue edit on completion mode (#3950)	commit \| commitdiff \| tree
2023-11-10	Galunid	Unbreak persimmon after #3837 (#4010)	commit \| commitdiff \| tree
2023-11-09	Galunid	scripts: Generalize convert scripts (#3838)	commit \| commitdiff \| tree
2023-11-09	Mihai	server : add min_p param (#3877)	commit \| commitdiff \| tree
2023-11-08	slaren	ggml-alloc : fix backend assignments of views (#3982)	commit \| commitdiff \| tree
2023-11-07	Jared Van Bortel	gguf : track writer state, free unneeded tensors, clean...	commit \| commitdiff \| tree
2023-11-07	Georgi Gerganov	make : do not add linker flags when compiling static...	commit \| commitdiff \| tree
2023-11-07	xaedes	ggml : fix backward rope after YaRN (#3974)	commit \| commitdiff \| tree
2023-11-07	Matthew Tejo	Use params when loading models in llava-cli (#3976)	commit \| commitdiff \| tree
2023-11-07	Meng Zhang	cuda : supports running on CPU for GGML_USE_CUBLAS...	commit \| commitdiff \| tree
2023-11-06	Damian Stewart	llava : expose as a shared library for downstream proje...	commit \| commitdiff \| tree
2023-11-05	slaren	ggml-cuda : fix f16 mul mat (#3961)	commit \| commitdiff \| tree
2023-11-05	Kerfuffle	Allow common process_escapes to handle \x sequences...	commit \| commitdiff \| tree
2023-11-05	Thái Hoàng Tâm	server : fix typo for --alias shortcut from -m to ...	commit \| commitdiff \| tree
2023-11-05	Jared Van Bortel	cuda : fix disabling device with --tensor-split 1,0...	commit \| commitdiff \| tree
2023-11-05	Meng Zhang	llama : mark LLM_ARCH_STARCODER as full offload support...	commit \| commitdiff \| tree
2023-11-05	Eve	cmake : MSVC instruction detection (fixed up #809)...	commit \| commitdiff \| tree
2023-11-05	Eve	ci : use intel sde when ci cpu doesn't support avx512...	commit \| commitdiff \| tree
2023-11-05	slaren	cuda : revert CUDA pool stuff (#3944)	commit \| commitdiff \| tree
2023-11-04	Kerfuffle	gguf-py: Support 01.AI Yi models (#3943)	commit \| commitdiff \| tree
2023-11-03	Peter Sugihara	metal : round up to 16 to fix MTLDebugComputeCommandEnc...	commit \| commitdiff \| tree
2023-11-03	Xiao-Yong Jin	ggml-metal: fix yarn rope (#3937)	commit \| commitdiff \| tree
2023-11-03	slaren	ggml-cuda : move row numbers to x grid dim in mmv kerne...	commit \| commitdiff \| tree
2023-11-03	Georgi Gerganov	speculative : change default p_accept to 0.5 + CLI...	commit \| commitdiff \| tree
2023-11-03	Georgi Gerganov	common : YAYF (yet another YARN fix) (#3925)	commit \| commitdiff \| tree
2023-11-03	cebtenzzre	llama : change yarn_ext_factor placeholder to -1 (...	commit \| commitdiff \| tree
2023-11-02	Kerfuffle	cuda : add ROCM aliases for CUDA pool stuff (#3918)	commit \| commitdiff \| tree
2023-11-02	Andrei	cmake : fix relative path to git submodule index (...	commit \| commitdiff \| tree
2023-11-02	Georgi Gerganov	readme : add notice about #3912	commit \| commitdiff \| tree
2023-11-02	Georgi Gerganov	cuda : fix const ptrs warning causing ROCm build issues...	commit \| commitdiff \| tree
2023-11-02	Oleksii Maryshchenko	cuda : use CUDA memory pool with async memory allocatio...	commit \| commitdiff \| tree
2023-11-02	Georgi Gerganov	gguf : print error for GGUFv1 files (#3908)	commit \| commitdiff \| tree
2023-11-02	slaren	cmake : disable LLAMA_NATIVE by default (#3906)	commit \| commitdiff \| tree
2023-11-02	Georgi Gerganov	gguf : remove special-case code for GGUFv1 (#3901)	commit \| commitdiff \| tree
2023-11-02	Georgi Gerganov	llm : prevent from 1-D tensors being GPU split (#3697)	commit \| commitdiff \| tree
2023-11-02	cebtenzzre	build : link against build info instead of compiling...	commit \| commitdiff \| tree
2023-11-02	Georgi Gerganov	cuda : check if this fixes Pascal card regression ...	commit \| commitdiff \| tree
2023-11-02	Georgi Gerganov	metal : fix build errors and kernel sig after #2268...	commit \| commitdiff \| tree
2023-11-02	cebtenzzre	cuda : fix RoPE after #2268 (#3897)	commit \| commitdiff \| tree
2023-11-01	cebtenzzre	llama : fix llama_context_default_params after #2268...	commit \| commitdiff \| tree
2023-11-01	slaren	ggml-cuda : compute ptrs for cublasGemmBatchedEx in...	commit \| commitdiff \| tree
2023-11-01	cebtenzzre	llama : implement YaRN RoPE scaling (#2268)	commit \| commitdiff \| tree
2023-11-01	Georgi Gerganov	llm : fix llm_build_kqv taking unused tensor (benign...	commit \| commitdiff \| tree
2023-11-01	Georgi Gerganov	llm : fix falcon norm after refactoring (#3837)	commit \| commitdiff \| tree
2023-11-01	Georgi Gerganov	metal : multi-simd softmax (#3710)	commit \| commitdiff \| tree
2023-11-01	Georgi Gerganov	common : minor (#3715)	commit \| commitdiff \| tree
2023-11-01	Georgi Gerganov	llm : add llm_build_context (#3881)	commit \| commitdiff \| tree
2023-11-01	bandoti	common : allow caller to handle help/argument exception...	commit \| commitdiff \| tree
2023-11-01	staviq	log : make generating separate log files optional ...	commit \| commitdiff \| tree
2023-11-01	l3utterfly	sampling : null grammar field after reset (#3885)	commit \| commitdiff \| tree
2023-11-01	Georgi Gerganov	ggml : fix UNUSED macro (#3762)	commit \| commitdiff \| tree
2023-11-01	Andrew Godfrey	finetune : add -ngl parameter (#3762)	commit \| commitdiff \| tree
2023-11-01	Georgi Gerganov	scripts : add server-llm.sh (#3868)	commit \| commitdiff \| tree
2023-11-01	Adrian Hesketh	server : re-enable completion and embedded at the same...	commit \| commitdiff \| tree
2023-11-01	Georgi Gerganov	llama : refactor graph build code (#3837)	commit \| commitdiff \| tree
2023-10-31	kalomaze	samplers : Min-P sampler implementation [alternative...	commit \| commitdiff \| tree
2023-10-31	Tungsten842	flake.nix: fix for rocm 5.7 (#3853)	commit \| commitdiff \| tree
2023-10-30	Georgi Gerganov	ggml : move FP16 <-> FP32 code to ggml-impl.h (#3861)	commit \| commitdiff \| tree
2023-10-29	Kerfuffle	Extend llama_kv_cache_seq_rm to allow matching any...	commit \| commitdiff \| tree
2023-10-29	cebtenzzre	make : remove unnecessary dependency on build-info...	commit \| commitdiff \| tree
2023-10-29	Georgi Gerganov	llama : fix kv shift bug (#3835)	commit \| commitdiff \| tree
2023-10-29	Georgi Gerganov	ggml : quantization refactoring (#3833)	commit \| commitdiff \| tree
2023-10-28	Erik Scholz	flake : update flake.lock for newer transformers versio...	commit \| commitdiff \| tree
2023-10-28	Aarni Koskela	metal : try cwd for ggml-metal.metal if bundle lookup...	commit \| commitdiff \| tree
2023-10-28	Georgi Gerganov	issues : change label from bug to bug-unconfirmed ...	commit \| commitdiff \| tree
2023-10-28	Georgi Gerganov	convert : ignore tokens if their IDs are within [0...	commit \| commitdiff \| tree
2023-10-28	Kerfuffle	llama : allow quantizing k-quants to fall back when...	commit \| commitdiff \| tree
2023-10-28	Georgi Gerganov	llama : add option for greedy sampling with probs ...	commit \| commitdiff \| tree
2023-10-28	Henk Poley	common : print that one line of the syntax help *also...	commit \| commitdiff \| tree
2023-10-28	Georgi Gerganov	starcoder : add GPU offloading (#3827)	commit \| commitdiff \| tree
2023-10-27	Kerfuffle	speculative : ensure draft and target model vocab match...	commit \| commitdiff \| tree
2023-10-27	cebtenzzre	llama : correctly report GGUFv3 format (#3818)	commit \| commitdiff \| tree
2023-10-27	Thibault Terrasson	simple : fix batch handling (#3803)	commit \| commitdiff \| tree
2023-10-27	Georgi Gerganov	cuda : improve text-generation and batched decoding...	commit \| commitdiff \| tree
2023-10-26	Georgi Gerganov	server : do not release slot on image input (#3798)	commit \| commitdiff \| tree
2023-10-25	Georgi Gerganov	batched-bench : print params at start	commit \| commitdiff \| tree
2023-10-25	Georgi Gerganov	log : disable pid in log filenames	commit \| commitdiff \| tree
2023-10-24	cebtenzzre	server : add parameter -tb N, --threads-batch N (#3584...	commit \| commitdiff \| tree
2023-10-24	Georgi Gerganov	server : do not block system prompt update (#3767)	commit \| commitdiff \| tree
2023-10-24	Georgi Gerganov	sync : ggml (conv ops + cuda MSVC fixes) (#3765)	commit \| commitdiff \| tree
2023-10-24	John Smith	cmake : add missed dependencies (#3763)	commit \| commitdiff \| tree
2023-10-24	Georgi Gerganov	cuda : add batched cuBLAS GEMM for faster attention...	commit \| commitdiff \| tree
2023-10-24	Galunid	Add more tokenizer tests (#3742)	commit \| commitdiff \| tree
2023-10-24	Georgi Gerganov	metal : handle ggml_scale for n%4 != 0 (close #3754)	commit \| commitdiff \| tree
2023-10-23	Georgi Gerganov	Revert "make : add optional CUDA_NATIVE_ARCH (#2482)"	commit \| commitdiff \| tree
2023-10-23	M. Yusuf Sarıgöz	issues : separate bug and enhancement template + no...	commit \| commitdiff \| tree
2023-10-23	Galunid	Update special token handling in conversion scripts...	commit \| commitdiff \| tree
2023-10-23	Marcus Dunn	llama : remove token functions with `context` args...	commit \| commitdiff \| tree
2023-10-23	Galunid	Fix baichuan convert script not detecing model (#3739)	commit \| commitdiff \| tree
2023-10-22	Alex	make : add optional CUDA_NATIVE_ARCH (#2482)	commit \| commitdiff \| tree
2023-10-22	Georgi Gerganov	server : parallel decoding and multimodal (#3677)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom