git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2024-01-28	Johannes Gäßler	Apply min_p to unsorted tokens (#5115)	commit \| commitdiff \| tree
2024-01-28	Johannes Gäßler	Tests for min_p, sampling queue (#5147)	commit \| commitdiff \| tree
2024-01-28	Marcus Dunn	readme : add link to rust bindings (#5148)	commit \| commitdiff \| tree
2024-01-28	sharpHL	llama : add support for Orion-14B (#5118)	commit \| commitdiff \| tree
2024-01-28	Kyle Mistele	docker : add server-first container images (#5157)	commit \| commitdiff \| tree
2024-01-27	John	llava : support for Yi-VL and fix for mobileVLM (#5093)	commit \| commitdiff \| tree
2024-01-27	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-01-27	Judd	ggml : check ggml_add src1 type (ggml/708)	commit \| commitdiff \| tree
2024-01-27	Michael Klimenko	Remove unused data and add fixes (#5154)	commit \| commitdiff \| tree
2024-01-27	Maximilian...	server : add self-extend support (#5104)	commit \| commitdiff \| tree
2024-01-26	0cc4m	Add OpenCL add kernel (#5151)	commit \| commitdiff \| tree
2024-01-26	Jared Van Bortel	cmake : pass CPU architecture flags to nvcc (#5146)	commit \| commitdiff \| tree
2024-01-26	slaren	cuda : fix tensor size calculation for non-split buffer...	commit \| commitdiff \| tree
2024-01-26	slaren	ggml-alloc : add 10% margin to the buffer sizes (#5149)	commit \| commitdiff \| tree
2024-01-26	snadampal	ggml : update softmax n_task calculation (#5126)	commit \| commitdiff \| tree
2024-01-26	Georgi Gerganov	scripts : move run-with-preset.py from root to scripts...	commit \| commitdiff \| tree
2024-01-26	Georgi Gerganov	tests : gitignore test-c.o	commit \| commitdiff \| tree
2024-01-26	Xuan Son Nguyen	server : refactored the task processing logic (#5065)	commit \| commitdiff \| tree
2024-01-26	crasm	ci : add model tests + script wrapper (#4586)	commit \| commitdiff \| tree
2024-01-26	Paul Tsochantaris	metal : remove unused `n_buffers` and `buffers` (#5129)	commit \| commitdiff \| tree
2024-01-26	Riceball LEE	gguf : fix "general.alignment" type in gguf_reader...	commit \| commitdiff \| tree
2024-01-26	Georgi Gerganov	readme : update hot topics	commit \| commitdiff \| tree
2024-01-26	Kawrakow	Another bucket sort (#5109)	commit \| commitdiff \| tree
2024-01-25	XiaotaoChen	readme : add MobileVLM 1.7B/3B to the supported models...	commit \| commitdiff \| tree
2024-01-25	l3utterfly	llama : dynamic temperature sampling (#4972)	commit \| commitdiff \| tree
2024-01-25	Jared Van Bortel	examples : make pydantic scripts pass mypy and support...	commit \| commitdiff \| tree
2024-01-25	Valentin Konovalov	android : use release cmake build type by default ...	commit \| commitdiff \| tree
2024-01-25	Kawrakow	Fix Q3_K_XS for MoE models (#5113)	commit \| commitdiff \| tree
2024-01-25	Georgi Gerganov	metal : show compile log messages	commit \| commitdiff \| tree
2024-01-24	Engininja2	cuda : fix 2-bit quants on amd hip (#5105)	commit \| commitdiff \| tree
2024-01-24	Michael Hueschen	nix-shell: use addToSearchPath	commit \| commitdiff \| tree
2024-01-24	Michael Hueschen	nix: add cc to devShell LD_LIBRARY_PATH	commit \| commitdiff \| tree
2024-01-24	slaren	llama : pre-allocate input tensors in a separate buffer...	commit \| commitdiff \| tree
2024-01-23	Georgi Gerganov	metal : disable support for MUL_MAT F32 x F16	commit \| commitdiff \| tree
2024-01-23	Kawrakow	Additional KL-divergence statistics (#5081)	commit \| commitdiff \| tree
2024-01-23	Johannes Gäßler	CUDA: more info when no device code (#5088)	commit \| commitdiff \| tree
2024-01-23	Georgi Gerganov	minor : clean-up some warnings and style (#5094)	commit \| commitdiff \| tree
2024-01-23	Xuan Son Nguyen	devops : add intel oneapi dockerfile (#5068)	commit \| commitdiff \| tree
2024-01-23	Michael Coppola	llama.vim : added api key support (#5090)	commit \| commitdiff \| tree
2024-01-22	slaren	llama : fix not enough space in buffer with Qwen (...	commit \| commitdiff \| tree
2024-01-22	Kawrakow	KL-divergence (#5076)	commit \| commitdiff \| tree
2024-01-22	Reinforce-II	ggml : parallelize FP32 conversion when using BLAS...	commit \| commitdiff \| tree
2024-01-22	XiaotaoChen	llava : MobileVLM support (#4954)	commit \| commitdiff \| tree
2024-01-22	Someone Serge	flake.nix: add a comment about flakes vs nix	commit \| commitdiff \| tree
2024-01-22	Someone Serge	nix: add a comment on the many nixpkgs-with-cuda instances	commit \| commitdiff \| tree
2024-01-22	Someone Serge	nix: add a comment about makeScope	commit \| commitdiff \| tree
2024-01-22	Someone Serge	nix: refactor the cleanSource rules	commit \| commitdiff \| tree
2024-01-22	Someone Serge	workflows: nix-ci: drop the redundant "paths" filter	commit \| commitdiff \| tree
2024-01-22	Someone Serge	workflows: nix-build-aarch64: rate limit	commit \| commitdiff \| tree
2024-01-22	Someone Serge	workflows: nix-ci: rebuild on flake.lock updates	commit \| commitdiff \| tree
2024-01-22	Kawrakow	imatrix : keep intermediate imatrix results (#5077)	commit \| commitdiff \| tree
2024-01-22	compilade	llama : support StableLM 2 1.6B (#5052)	commit \| commitdiff \| tree
2024-01-22	Daniel Bevenius	finetune : print sample-start/include-sample-start...	commit \| commitdiff \| tree
2024-01-22	Kawrakow	llama : add Q3_K_XS (#5060)	commit \| commitdiff \| tree
2024-01-22	bobqianic	ci : fix Windows CI by updating Intel SDE version ...	commit \| commitdiff \| tree
2024-01-22	Shijie	llama : add more qwen2 models (#5071)	commit \| commitdiff \| tree
2024-01-21	iSma	Revert LLAMA_NATIVE to OFF in flake.nix (#5066)	commit \| commitdiff \| tree
2024-01-21	kuronekosaiko	add safetensors support to convert-lora-to-ggml.py...	commit \| commitdiff \| tree
2024-01-21	bobqianic	add `#include <string>` to unicode.h (#5051)	commit \| commitdiff \| tree
2024-01-21	Kawrakow	Add ability to evauate multiple choice tasks (#5047)	commit \| commitdiff \| tree
2024-01-21	Kawrakow	Slightly faster imatrix (#5050)	commit \| commitdiff \| tree
2024-01-21	Georgi Gerganov	flake.lock: Update (#5054)	commit \| commitdiff \| tree
2024-01-20	Jared Van Bortel	convert : partially revert PR #4818 (#5041)	commit \| commitdiff \| tree
2024-01-20	Jared Van Bortel	perplexity : fix MSVC build after #5020 (#5043)	commit \| commitdiff \| tree
2024-01-20	slaren	llama : run all KQV ops on the CPU with no KV offload...	commit \| commitdiff \| tree
2024-01-20	Herman Semenov	cmake : add support for ccache (#5002)	commit \| commitdiff \| tree
2024-01-20	adel boussaken	Add a dart/flutter binding to README.md (#4882)	commit \| commitdiff \| tree
2024-01-20	Kylin	cuda : fix compile error in jetson platform (#4975)	commit \| commitdiff \| tree
2024-01-19	Uzo Nweke	finetune : fix ggml_allocr lifetimes (tmp workaround...	commit \| commitdiff \| tree
2024-01-19	Georgi Gerganov	imatrix : add README.md	commit \| commitdiff \| tree
2024-01-19	Shijie	llama : support upcoming Qwen2 (#5037)	commit \| commitdiff \| tree
2024-01-19	Georgi Gerganov	py : fix flake8 lint	commit \| commitdiff \| tree
2024-01-19	Kawrakow	winogrande: evaluate log-probs in parallel (#5036)	commit \| commitdiff \| tree
2024-01-19	chiranko	llama : add CodeShell support (#5016)	commit \| commitdiff \| tree
2024-01-19	Kawrakow	perplexity: avoid unnecessary alloocations and logit...	commit \| commitdiff \| tree
2024-01-19	Georgi Gerganov	perplexity : faster Winogrande via batching (#5024)	commit \| commitdiff \| tree
2024-01-18	John	llama : fix falcon arch for tied output embeddings...	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	cmake : add ggml public headers (#5011)	commit \| commitdiff \| tree
2024-01-18	Xuan Son Nguyen	server : defer tasks when "slot unavailable" (#5018)	commit \| commitdiff \| tree
2024-01-18	slaren	llama : fix mlock with no-mmap with Metal (#5025)	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	imatrix : fix assert for src0 non-cont check	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	perplexity : fix winogrande N tasks option	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	scripts : add get-winogrande.sh	commit \| commitdiff \| tree
2024-01-18	David Sommers	convert.py : fix llama/llama2 conversion due to vocab_s...	commit \| commitdiff \| tree
2024-01-18	Kawrakow	HellaSwag: speed up by parallelizing log-prob evaluatio...	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	perplexity : faster HellaSwag via batching (#5017)	commit \| commitdiff \| tree
2024-01-18	Kawrakow	Add Winogrande evaluation (#5015)	commit \| commitdiff \| tree
2024-01-18	Georgi Gerganov	scritps : add helper script to get hellaswag data in...	commit \| commitdiff \| tree
2024-01-18	Paul Tsochantaris	metal : fix memory leak, dangling pointer and unused...	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	ggml : add IQ2 to test-backend-ops + refactoring (...	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	imatrix : offload to GPU support (#4957)	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	backend : add eval callback (#4935)	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	metal : create autorelease pool during library build...	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	py : fix whitespace	commit \| commitdiff \| tree
2024-01-17	Georgi Gerganov	py : fix missing added_tokens_dict for SPM and BPE...	commit \| commitdiff \| tree
2024-01-17	Kawrakow	llama : use Q4_K for attn_v for Q2_K_S when n_gqa ...	commit \| commitdiff \| tree
2024-01-17	Paul Tsochantaris	metal : remove unnecessary nil check (#4986)	commit \| commitdiff \| tree
2024-01-17	David Renshaw	llama : fix copy/paste error in llama_sampling_params...	commit \| commitdiff \| tree
2024-01-16	Georgi Gerganov	py : remove unnecessary hasattr (#4903)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom