git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2024-06-18	jojorne	Fix no gcc pragma on Windows (#7751)	commit \| commitdiff \| tree
2024-06-18	Ulrich Drepper	Allow compiling with CUDA without CUDA runtime installe...	commit \| commitdiff \| tree
2024-06-18	Frank Mai	chore: clean useless beam search param (#7985)	commit \| commitdiff \| tree
2024-06-18	Abheek Gulati	readme : update UI list (#7943)	commit \| commitdiff \| tree
2024-06-18	Georgi Gerganov	ggml : sync	commit \| commitdiff \| tree
2024-06-18	Georgi Gerganov	whisper : use ggml_backend_sched (whisper/2239)	commit \| commitdiff \| tree
2024-06-17	Ștefan-Gabriel...	update: support Qwen2-57B-A14B (#7835)	commit \| commitdiff \| tree
2024-06-17	Srihari-mcw	Make updates to type cast based on compiler instead...	commit \| commitdiff \| tree
2024-06-17	Georgi Gerganov	llama : disable FA if KV head size do not match (#7982)	commit \| commitdiff \| tree
2024-06-17	Bryan Honof	Add Nix and Flox install instructions (#7899)	commit \| commitdiff \| tree
2024-06-17	slaren	sched : offload_op also requires supports_op (#7977)	commit \| commitdiff \| tree
2024-06-17	Frank Mai	fix: divide 0 exception in mamba (#7932)	commit \| commitdiff \| tree
2024-06-17	Markus Tavenrath	Implement non-mapped async IO for CUDA on Windows....	commit \| commitdiff \| tree
2024-06-17	Georgi Gerganov	rpc : fix load/store misaligned addresses (#7948)	commit \| commitdiff \| tree
2024-06-17	Brian	gguf-dump.py: add --markdown dump output (#7853)	commit \| commitdiff \| tree
2024-06-17	Neo Zhang	[SYCL] Update README-sycl.md for Chapter "Recommended...	commit \| commitdiff \| tree
2024-06-16	Calvin Laurenson	Add support for sqrt on CUDA (#7953)	commit \| commitdiff \| tree
2024-06-16	Georgi Gerganov	cuda : fix bounds check for src0 rows in MMVQ kernel...	commit \| commitdiff \| tree
2024-06-16	Hong Bo PENG	ggml : fix and optimize ppc64le (ggml/849)	commit \| commitdiff \| tree
2024-06-16	Daniel Bevenius	ggml : remove duplicate include of ggml-common.h (ggml...	commit \| commitdiff \| tree
2024-06-16	Georgi Gerganov	flake.lock: Update (#7951)	commit \| commitdiff \| tree
2024-06-16	Georgi Gerganov	unicode : avoid char32_t (#7957)	commit \| commitdiff \| tree
2024-06-16	hopkins385	readme : update UI list [no ci] (#7958)	commit \| commitdiff \| tree
2024-06-16	Georgi Gerganov	ggml : fix handling of zero blocks in IQ quants (#7955)	commit \| commitdiff \| tree
2024-06-16	Georgi Gerganov	github : update pr template	commit \| commitdiff \| tree
2024-06-16	0cc4m	Vulkan Shader Refactor, Memory Debugging Option (#7947)	commit \| commitdiff \| tree
2024-06-15	Xuan Son Nguyen	Add `cvector-generator` example (#7514)	commit \| commitdiff \| tree
2024-06-15	Meng, Hengyu	[SYCL] remove global variables (#7710)	commit \| commitdiff \| tree
2024-06-14	olexiyb	ci : fix macos x86 build (#7940)	commit \| commitdiff \| tree
2024-06-14	Johannes Gäßler	CUDA: faster q2_K, q3_K MMQ + int8 tensor cores (#7921)	commit \| commitdiff \| tree
2024-06-14	Georgi Gerganov	metal : utilize max shared memory for mul_mat_id (...	commit \| commitdiff \| tree
2024-06-14	Radoslav Gerganov	llama-bench : fix RPC indication (#7936)	commit \| commitdiff \| tree
2024-06-14	Sigbjørn Skjæret	llama : more checks before assuming FIM tokens (#7644)	commit \| commitdiff \| tree
2024-06-14	Elaine	convert : add Poro-34B-chat tokenizer support (#7713)	commit \| commitdiff \| tree
2024-06-13	Radoslav Gerganov	rpc : fix ggml_backend_rpc_supports_buft() (#7918)	commit \| commitdiff \| tree
2024-06-13	Galunid	readme : Remove outdated instructions from README.md...	commit \| commitdiff \| tree
2024-06-13	slaren	move BLAS to a separate backend (#6210)	commit \| commitdiff \| tree
2024-06-12	Olivier Chafik	`build`: rename main → llama-cli, server → llama-server...	commit \| commitdiff \| tree
2024-06-12	Johannes Gäßler	CUDA: fix broken oob check for FA vec f32 kernel (...	commit \| commitdiff \| tree
2024-06-12	Georgi Gerganov	tests : add non-cont unary tests (#7857)	commit \| commitdiff \| tree
2024-06-12	Georgi Gerganov	ggml : improve ggml_is_contiguous logic (#7856)	commit \| commitdiff \| tree
2024-06-12	Georgi Gerganov	server : restore numeric prompts (#7883)	commit \| commitdiff \| tree
2024-06-12	Meng, Hengyu	update intel docker oneapi-basekit to 2024.1.1-devel...	commit \| commitdiff \| tree
2024-06-12	Patrice Ferlet	Fix a typo and add Fedora 40 pacakge to install for...	commit \| commitdiff \| tree
2024-06-11	k.h.lai	vulkan: select only one device for single gpu with...	commit \| commitdiff \| tree
2024-06-11	0cc4m	Update Vulkan RoPE implementation (#7818)	commit \| commitdiff \| tree
2024-06-11	Deven Mistry	fix broken link in pr template (#7880) [no ci]	commit \| commitdiff \| tree
2024-06-11	Brian	github: move PR template to .github/ root (#7868)	commit \| commitdiff \| tree
2024-06-11	Johannes Gäßler	llama-bench: more compact markdown tables (#7879)	commit \| commitdiff \| tree
2024-06-11	Georgi Gerganov	tests : check the Python version (#7872)	commit \| commitdiff \| tree
2024-06-11	Johannes Gäßler	CUDA: int8 tensor cores for MMQ (q4_K, q5_K, q6_K)...	commit \| commitdiff \| tree
2024-06-11	slaren	fix CUDA CI by using a windows-2019 image (#7861)	commit \| commitdiff \| tree
2024-06-11	Olivier Chafik	json: refine constraint for whitespace to avoid runaway...	commit \| commitdiff \| tree
2024-06-11	Olivier Chafik	`json`: document schema conversion in GBNF readme,...	commit \| commitdiff \| tree
2024-06-10	Jared Van Bortel	cmake : fix CMake requirement for CUDA (#7821)	commit \| commitdiff \| tree
2024-06-10	slaren	ci : try win-2019 on server windows test (#7854)	commit \| commitdiff \| tree
2024-06-10	Georgi Gerganov	examples : remove --instruct remnants (#7846)	commit \| commitdiff \| tree
2024-06-10	Georgi Gerganov	server : improve "prompt" handling (#7847)	commit \| commitdiff \| tree
2024-06-10	Johannes Gäßler	CUDA: use tensor cores for MMQ (#7676)	commit \| commitdiff \| tree
2024-06-10	Ben Ashbaugh	use the correct SYCL context for host USM allocations...	commit \| commitdiff \| tree
2024-06-09	Georgi Gerganov	flake.lock: Update (#7838)	commit \| commitdiff \| tree
2024-06-09	Georgi Gerganov	imatrix : handle partial entries (#7833)	commit \| commitdiff \| tree
2024-06-09	Nicolás Pérez	docs: Added initial PR template with directions for...	commit \| commitdiff \| tree
2024-06-09	mgroeber9110	server: do not remove whitespace at the start of a...	commit \| commitdiff \| tree
2024-06-09	Johannes Gäßler	CUDA: revise q8_1 data layout for mul_mat_q (#7824)	commit \| commitdiff \| tree
2024-06-09	sasha0552	convert-hf : set the model name based on cli arg, if...	commit \| commitdiff \| tree
2024-06-09	compilade	convert-hf : match model part name prefix and suffix...	commit \| commitdiff \| tree
2024-06-09	compilade	gguf-py : decouple adding metadata from writing in...	commit \| commitdiff \| tree
2024-06-08	slaren	Revert "[SYCL] Update rpc-server.cpp to include SYCL...	commit \| commitdiff \| tree
2024-06-08	Olivier Chafik	url: save -mu downloads to new cache location (#7826)	commit \| commitdiff \| tree
2024-06-08	sasha0552	server : smart slot selection using Longest Common...	commit \| commitdiff \| tree
2024-06-07	slaren	vulkan : reuse parent extra for views (#7806)	commit \| commitdiff \| tree
2024-06-07	Christian Zhou...	gguf-split : change binary multi-byte units to decimal...	commit \| commitdiff \| tree
2024-06-07	intelmatt	cmake : fix BUILD_SHARED_LIBS=ON build (#7784)	commit \| commitdiff \| tree
2024-06-07	Johannes Gäßler	server: update cache_prompt documentation [no ci] ...	commit \| commitdiff \| tree
2024-06-07	woodx	server : do not get prompt in infill mode (#7286)	commit \| commitdiff \| tree
2024-06-07	pengxin99	[SYCL] fix softmax r2r result wrong issue (#7811)	commit \| commitdiff \| tree
2024-06-07	slaren	check for nans in imatrix and quantize (#7807)	commit \| commitdiff \| tree
2024-06-06	Georgi Gerganov	server : fix --threads-http arg (#7801)	commit \| commitdiff \| tree
2024-06-06	Georgi Gerganov	imatrix : migrate to gpt_params (#7771)	commit \| commitdiff \| tree
2024-06-06	Clint Herron	Added support for . (any character) token in grammar...	commit \| commitdiff \| tree
2024-06-06	Mattheus Chediak	README minor fixes (#7798) [no ci]	commit \| commitdiff \| tree
2024-06-06	Olivier Chafik	grammars: x{min,max} repetition operator (#6640)	commit \| commitdiff \| tree
2024-06-06	Joan Fontanals	llama : add jina v2 base code (#7596)	commit \| commitdiff \| tree
2024-06-06	slaren	docker : build only main and server in their images...	commit \| commitdiff \| tree
2024-06-06	slaren	docker : add openmp lib (#7780)	commit \| commitdiff \| tree
2024-06-05	Galunid	Fix encoding in python scripts (#7733)	commit \| commitdiff \| tree
2024-06-05	Johannes Gäßler	CUDA: refactor mmq, dmmv, mmvq (#7716)	commit \| commitdiff \| tree
2024-06-05	Georgi Gerganov	ggml : refactor rope norm/neox (#7634)	commit \| commitdiff \| tree
2024-06-05	arch-btw	readme : remove -ins (#7759)	commit \| commitdiff \| tree
2024-06-04	jaime-m-p	Fix per token atrributes bits (#7749)	commit \| commitdiff \| tree
2024-06-04	agray3	Allow number of nodes in CUDA graph to change (#7738)	commit \| commitdiff \| tree
2024-06-04	Georgi Gerganov	common : refactor cli arg parsing (#7675)	commit \| commitdiff \| tree
2024-06-04	Georgi Gerganov	ggml : remove OpenCL (#7735)	commit \| commitdiff \| tree
2024-06-04	Georgi Gerganov	llama : remove beam search (#7736)	commit \| commitdiff \| tree
2024-06-04	Georgi Gerganov	readme : remove obsolete Zig instructions (#7471)	commit \| commitdiff \| tree
2024-06-04	slaren	llama-bench : allow using a different printer for stder...	commit \| commitdiff \| tree
2024-06-04	Daniele	Improve hipBLAS support in CMake (#7696)	commit \| commitdiff \| tree
2024-06-04	zhouwg	refine .gitignore (#7688)	commit \| commitdiff \| tree
2024-06-04	jaime-m-p	Per token attributes (#7685)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom