git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog


2024-06-23	slaren	fix CI failures (#8066)
2024-06-23	0cc4m	Refactor Vulkan backend to allow multiple contexts...
2024-06-22	Clint Herron	Removing extra blank lines that were breaking Lint...
2024-06-22	Xuan Son Nguyen	cvector: fix CI + correct help message (#8064)
2024-06-22	HatsuneMikuUwU33	cvector-generator: Moe Moe Fixie-Fixie for Lots of...
2024-06-22	0xspringtime	convert-hf : change assert to exception (#8015)
2024-06-22	ddh0	Update llama-quantize ppl/file size output from LLaMA...
2024-06-22	Clint Herron	JSON Schema to GBNF integration tests (#7790)
2024-06-21	k.h.lai	vulkan: detect multiple devices by deviceUUID instead...
2024-06-21	Eve	ggml : AVX IQ quants (#7845)
2024-06-21	Georgi Gerganov	llama : optimize long word tokenization with WPM (...
2024-06-21	Douglas Hanley	llama : allow pooled embeddings on any model (#7477)
2024-06-21	Shuichi Tsutsumi	swiftui : enable stream updating (#7754)
2024-06-20	Hamdoud Hakem	requirements : Bump torch and numpy for python3.12...
2024-06-20	Hamdoud Hakem	convert-hf : Fix the encoding in the convert-hf-to...
2024-06-20	Johannes Gäßler	common: fix warning (#8036)
2024-06-20	luoyu-intel	[SYCL] Fix windows build and inference (#8003)
2024-06-20	Johannes Gäßler	CUDA: stream-k decomposition for MMQ (#8018)
2024-06-20	Michael de...	metal : fix `ggml_metal_supports_op` for BF16 (#8021)
2024-06-19	sasha0552	server : fix smart slot selection (#8020)
2024-06-19	Michael de...	un-ignore `build-info.cmake` and `build-info.sh` (...
2024-06-19	slaren	ggml : synchronize threads using barriers (#7993)
2024-06-19	Georgi Gerganov	codecov : remove (#8004)
2024-06-19	Meng, Hengyu	[SYCL] refactor (#6408)
2024-06-18	jaime-m-p	tokenizer : BPE fixes (#7530)
2024-06-18	Sigbjørn Skjæret	Only use FIM middle token if it exists (#7648)
2024-06-18	jojorne	Fix no gcc pragma on Windows (#7751)
2024-06-18	Ulrich Drepper	Allow compiling with CUDA without CUDA runtime installe...
2024-06-18	Frank Mai	chore: clean useless beam search param (#7985)
2024-06-18	Abheek Gulati	readme : update UI list (#7943)
2024-06-18	Georgi Gerganov	ggml : sync
2024-06-18	Georgi Gerganov	whisper : use ggml_backend_sched (whisper/2239)
2024-06-17	Ștefan-Gabriel...	update: support Qwen2-57B-A14B (#7835)
2024-06-17	Srihari-mcw	Make updates to type cast based on compiler instead...
2024-06-17	Georgi Gerganov	llama : disable FA if KV head size do not match (#7982)
2024-06-17	Bryan Honof	Add Nix and Flox install instructions (#7899)
2024-06-17	slaren	sched : offload_op also requires supports_op (#7977)
2024-06-17	Frank Mai	fix: divide 0 exception in mamba (#7932)
2024-06-17	Markus Tavenrath	Implement non-mapped async IO for CUDA on Windows....
2024-06-17	Georgi Gerganov	rpc : fix load/store misaligned addresses (#7948)
2024-06-17	Brian	gguf-dump.py: add --markdown dump output (#7853)
2024-06-17	Neo Zhang	[SYCL] Update README-sycl.md for Chapter "Recommended...
2024-06-16	Calvin Laurenson	Add support for sqrt on CUDA (#7953)
2024-06-16	Georgi Gerganov	cuda : fix bounds check for src0 rows in MMVQ kernel...
2024-06-16	Hong Bo PENG	ggml : fix and optimize ppc64le (ggml/849)
2024-06-16	Daniel Bevenius	ggml : remove duplicate include of ggml-common.h (ggml...
2024-06-16	Georgi Gerganov	flake.lock: Update (#7951)
2024-06-16	Georgi Gerganov	unicode : avoid char32_t (#7957)
2024-06-16	hopkins385	readme : update UI list [no ci] (#7958)
2024-06-16	Georgi Gerganov	ggml : fix handling of zero blocks in IQ quants (#7955)
2024-06-16	Georgi Gerganov	github : update pr template
2024-06-16	0cc4m	Vulkan Shader Refactor, Memory Debugging Option (#7947)
2024-06-15	Xuan Son Nguyen	Add `cvector-generator` example (#7514)
2024-06-15	Meng, Hengyu	[SYCL] remove global variables (#7710)
2024-06-14	olexiyb	ci : fix macos x86 build (#7940)
2024-06-14	Johannes Gäßler	CUDA: faster q2_K, q3_K MMQ + int8 tensor cores (#7921)
2024-06-14	Georgi Gerganov	metal : utilize max shared memory for mul_mat_id (...
2024-06-14	Radoslav Gerganov	llama-bench : fix RPC indication (#7936)
2024-06-14	Sigbjørn Skjæret	llama : more checks before assuming FIM tokens (#7644)
2024-06-14	Elaine	convert : add Poro-34B-chat tokenizer support (#7713)
2024-06-13	Radoslav Gerganov	rpc : fix ggml_backend_rpc_supports_buft() (#7918)
2024-06-13	Galunid	readme : Remove outdated instructions from README.md...
2024-06-13	slaren	move BLAS to a separate backend (#6210)
2024-06-12	Olivier Chafik	`build`: rename main → llama-cli, server → llama-server...
2024-06-12	Johannes Gäßler	CUDA: fix broken oob check for FA vec f32 kernel (...
2024-06-12	Georgi Gerganov	tests : add non-cont unary tests (#7857)
2024-06-12	Georgi Gerganov	ggml : improve ggml_is_contiguous logic (#7856)
2024-06-12	Georgi Gerganov	server : restore numeric prompts (#7883)
2024-06-12	Meng, Hengyu	update intel docker oneapi-basekit to 2024.1.1-devel...
2024-06-12	Patrice Ferlet	Fix a typo and add Fedora 40 pacakge to install for...
2024-06-11	k.h.lai	vulkan: select only one device for single gpu with...
2024-06-11	0cc4m	Update Vulkan RoPE implementation (#7818)
2024-06-11	Deven Mistry	fix broken link in pr template (#7880) [no ci]
2024-06-11	Brian	github: move PR template to .github/ root (#7868)
2024-06-11	Johannes Gäßler	llama-bench: more compact markdown tables (#7879)
2024-06-11	Georgi Gerganov	tests : check the Python version (#7872)
2024-06-11	Johannes Gäßler	CUDA: int8 tensor cores for MMQ (q4_K, q5_K, q6_K)...
2024-06-11	slaren	fix CUDA CI by using a windows-2019 image (#7861)
2024-06-11	Olivier Chafik	json: refine constraint for whitespace to avoid runaway...
2024-06-11	Olivier Chafik	`json`: document schema conversion in GBNF readme,...
2024-06-10	Jared Van Bortel	cmake : fix CMake requirement for CUDA (#7821)
2024-06-10	slaren	ci : try win-2019 on server windows test (#7854)
2024-06-10	Georgi Gerganov	examples : remove --instruct remnants (#7846)
2024-06-10	Georgi Gerganov	server : improve "prompt" handling (#7847)
2024-06-10	Johannes Gäßler	CUDA: use tensor cores for MMQ (#7676)
2024-06-10	Ben Ashbaugh	use the correct SYCL context for host USM allocations...
2024-06-09	Georgi Gerganov	flake.lock: Update (#7838)
2024-06-09	Georgi Gerganov	imatrix : handle partial entries (#7833)
2024-06-09	Nicolás Pérez	docs: Added initial PR template with directions for...
2024-06-09	mgroeber9110	server: do not remove whitespace at the start of a...
2024-06-09	Johannes Gäßler	CUDA: revise q8_1 data layout for mul_mat_q (#7824)
2024-06-09	sasha0552	convert-hf : set the model name based on cli arg, if...
2024-06-09	compilade	convert-hf : match model part name prefix and suffix...
2024-06-09	compilade	gguf-py : decouple adding metadata from writing in...
2024-06-08	slaren	Revert "[SYCL] Update rpc-server.cpp to include SYCL...
2024-06-08	Olivier Chafik	url: save -mu downloads to new cache location (#7826)
2024-06-08	sasha0552	server : smart slot selection using Longest Common...
2024-06-07	slaren	vulkan : reuse parent extra for views (#7806)
2024-06-07	Christian Zhou...	gguf-split : change binary multi-byte units to decimal...
2024-06-07	intelmatt	cmake : fix BUILD_SHARED_LIBS=ON build (#7784)

Packaging of ggml-org/llama.cpp
