git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2024-11-08	Jhen-Jie Hong	swift : exclude ggml-metal-embed.metal (#10211)	commit \| commitdiff \| tree
2024-11-07	Xuan Son Nguyen	server : minor UI fix (#10207)	commit \| commitdiff \| tree
2024-11-07	Xuan Son Nguyen	server : revamp chat UI with vuejs and daisyui (#10175)	commit \| commitdiff \| tree
2024-11-07	Georgi Gerganov	scripts : add amx to sync-ggml.sh [no ci]	commit \| commitdiff \| tree
2024-11-07	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-11-07	Georgi Gerganov	scripts : sync update	commit \| commitdiff \| tree
2024-11-07	Diego Devesa	ggml : add ggml-cpu.h to the public headers (#10204)	commit \| commitdiff \| tree
2024-11-07	Faisal Zaghloul	Remove identical wte/etw logic for jais (#10203)	commit \| commitdiff \| tree
2024-11-07	wwoodsTM	DRY: Fixes clone functionality (#10192)	commit \| commitdiff \| tree
2024-11-07	snadampal	fix q4_0_8_8 format for corrupted tokens issue (#10198)	commit \| commitdiff \| tree
2024-11-07	Zhiyuan Li	Optimize RWKV6 Operator Naming and Implement Multi...	commit \| commitdiff \| tree
2024-11-06	Georgi Gerganov	metal : add BF16 support (#8439)	commit \| commitdiff \| tree
2024-11-06	Georgi Gerganov	server : remove hack for extra parallel slot (#10187)	commit \| commitdiff \| tree
2024-11-06	Diego Devesa	metal : fix from ptr buffer name (#10189)	commit \| commitdiff \| tree
2024-11-06	Georgi Gerganov	ggml : adjust is_first_call init value (#10193)	commit \| commitdiff \| tree
2024-11-06	Georgi Gerganov	metal : add quantized FA support (#10149)	commit \| commitdiff \| tree
2024-11-05	Gabe Goodhart	llama : add <\|tool_call\|> formatting to Granite templat...	commit \| commitdiff \| tree
2024-11-04	Diego Devesa	ggml : fix arch check in bf16_to_fp32 (#10164)	commit \| commitdiff \| tree
2024-11-04	Eve	Q6_K AVX improvements (#10118)	commit \| commitdiff \| tree
2024-11-04	Diego Devesa	ggml : fix gelu tables initialization (#10172)	commit \| commitdiff \| tree
2024-11-04	Diego Devesa	ggml : fix q4xx mat mul, increase ggml_aligned_malloc...	commit \| commitdiff \| tree
2024-11-04	Xuan Son Nguyen	server : clarify /slots endpoint, add is_processing...	commit \| commitdiff \| tree
2024-11-04	snadampal	fix build break on arm64 linux (#10166)	commit \| commitdiff \| tree
2024-11-04	Diego Devesa	cuda : clear error after changing peer access (#10153)	commit \| commitdiff \| tree
2024-11-04	Georgi Gerganov	metal : simplify f16 and f32 dequant kernels (#0)	commit \| commitdiff \| tree
2024-11-04	Georgi Gerganov	metal : move dequantize templates to beginning of MSL...	commit \| commitdiff \| tree
2024-11-04	leo-pony	CANN: adjust backend registry refactor. (#10158)	commit \| commitdiff \| tree
2024-11-04	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-11-04	Yuri Khrustalev	cmake : make it possible linking ggml as external lib...	commit \| commitdiff \| tree
2024-11-04	Plamen Minev	metal : fix minor string leaks (ggml/1004)	commit \| commitdiff \| tree
2024-11-03	Diego Devesa	ggml : move CPU backend to a separate file (#10144)	commit \| commitdiff \| tree
2024-11-03	Georgi Gerganov	metal : minor fixup in FA kernel (#10143)	commit \| commitdiff \| tree
2024-11-03	Georgi Gerganov	flake.lock: Update (#10146)	commit \| commitdiff \| tree
2024-11-02	Christian Köhnenkamp	Add apple arm to presets (#10134)	commit \| commitdiff \| tree
2024-11-02	sasha0552	server : fix slot selection by lru (#10126)	commit \| commitdiff \| tree
2024-11-02	Georgi Gerganov	server : fix endpoint checks (#10135)	commit \| commitdiff \| tree
2024-11-02	Georgi Gerganov	llama : adjust default context size + print warnings...	commit \| commitdiff \| tree
2024-11-02	Diego Devesa	simple-chat : only add bos on first prompt (#10129)	commit \| commitdiff \| tree
2024-11-02	Xuan Son Nguyen	convert-lora : make `--base` optional (#10110)	commit \| commitdiff \| tree
2024-11-01	Diego Devesa	llama : add simple-chat example (#10124)	commit \| commitdiff \| tree
2024-11-01	Diego Devesa	llama : use smart pointers for ggml resources (#10117)	commit \| commitdiff \| tree
2024-11-01	Shupei Fan	vulkan : improve ggml_vk_create_buffer error handling...	commit \| commitdiff \| tree
2024-11-01	Georgi Gerganov	readme : update hot topics	commit \| commitdiff \| tree
2024-11-01	sasha0552	server : fix smart selection of available slot (#10120)	commit \| commitdiff \| tree
2024-11-01	Georgi Gerganov	ggml : remove ggml_scratch (#10121)	commit \| commitdiff \| tree
2024-11-01	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-11-01	Georgi Gerganov	ggml : alloc ggml_contexts on the heap (whisper/2525)	commit \| commitdiff \| tree
2024-11-01	Zhenwei Jin	build: fix build error in Windows env with OneAPI setup...	commit \| commitdiff \| tree
2024-10-31	Diego Devesa	llama : improve output buffer type selection (#10098)	commit \| commitdiff \| tree
2024-10-31	Diego Devesa	quantize : fix --keep-split (#10114)	commit \| commitdiff \| tree
2024-10-31	Diego Devesa	llama : fix buffer checks for mamba and rwk (#10111)	commit \| commitdiff \| tree
2024-10-31	Zhenwei Jin	loader: refactor tensor weights storage (#9935)	commit \| commitdiff \| tree
2024-10-31	Kevin Gibbons	server : include scheme when printing URL (#10106)	commit \| commitdiff \| tree
2024-10-31	Diego Devesa	ggml : check tensor name lengths in gguf files (#10100)	commit \| commitdiff \| tree
2024-10-31	Sergio López	kompute: add mul_mat_q4_k shader (#10097)	commit \| commitdiff \| tree
2024-10-30	Sergio López	kompute: add backend registry / device interfaces ...	commit \| commitdiff \| tree
2024-10-30	Diego Devesa	ggml : fix memory leaks when loading invalid gguf files...	commit \| commitdiff \| tree
2024-10-30	Rich Dougherty	readme : more lora detail in main example readme (...	commit \| commitdiff \| tree
2024-10-30	Rich Dougherty	convert : more detailed convert lora usage docs (#10065)	commit \| commitdiff \| tree
2024-10-30	xctan	ggml : add Q4_0_8_8 RISC-V GEMV and GEMM kernels (...	commit \| commitdiff \| tree
2024-10-30	Diego Devesa	llama : refactor model loader with backend registry...	commit \| commitdiff \| tree
2024-10-29	Changyeon Kim	ggml: Add POOL2D OP for GPU acceleration to the Vulkan...	commit \| commitdiff \| tree
2024-10-29	Georgi Gerganov	llama : remove Tail-Free sampling (#10071)	commit \| commitdiff \| tree
2024-10-28	arch-btw	llama : Add IBM granite template (#10013)	commit \| commitdiff \| tree
2024-10-28	Georgi Gerganov	flake.lock: Update (#10063)	commit \| commitdiff \| tree
2024-10-28	R0CKSTAR	musa: workaround for Guilty Lockup in cleaning src0...	commit \| commitdiff \| tree
2024-10-28	Georgi Gerganov	server : don't overfill the batch during infill (#10018)	commit \| commitdiff \| tree
2024-10-27	Georgi Gerganov	llama : switch KQ multiplication to F32 precision by...	commit \| commitdiff \| tree
2024-10-26	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-10-26	bssrdf	increase cuda_cpy block size (ggml/996)	commit \| commitdiff \| tree
2024-10-26	Georgi Gerganov	scripts : fix amx sync [no ci]	commit \| commitdiff \| tree
2024-10-25	Georgi Gerganov	metal : support permuted matrix multiplicaions (#10033)	commit \| commitdiff \| tree
2024-10-25	wwoodsTM	llama : add DRY sampler (#9702)	commit \| commitdiff \| tree
2024-10-25	Michael Podvitskiy	llama: string_split fix (#10022)	commit \| commitdiff \| tree
2024-10-25	Srihari-mcw	llamafile : extend sgemm.cpp support for Q5_0 models...	commit \| commitdiff \| tree
2024-10-25	Georgi Gerganov	server : check that the prompt fits in the slot's conte...	commit \| commitdiff \| tree
2024-10-24	Xuan Son Nguyen	server : refactor slot input data, move tokenizer to...	commit \| commitdiff \| tree
2024-10-24	Georgi Gerganov	ci : fix cmake flags for SYCL	commit \| commitdiff \| tree
2024-10-24	Johannes Gäßler	CUDA: fix insufficient buffer clearing for MMQ (#10032)	commit \| commitdiff \| tree
2024-10-24	Johannes Gäßler	CUDA: fix MMQ for non-contiguous src0, add tests (...	commit \| commitdiff \| tree
2024-10-23	wwoodsTM	server : samplers accept the prompt correctly (#10019)	commit \| commitdiff \| tree
2024-10-23	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-10-23	Georgi Gerganov	llama.vim : bump generation time limit to 3s [no ci]	commit \| commitdiff \| tree
2024-10-23	Johannes Gäßler	CUDA: fix 1D im2col, add tests (ggml/993)	commit \| commitdiff \| tree
2024-10-23	Daniel Bevenius	ggml : remove redundant set of contexts used field...	commit \| commitdiff \| tree
2024-10-23	Michael Coppola	llama.vim : add classic vim support (#9995)	commit \| commitdiff \| tree
2024-10-23	Jun Hee Yoo	metal : add POOL2D and fix IM2COL (#9943)	commit \| commitdiff \| tree
2024-10-23	github-actions...	flake.lock: Update	commit \| commitdiff \| tree
2024-10-22	Xuan Son Nguyen	llama : fix empty batch causing llama_batch_allocr...	commit \| commitdiff \| tree
2024-10-22	Daniel Bevenius	llama : rename batch to ubatch (#9950)	commit \| commitdiff \| tree
2024-10-22	Molly Sophia	Rwkv chat template fix (#10001)	commit \| commitdiff \| tree
2024-10-22	Xuan Son Nguyen	lora : warn user if new token is added in the adapter...	commit \| commitdiff \| tree
2024-10-22	Molly Sophia	llama : add chat template for RWKV-World + fix EOT...	commit \| commitdiff \| tree
2024-10-22	leo-pony	[CANN] Adapt to dynamically loadable backends mechanism...	commit \| commitdiff \| tree
2024-10-22	Daniel Bevenius	arg : fix typo in embeddings argument help [no ci]...	commit \| commitdiff \| tree
2024-10-21	Georgi Gerganov	llama.vim : fix info text display [no ci] (#9787)	commit \| commitdiff \| tree
2024-10-21	Georgi Gerganov	llama.vim : move info to the right of screen [no ci...	commit \| commitdiff \| tree
2024-10-21	Asghar Ghorbani	readme : update UI list (#9972)	commit \| commitdiff \| tree
2024-10-21	Daniel Bevenius	arg : fix attention non-causal arg value hint (#9985)	commit \| commitdiff \| tree
2024-10-21	Georgi Gerganov	llama.vim : plugin for Neovim (#9787)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom