git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/shortlog

overview / pkg / ggml / sources / llama.cpp / shortlog

2024-12-03	Jeff Bolz	vulkan: optimize and reenable split_k (#10637)	commit \| commitdiff \| tree
2024-12-03	Xuan Son Nguyen	server : (web ui) Various improvements, now use vite...	commit \| commitdiff \| tree
2024-12-03	Georgi Gerganov	scripts : remove amx sync	commit \| commitdiff \| tree
2024-12-03	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-12-03	mahorozte	CUDA: remove unnecessary warp reduce in FA (ggml/1032)	commit \| commitdiff \| tree
2024-12-03	PAB	feat: add `GGML_UNARY_OP_ARGMAX` Metal kernel (ggml...	commit \| commitdiff \| tree
2024-12-03	PAB	metal : add `GGML_OP_CONV_TRANSPOSE_1D` kernels (ggml...	commit \| commitdiff \| tree
2024-12-03	Xuan Son Nguyen	llama : add missing LLAMA_API for llama_chat_builtin_te...	commit \| commitdiff \| tree
2024-12-03	Nikolaos Pothitos	readme : add option, update default value, fix formatti...	commit \| commitdiff \| tree
2024-12-03	Georgi Gerganov	metal : small-batch mat-mul kernels (#10581)	commit \| commitdiff \| tree
2024-12-03	Georgi Gerganov	github : minify link [no ci] (revert)	commit \| commitdiff \| tree
2024-12-03	Georgi Gerganov	github : minify link [no ci]	commit \| commitdiff \| tree
2024-12-03	Georgi Gerganov	server : fix default draft model parameters (#10586)	commit \| commitdiff \| tree
2024-12-02	Xuan Son Nguyen	llama : add enum for built-in chat templates (#10623)	commit \| commitdiff \| tree
2024-12-02	Georgi Gerganov	make : deprecate (#10514)	commit \| commitdiff \| tree
2024-12-02	haopeng	server: Add "tokens per second" information in the...	commit \| commitdiff \| tree
2024-12-02	Akarshan Biswas	SYCL: Fix and switch to GGML_LOG system instead of...	commit \| commitdiff \| tree
2024-12-02	Georgi Gerganov	contrib : refresh (#10593)	commit \| commitdiff \| tree
2024-12-01	Juk Armstrong	Add `mistral-v1`, `mistral-v3`, `mistral-v3-tekken...	commit \| commitdiff \| tree
2024-12-01	Georgi Gerganov	grammars : add English-only grammar (#10612)	commit \| commitdiff \| tree
2024-12-01	Wang Qin	ci: add error handling for Python venv creation in...	commit \| commitdiff \| tree
2024-12-01	Diego Devesa	ggml : automatic selection of best CPU backend (#10606)	commit \| commitdiff \| tree
2024-12-01	alek3y	server : bind to any port when specified (#10590)	commit \| commitdiff \| tree
2024-12-01	Georgi Gerganov	readme : update the usage section with examples (#10596)	commit \| commitdiff \| tree
2024-12-01	Wang Qin	build: update Makefile comments for C++ version change...	commit \| commitdiff \| tree
2024-11-30	Adrien Gallouët	ggml-cpu: replace AArch64 NEON assembly with intrinsics...	commit \| commitdiff \| tree
2024-11-30	Georgi Gerganov	readme : remove old badge	commit \| commitdiff \| tree
2024-11-30	Georgi Gerganov	readme : refresh (#10587)	commit \| commitdiff \| tree
2024-11-30	Eve	vulkan: Dynamic subgroup size support for Q6_K mat_vec...	commit \| commitdiff \| tree
2024-11-29	Diego Devesa	ggml : move AMX to the CPU backend (#10570)	commit \| commitdiff \| tree
2024-11-29	Xuan Son Nguyen	server : add more test cases (#10569)	commit \| commitdiff \| tree
2024-11-29	Robert Collins	imatrix : support combine-only (#10492)	commit \| commitdiff \| tree
2024-11-29	Diego Devesa	cleanup UI link list (#10577)	commit \| commitdiff \| tree
2024-11-29	Georgi Gerganov	ggml : fix I8MM Q4_1 scaling factor conversion (#10562)	commit \| commitdiff \| tree
2024-11-29	Shupei Fan	ggml-cpu: fix typo in gemv/gemm iq4_nl_4_4 (#10580)	commit \| commitdiff \| tree
2024-11-29	Alberto Cabrera...	sycl : offload of get_rows set to 0 (#10432)	commit \| commitdiff \| tree
2024-11-29	Alberto Cabrera...	sycl : Reroute permuted mul_mats through oneMKL (#10408)	commit \| commitdiff \| tree
2024-11-29	Chenguang Li	CANN: RoPE operator optimization (#10563)	commit \| commitdiff \| tree
2024-11-29	Jeff Bolz	vulkan: get the first command buffer submitted sooner...	commit \| commitdiff \| tree
2024-11-29	Ting Lou	llava: return false instead of exit (#10546)	commit \| commitdiff \| tree
2024-11-28	Georgi Gerganov	ggml : remove redundant copyright notice + update authors	commit \| commitdiff \| tree
2024-11-28	Georgi Gerganov	llama : add missing model types	commit \| commitdiff \| tree
2024-11-28	Xuan Son Nguyen	server : (tests) don't use thread for capturing stdout...	commit \| commitdiff \| tree
2024-11-28	Johannes Gäßler	common: fix warning message when no GPU found (#10564)	commit \| commitdiff \| tree
2024-11-28	Random Fly	docs: fix outdated usage of llama-simple (#10565)	commit \| commitdiff \| tree
2024-11-28	Diego Devesa	ci : fix tag name in cuda and hip releases (#10566)	commit \| commitdiff \| tree
2024-11-28	Georgi Gerganov	ggml : fix row condition for i8mm kernels (#10561)	commit \| commitdiff \| tree
2024-11-28	Georgi Gerganov	cmake : fix ARM feature detection (#10543)	commit \| commitdiff \| tree
2024-11-28	Shupei Fan	ggml-cpu: support IQ4_NL_4_4 by runtime repack (#10541)	commit \| commitdiff \| tree
2024-11-28	Sergio López	kompute : improve backend to pass test_backend_ops...	commit \| commitdiff \| tree
2024-11-28	Ruixin Huang	CANN: Update cann.md to display correctly in CLion...	commit \| commitdiff \| tree
2024-11-28	leo-pony	CANN: Fix SOC_TYPE compile bug (#10519)	commit \| commitdiff \| tree
2024-11-28	Chenguang Li	CANN: ROPE operator optimization (#10540)	commit \| commitdiff \| tree
2024-11-27	Xuan Son Nguyen	common : fix duplicated file name with hf_repo and...	commit \| commitdiff \| tree
2024-11-27	uvos	Add some minimal optimizations for CDNA (#10498)	commit \| commitdiff \| tree
2024-11-27	Diego Devesa	ci : faster CUDA toolkit installation method and use...	commit \| commitdiff \| tree
2024-11-27	Georgi Gerganov	metal : fix group_norm support condition (#0)	commit \| commitdiff \| tree
2024-11-27	Georgi Gerganov	sync : ggml	commit \| commitdiff \| tree
2024-11-27	Frankie Robertson	Do not include arm_neon.h when compiling CUDA code...	commit \| commitdiff \| tree
2024-11-27	Jeff Bolz	vulkan: define all quant data structures in types.comp...	commit \| commitdiff \| tree
2024-11-27	Jeff Bolz	vulkan: Handle GPUs with less shared memory (#10468)	commit \| commitdiff \| tree
2024-11-27	Jeff Bolz	vulkan: further optimize q5_k mul_mat_vec (#10479)	commit \| commitdiff \| tree
2024-11-27	Jeff Bolz	vulkan: skip integer div/mod in get_offsets for batch_i...	commit \| commitdiff \| tree
2024-11-27	Jeff Bolz	vulkan: optimize Q2_K and Q3_K mul_mat_vec (#10459)	commit \| commitdiff \| tree
2024-11-26	Diego Devesa	ci : fix cuda releases (#10532)	commit \| commitdiff \| tree
2024-11-26	Shane A	Add OLMo 2 model in docs (#10530)	commit \| commitdiff \| tree
2024-11-26	Diego Devesa	ci : remove nix workflows (#10526)	commit \| commitdiff \| tree
2024-11-26	Diego Devesa	llama : disable warnings for 3rd party sha1 dependency...	commit \| commitdiff \| tree
2024-11-26	Tristan Druyen	Fix HIP flag inconsistency & build docs (#10524)	commit \| commitdiff \| tree
2024-11-26	R0CKSTAR	mtgpu: Add MUSA_DOCKER_ARCH in Dockerfiles && update...	commit \| commitdiff \| tree
2024-11-26	Jeff Bolz	vulkan: fix group_norm (#10496)	commit \| commitdiff \| tree
2024-11-26	Xuan Son Nguyen	server : replace behave with pytest (#10416)	commit \| commitdiff \| tree
2024-11-26	Neo Zhang Jianyu	restore the condistion to build & update pacakge when...	commit \| commitdiff \| tree
2024-11-26	Georgi Gerganov	cmake : enable warnings in llama (#10474)	commit \| commitdiff \| tree
2024-11-26	Diego Devesa	ci : publish the docker images created during scheduled...	commit \| commitdiff \| tree
2024-11-26	Diego Devesa	ci : add ubuntu cuda build, build with one arch on...	commit \| commitdiff \| tree
2024-11-26	Charles Xu	ggml-cpu: cmake add arm64 cpu feature check for macos...	commit \| commitdiff \| tree
2024-11-26	Georgi Gerganov	server : fix parallel speculative decoding (#10513)	commit \| commitdiff \| tree
2024-11-26	Georgi Gerganov	speculative : simplify the implementation (#10504)	commit \| commitdiff \| tree
2024-11-26	Shanshan Shen	CANN: Improve the Inferencing Performance for Ascend...	commit \| commitdiff \| tree
2024-11-26	Chenguang Li	CANN: RoPE and CANCAT operator optimization (#10488)	commit \| commitdiff \| tree
2024-11-26	Junil Kim	vulkan: Fix a vulkan-shaders-gen arugment parsing error...	commit \| commitdiff \| tree
2024-11-25	Eric Curtin	Introduce llama-run (#10291)	commit \| commitdiff \| tree
2024-11-25	Diego Devesa	ci : build docker images only once daily (#10503)	commit \| commitdiff \| tree
2024-11-25	Georgi Gerganov	server : add more information about error (#10455)	commit \| commitdiff \| tree
2024-11-25	Georgi Gerganov	server : enable cache_prompt by default (#10501)	commit \| commitdiff \| tree
2024-11-25	Georgi Gerganov	metal : enable mat-vec kernels for bs <= 4 (#10491)	commit \| commitdiff \| tree
2024-11-25	Shane A	Rename Olmo1124 to Olmo2 (#10500)	commit \| commitdiff \| tree
2024-11-25	Diego Devesa	llama : accept a list of devices to use to offload...	commit \| commitdiff \| tree
2024-11-25	Johannes Gäßler	Github: update issue templates [no ci] (#10489)	commit \| commitdiff \| tree
2024-11-25	brucepro	Add download chat feature to server chat (#10481)	commit \| commitdiff \| tree
2024-11-25	Georgi Gerganov	server : add speculative decoding support (#10455)	commit \| commitdiff \| tree
2024-11-25	Diego Devesa	ggml : add support for dynamic loading of backends...	commit \| commitdiff \| tree
2024-11-25	Georgi Gerganov	tests : fix compile warning	commit \| commitdiff \| tree
2024-11-25	Georgi Gerganov	metal : minor code formatting	commit \| commitdiff \| tree
2024-11-25	Neo Zhang Jianyu	[SYCL] Fix building Win package for oneAPI 2025.0 updat...	commit \| commitdiff \| tree
2024-11-25	Georgi Gerganov	speculative : refactor and add a simpler example (...	commit \| commitdiff \| tree
2024-11-24	Georgi Gerganov	flake.lock: Update (#10470)	commit \| commitdiff \| tree
2024-11-24	Diego Devesa	llama : fix op mul check with command-r-plus (#10476)	commit \| commitdiff \| tree
2024-11-24	Gabe Goodhart	convert : XLMRoberta Type Vocab Size (#10458)	commit \| commitdiff \| tree
next

Packaging of ggml-org/llama.cpp

RSS Atom