git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	uvos <redacted>
	Wed, 12 Mar 2025 09:14:11 +0000 (10:14 +0100)
committer	Georgi Gerganov <redacted>
	Thu, 27 Mar 2025 07:35:24 +0000 (09:35 +0200)
commit	57b58207e1714bef01bbe6f6933758511d25d66c
tree	7438c19c3d789c02bec36ad29cc1855bcaf456e6	tree
parent	0c482f7b8eb721f5e00c95a197148b2dddcef47a	commit \| diff

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (llama/12315)

When fattn-wmma was ported over to warp64 various bits that also touch fattn-vec where converted to
selectable warp size, however the fattn-vec kernels dont work with 64 wide warps for now, so we need
to avoid launching them with parameters for warp64

src/ggml-cuda/fattn-common.cuh		diff \| blob \| history
src/ggml-cuda/fattn-wmma-f16.cu		diff \| blob \| history

Packaging of ggml-org/ggml

RSS Atom