git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

author	uvos <redacted>
	Wed, 12 Mar 2025 09:14:11 +0000 (10:14 +0100)
committer	GitHub <redacted>
	Wed, 12 Mar 2025 09:14:11 +0000 (10:14 +0100)
commit	34c961b181836a4f06ab4c56d5ce61ce03fc478b
tree	6c82eaa03b736d7df3bb72105b3a89c44d1640ec
parent	7841fc723e059d1fd9640e5c0ef19050fcc7c698

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32 (#12315)

When fattn-wmma was ported over to warp64, various bits that also touch fattn-vec were converted to
a selectable warp size. However, the fattn-vec kernels don't work with 64-wide warps for now, so we
need to avoid launching them with parameters for warp64.
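
The idea can be illustrated with a small host-side sketch. This is not the code from this commit:
the names FattnKernel, kDefaultWarpSize, LaunchConfig and make_launch_config are hypothetical and
chosen only for illustration. It assumes the launch helper derives its block size from a warp size
and that only the wmma path may use the device's native warp size.

```cpp
#include <cassert>

// Hypothetical names for illustration only; not taken from the llama.cpp sources.
enum class FattnKernel { Vec, Wmma };

// Assumption: the vec kernels are written for 32-wide warps.
constexpr int kDefaultWarpSize = 32;

struct LaunchConfig {
    int warp_size;     // warp size the launch parameters are computed for
    int block_threads; // threads per block = warp_size * nwarps
};

// Pick launch parameters for a flash-attention kernel.
// Only the wmma path follows the device warp size (e.g. 64 on some AMD GPUs);
// the vec path always falls back to the 32-wide default.
LaunchConfig make_launch_config(FattnKernel kind, int device_warp_size, int nwarps) {
    assert(device_warp_size > 0 && nwarps > 0);
    const int warp_size =
        (kind == FattnKernel::Wmma) ? device_warp_size : kDefaultWarpSize;
    return LaunchConfig{warp_size, warp_size * nwarps};
}
```

With this sketch, a vec kernel on a warp64 device is still launched with parameters computed for
32-wide warps, while a wmma kernel on the same device gets parameters for 64-wide warps.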

ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-wmma-f16.cu

Packaging of ggml-org/llama.cpp
