ggml: CUDA: add head size 72 for flash-attn (llama/16962)
author	theo77186 <redacted>
	Mon, 3 Nov 2025 13:29:11 +0000 (14:29 +0100)
committer	Georgi Gerganov <redacted>
	Sun, 9 Nov 2025 16:30:22 +0000 (18:30 +0200)
commit	0f6227f4facd92e6b52ad5de248082834a68832d
tree	6e6d00273bb96e0d467ea0df651e6d397845e7cf
parent	91c1ecc7f41e555f12dc90b793c824b60054a8ec
src/ggml-cuda/fattn-tile.cu
src/ggml-cuda/fattn-tile.cuh
src/ggml-cuda/fattn.cu
src/ggml-cuda/template-instances/fattn-tile-instance-dkq72-dv72.cu [new file with mode: 0644]
src/ggml-cuda/template-instances/generate_cu_files.py
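The new file `fattn-tile-instance-dkq72-dv72.cu` is autogenerated by `generate_cu_files.py`, so supporting a new head size amounts to adding it to the list of generated (DKQ, DV) combinations. The sketch below is illustrative only: the head-size list, macro name `DECL_FATTN_TILE_CASE`, and file layout are assumptions for illustration, not taken verbatim from the actual script.

```python
# Illustrative sketch of a generator like generate_cu_files.py.
# Emits one template-instance .cu file per supported head size;
# the macro name and head-size list are assumptions for illustration.

HEAD_SIZES = [64, 72, 80, 96, 112, 128, 256]  # 72 is the newly added size

def instance_source(dkq: int, dv: int) -> str:
    """Return the source text of one fattn-tile template instance."""
    return (
        "// This file has been autogenerated, do not edit manually.\n\n"
        '#include "../fattn-tile.cuh"\n\n'
        f"DECL_FATTN_TILE_CASE({dkq}, {dv});\n"
    )

def instance_filename(dkq: int, dv: int) -> str:
    return f"fattn-tile-instance-dkq{dkq}-dv{dv}.cu"

# Generate the per-head-size instance files (here kept in memory;
# the real script writes them into template-instances/).
files = {instance_filename(d, d): instance_source(d, d) for d in HEAD_SIZES}
```

Splitting the instantiations into one translation unit per head size keeps each `.cu` file small and lets them compile in parallel, which is why adding head size 72 shows up as a new generated file rather than an edit to a monolithic source.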