git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

CUDA: faster tile FA (Pascal/AMD), headsize 256 (#15769)

ggml/src/ggml-cuda/fattn-tile-f16.cu	[deleted file]	blob \| history
ggml/src/ggml-cuda/fattn-tile-f16.cuh	[deleted file]	blob \| history
ggml/src/ggml-cuda/fattn-tile-f32.cu	[deleted file]	blob \| history
ggml/src/ggml-cuda/fattn-tile-f32.cuh	[deleted file]	blob \| history
ggml/src/ggml-cuda/fattn-tile.cu	[new file with mode: 0644]	blob
ggml/src/ggml-cuda/fattn-tile.cuh	[new file with mode: 0644]	blob
ggml/src/ggml-cuda/fattn.cu		diff \| blob \| history

Packaging of ggml-org/llama.cpp