CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)
author    Johannes Gäßler <redacted>
          Sat, 24 Aug 2024 19:34:59 +0000 (21:34 +0200)
committer Georgi Gerganov <redacted>
          Tue, 27 Aug 2024 19:01:14 +0000 (22:01 +0300)
commit    141bc623c354579bb6abd9507ac5cbc1165efb64
tree      ae268d400a131f01f449f02a8dd727d2a07eceac
parent    cd8374c1bd992a97761bffe6a3e271bb24a167f0
CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)

* CPU/CUDA: Gemma 2 FlashAttention support

* apply logit_softcap to scale in kernel

* disable logit softcapping tests on Metal

* remove metal check
include/ggml.h
src/ggml-cuda/fattn-common.cuh
src/ggml-cuda/fattn-tile-f16.cu
src/ggml-cuda/fattn-tile-f32.cu
src/ggml-cuda/fattn-vec-f16.cuh
src/ggml-cuda/fattn-vec-f32.cuh
src/ggml-cuda/fattn-wmma-f16.cuh
src/ggml-cuda/fattn.cu
src/ggml-metal.m
src/ggml.c
tests/test-backend-ops.cpp