git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)
author     Johannes Gäßler <redacted>
           Sat, 24 Aug 2024 19:34:59 +0000 (21:34 +0200)
committer  Georgi Gerganov <redacted>
           Wed, 28 Aug 2024 10:22:20 +0000 (13:22 +0300)
commit     24d8534bd8636e2d5ba9e922e286ddf4b5363296
tree       151f4d881157d13c57acaf850417c25933fbc2f2
parent     9b16ddd3a5094d96ef391fb8205361f6ae13beee
CPU/CUDA: Gemma 2 FlashAttention support (llama/8542)

* CPU/CUDA: Gemma 2 FlashAttention support

* apply logit_softcap to scale in kernel

* disable logit softcapping tests on Metal

* remove metal check
ggml/include/ggml.h
ggml/src/ggml-cuda/fattn-common.cuh
ggml/src/ggml-cuda/fattn-tile-f16.cu
ggml/src/ggml-cuda/fattn-tile-f32.cu
ggml/src/ggml-cuda/fattn-vec-f16.cuh
ggml/src/ggml-cuda/fattn-vec-f32.cuh
ggml/src/ggml-cuda/fattn-wmma-f16.cuh
ggml/src/ggml-cuda/fattn.cu
ggml/src/ggml-metal.m
ggml/src/ggml.c