CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361)
author    Aman Gupta <redacted>
          Sat, 28 Jun 2025 17:30:53 +0000 (01:30 +0800)
committer GitHub <redacted>
          Sat, 28 Jun 2025 17:30:53 +0000 (01:30 +0800)
commit 27208bf657cfe7262791df473927225e48efe482
tree   489644a597e720c29d69bfe2b9007cd616ae0bf1
parent 63a7bb3c7e1c6b0a92d03b0a594d3cd501d6ed3e
CUDA: add bf16 and f32 support to cublas_mul_mat_batched (#14361)

* CUDA: add bf16 and f32 support to cublas_mul_mat_batched

* Review: add type traits and make function more generic

* Review: make check more explicit, add back comments, and fix formatting

* Review: fix formatting, remove useless type conversion, fix naming for bools
ggml/src/ggml-cuda/convert.cu
ggml/src/ggml-cuda/convert.cuh
ggml/src/ggml-cuda/ggml-cuda.cu
tests/test-backend-ops.cpp