git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Michael Wand <redacted>
	Thu, 26 Mar 2026 08:54:03 +0000 (01:54 -0700)
committer	GitHub <redacted>
	Thu, 26 Mar 2026 08:54:03 +0000 (09:54 +0100)
commit	112c78159f917c88ca08f74e67359599c3311829
tree	b8492137cd61f5a3eeb6c93346f2d500e6eafad3	tree
parent	0fac87b157305eb82a70902327abffbbce25bd3e	commit \| diff

ggml-cuda: Add NVFP4 dp4a kernel (#20644)

Added check for dst_t to cuda_cast template for float
Restored ggml_cuda_ue4m3_to_fp32, changed vecdot ints to int32ts
Added CUDART/HIP Check and HIP/fp8 include
Added NVFP4 to Test-backend-ops
Added hip_fp8_e4m3 to __nv_fp8_e4m3 typedef

---------

Co-authored-by: Johannes Gäßler <redacted>

Packaging of ggml-org/llama.cpp

RSS Atom

ggml/src/ggml-cuda/common.cuh		diff \| blob \| history
ggml/src/ggml-cuda/convert.cu		diff \| blob \| history
ggml/src/ggml-cuda/ggml-cuda.cu		diff \| blob \| history
ggml/src/ggml-cuda/mmvq.cu		diff \| blob \| history
ggml/src/ggml-cuda/vecdotq.cuh		diff \| blob \| history
ggml/src/ggml-cuda/vendors/cuda.h		diff \| blob \| history
ggml/src/ggml-cuda/vendors/hip.h		diff \| blob \| history
tests/test-backend-ops.cpp		diff \| blob \| history