git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Jeff Bolz <redacted>
	Sat, 16 Aug 2025 09:18:31 +0000 (04:18 -0500)
committer	GitHub <redacted>
	Sat, 16 Aug 2025 09:18:31 +0000 (11:18 +0200)
commit	de2192794f4e8e04f2e8167ef2424905145e88fc
tree	0fe24209577711ae492b3285164600b28d244401	tree
parent	2e2b22ba6607414a5d619ac6d2f034b5b02214e5	commit \| diff

vulkan: Support mul_mat_id with f32 accumulators (#15337)

* vulkan: Add missing bounds checking to scalar/coopmat1 mul_mat_id

* vulkan: Support mul_mat_id with f32 accumulators, but they are not hooked up

- There's no explicit way to request f32 precision for mul_mat_id, but there
probably should be, and this gets the code in place for that.
- A couple fixes to check_results.
- Remove casts to fp16 in coopmat1 FA shader (found by inspection).

ggml/src/ggml-vulkan/ggml-vulkan.cpp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm1.comp		diff \| blob \| history

Packaging of ggml-org/llama.cpp

RSS Atom