git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commit

overview / pkg / ggml / sources / llama.cpp / commit

author	Jeff Bolz <redacted>
	Thu, 5 Feb 2026 15:26:38 +0000 (09:26 -0600)
committer	GitHub <redacted>
	Thu, 5 Feb 2026 15:26:38 +0000 (09:26 -0600)
commit	449ec2ab0751fc713fe338da2ced153125b5c674
tree	c54c02a6549bc770e1a2c747de5b6e321b1ec84b	tree
parent	3795cc1e89e16fbc145f8a5457ea30abd86e0d1d	commit \| diff

vulkan: Preprocess FA mask to detect all-neg-inf and all-zero. (#19281)

Write out a 2-bit code per block and avoid loading the mask when it
matches these two common cases.

Apply this optimization when the mask is relatively large (i.e. prompt
processing).

Packaging of ggml-org/llama.cpp

RSS Atom

ggml/src/ggml-vulkan/ggml-vulkan.cpp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn.comp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_base.glsl		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm1.comp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_cm2.comp		diff \| blob \| history
ggml/src/ggml-vulkan/vulkan-shaders/flash_attn_mask_opt.comp	[new file with mode: 0644]	blob
ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp		diff \| blob \| history
tests/test-backend-ops.cpp		diff \| blob \| history