]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit
HIP: Patch failed testcase in WMMA-MMQ kernels for RDNA 4 (llama/17502)
authorJiacheng (Jason) Chen <redacted>
Wed, 26 Nov 2025 10:18:48 +0000 (05:18 -0500)
committerGeorgi Gerganov <redacted>
Thu, 11 Dec 2025 13:32:45 +0000 (15:32 +0200)
commitd63409b2b4d634a880b024535418c37c20d4a8d9
tree6a48f07d0cd5685ff4b5071e5e81f5d4ab54fea4
parent563130cac8add8854f47de4cf94ea6592a946768
HIP: Patch failed testcase in WMMA-MMQ kernels for RDNA 4 (llama/17502)

* patch failed test case MUL_MAT(type_a=q4_0,type_b=f32,m=576,n=512,k=576,bs=[1,1],nr=[1,1],per=[0,1,2,3],k_v=0,o=1) for enabling WMMA on RDNA4

* Quick clean up on mma.cuh to add ggml_cuda_memcpy_1 back in for half2 and bfloat162
src/ggml-cuda/mma.cuh
src/ggml-cuda/mmq.cuh