git.djapps.eu Git - pkg/ggml/sources/ggml/commit
Revert "CUDA: add expert reduce kernel (#16857)" (llama/17100)
author Aman Gupta <redacted>
Sat, 8 Nov 2025 13:05:19 +0000 (21:05 +0800)
committer Georgi Gerganov <redacted>
Sun, 9 Nov 2025 16:30:22 +0000 (18:30 +0200)
commit 83623dd1fd6d3d1d9c3a90435495ac1e6a6ecc3f
tree d38101668bf59da4497892ff240ee0a4d5080890
parent 8e7d02098fcf610f3468c37cf290ec7971e5cd12
src/ggml-cuda/ggml-cuda.cu
src/ggml-cuda/moe-expert-reduce.cu [deleted file]
src/ggml-cuda/moe-expert-reduce.cuh [deleted file]
tests/test-backend-ops.cpp