]> git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
cuda : Enable CUDA Graph usage for Nemotron Nano v2 (NemotronH) (llama/16328)
authoranavp-nvidia <redacted>
Tue, 30 Sep 2025 08:13:22 +0000 (08:13 +0000)
committerGeorgi Gerganov <redacted>
Tue, 30 Sep 2025 09:31:04 +0000 (12:31 +0300)
commit62b3b86e3f1db6f84457d5605920fa6e07cc4958
tree1a5fe74fc4075e64a7543e87d13347af97666207
parent78f85f2b929db5bbadc5cdc9d21e468d7ca9d6f1
cuda : Enable CUDA Graph usage for Nemotron Nano v2 (NemotronH) (llama/16328)

* Fix Nemotron Nano v2 9B not executing as CUDA Graph on NVIDIA GPUs

* fix to ensure test-backend-ops check passes
ggml/src/ggml-cuda/cpy.cu
ggml/src/ggml-cuda/ggml-cuda.cu