git.djapps.eu Git - pkg/ggml/sources/ggml/commit

]> git.djapps.eu Git - pkg/ggml/sources/ggml/commit

overview / pkg / ggml / sources / ggml / commit

author	Gaurav Garg <redacted>
	Tue, 3 Feb 2026 06:41:02 +0000 (12:11 +0530)
committer	Georgi Gerganov <redacted>
	Sat, 7 Feb 2026 08:37:38 +0000 (10:37 +0200)
commit	8c2086e7fe98fd222835c12f41358a1f7715f6d3
tree	b4f45030eb57717030f8a08cc44c014d00ea6e34	tree
parent	86a110caf3327e07ec3950119c64fb6a53159cec	commit \| diff

cuda : revert CUDA_SCALE_LAUNCH_QUEUES override until investigated (llama/19227)

Hangs were reported on Jetson Orin AGX if we set CUDA_SCALE_LAUNCH_QUEUES=4x. Reverting the previous PR (#19042) and updating the document to consider setting CUDA_SCALE_LAUNCH_QUEUES=4x for faster throughput on multi-GPU systems.

src/ggml-cuda/ggml-cuda.cu

diff | blob | history

Packaging of ggml-org/ggml

RSS Atom