cuda : revert CUDA_SCALE_LAUNCH_QUEUES override until investigated (#19227)
author Gaurav Garg <redacted>
Tue, 3 Feb 2026 06:41:02 +0000 (12:11 +0530)
committer GitHub <redacted>
Tue, 3 Feb 2026 06:41:02 +0000 (08:41 +0200)
commit 41e3f02647be2976c4a302128680ca5983568ae5
tree 9d0ea28e2ea6d7dd3ff2f996f0b19ea88d14227d
parent 1efb5f7ae120c7cc7a33c4d1d82a05b3c50122f6

Hangs were reported on Jetson Orin AGX when CUDA_SCALE_LAUNCH_QUEUES=4x was set. This reverts the previous PR (#19042) and updates the build documentation (docs/build.md) to suggest setting CUDA_SCALE_LAUNCH_QUEUES=4x manually for higher throughput on multi-GPU systems.
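
With the in-code override reverted, the variable is now opt-in. A minimal sketch of the documented workaround, assuming a multi-GPU setup; the `llama-cli` invocation is illustrative and commented out since model path and flags depend on your system:

```shell
# Export the variable for the current shell before launching llama.cpp,
# instead of relying on the reverted in-code override (#19042).
export CUDA_SCALE_LAUNCH_QUEUES=4x
echo "CUDA_SCALE_LAUNCH_QUEUES=$CUDA_SCALE_LAUNCH_QUEUES"
# ./llama-cli -m model.gguf -p "Hello"   # illustrative; adjust for your build
```

Skip this on Jetson Orin AGX, where the setting was reported to cause hangs.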
docs/build.md
ggml/src/ggml-cuda/ggml-cuda.cu