git.djapps.eu Git - pkg/ggml/sources/whisper.cpp/commit
CUDA/HIP: Share the same unified memory allocation logic. (llama/12934)
author David Huang <redacted>
Tue, 15 Apr 2025 09:20:38 +0000 (17:20 +0800)
committer Georgi Gerganov <redacted>
Thu, 24 Apr 2025 17:39:16 +0000 (20:39 +0300)
commit 43e3d25d9365676ab24ac82e8dee624cbdfe0683
tree 54f7c7364119bd39f048c36940ec7f1a29bc2fdb
parent e1dbf9a42e2948a66a89c93ce5aaa55993336cc7

Replace the compile-time option `GGML_HIP_UMA` with the environment variable `GGML_CUDA_ENABLE_UNIFIED_MEMORY`. This unifies usage across NVIDIA and AMD GPUs and allows a single binary to be shared between integrated and dedicated GPUs.
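
To illustrate the difference between a compile-time switch and a runtime check, here is a minimal sketch of how an allocator might choose its path from the environment. It is not the code from this commit: `AllocKind`, `choose_alloc_kind`, and `device_malloc_sketch` are hypothetical names, host `malloc` stands in for the GPU allocation calls, and treating any value of the variable (even an empty one) as "enabled" is an assumption.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>

// Hypothetical labels for the two allocation paths. In a real backend these
// would correspond to a plain device allocation and a managed (unified)
// allocation respectively.
enum class AllocKind { Device, Managed };

// Runtime check that takes the place of a compile-time #ifdef.
// Assumption: the mere presence of the variable enables unified memory.
inline bool unified_memory_requested() {
    return std::getenv("GGML_CUDA_ENABLE_UNIFIED_MEMORY") != nullptr;
}

inline AllocKind choose_alloc_kind() {
    return unified_memory_requested() ? AllocKind::Managed : AllocKind::Device;
}

struct Allocation {
    void *    ptr;
    AllocKind kind;
};

// Sketch of an allocation entry point. Host memory is used here only so the
// example runs without a GPU; the caller releases it with std::free.
inline Allocation device_malloc_sketch(std::size_t size) {
    assert(size > 0);
    const AllocKind kind = choose_alloc_kind();
    void * p = std::malloc(size);
    return Allocation{p, kind};
}
```

Because the decision is made on each call rather than at build time, the same binary can be launched with or without the variable set, for example `GGML_CUDA_ENABLE_UNIFIED_MEMORY=1 ./program` on an integrated GPU and plain `./program` on a dedicated one.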
ggml/CMakeLists.txt
ggml/src/ggml-cuda/ggml-cuda.cu
ggml/src/ggml-cuda/vendors/hip.h
ggml/src/ggml-hip/CMakeLists.txt