-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
nccl.h not found when compiling from source #8504
Comments
CC: @tonyyang-svail @dzhwinter could you guys take a look, we probably should support compiling without docker. |
Hi @yu239 , we switched to nccl2, please install the dependency using nccl2 is closed source comparing to nccl1, so we can not use cmake to download the source and compile. Maybe manually install it using apt-get is the best solution. |
@helinwang Is it necessary to add a build option to switch off nccl dependency? |
The nccl2 is not open sourced anymore, NVIDIA provide the cuda docker image with ppa(the apt source url) included inside, so we can make a |
Should we remain nccl1?
|
Does our codebase (develop branch) still depend on NCCL 1? If not, let us remove nccl.cmake. @luotao1 |
Our codebase still works well with NCCL 1, and @dzhwinter will update the nccl.cmake later to get the compatibility of the NCCL 1 and 2. |
In our codebase, we provide NCCL as a DSO(Dynamic Shared Library) library. It means that we only use a nccl.h to compile, no more static library is depended. According to the NCCL installl guide https://docs.nvidia.com/deeplearning/sdk/pdf/NCCL-Installation-Guide.pdf , we have the dependency relation below. nccl2.1.4(latest) -> cuda9.0 or higher To make our Multi-GPU supported in more platform, we still need the nccl1 to compatible with older CUDA version. |
See the reopened #5035.
The text was updated successfully, but these errors were encountered: