-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] Unable to build against CUDA 12.4 without #1403
Comments
Hi! This is for CUTLASS version 3.1 which was released quite a few months ago (before the release of CUDA 12.4). Are you able to repro this with CUTLASS 3.4? It seems like you are also building on Windows, and our support for windows builds has improved quite a bit since 3.1. |
My latest build on Windows within same environment stuck for several hours against
|
That's an out of memory error so likely an issue with the compiler or the system used to build the kernels? CC @mhoemmen |
I got following error output from ONNX Runtime build trying to engage the "cutlass" as a CMake submodule against CUDA 12.4:
|
@mc-nv Thanks for the error report! We'll take a look at this. |
Hi @mhoemmen any update on this? |
@yf711 wrote:
I'm not sure what it means to "engage the 'cutlass' as a CMake submodule against CUDA 12.4." Could you please post all the CMake options that were given to CUTLASS? Without the list of CMake options, it will be a lot harder for us to try to reproduce this. |
This issue has been labeled |
Hi @mhoemmen OnnxRuntime's CUDA EP includes cutlass header directories to its cmake target:
https://github.com/microsoft/onnxruntime/blob/main/cmake/external/cutlass.cmake So some of its header files can include cutlass header files The compiler errors were shown when building the CUDA EP target. With cutlass 3.1.0:
With cutlass 3.5.0:
|
A colleague has been communicating offline to see if setting the |
|
Thanks @tianleiwu ! : - ) @hwu36 It looks like we can close this issue. |
This issue has been labeled |
Describe the bug
Unable to compile cutlass source code against CUDA 12.4
Steps/Code to reproduce bug
Getting error:
Expected behavior
Shouldn't compile without issues.
Environment details (please complete the following information):
Docker, Bare metal
BUILDTOOLS_VERSION:
17.9.34622.214
CMAKE_VERSION:
3.27.1
CUDA_VERSION:
12.4.0
CUDNN_VERSION:
9.0.0.312
PYTHON_VERSION:
3.8.10
TENSORRT_VERSION:
8.6.1.6
VCPGK_VERSION:
2023.11.20
Additional context
microsoft/onnxruntime#19891
The text was updated successfully, but these errors were encountered: