-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Build] Unable to build ONNX Runtime against CUDA 12.5 #20953
Comments
cc: @pranavsharma |
Hi @mc-nv Is your test branch including this commit? I tried building latest main with cuda 12.5 and didn't repro this issue |
@yf711 my build is based on |
I see. rel-1.18.1 will include this fix and support cuda 12.5, which is planning to be released on 6/17 |
Can we have that branch available? |
The branch will be available for testing by the end of next week (6/14) |
### Description Upgrade cutlass to 3.5 to fix build errors using CUDA 12.4 or 12.5 in Windows - [x] Upgrade cutlass to 3.5.0. - [x] Fix flash attention build error with latest cutlass header files and APIs. This fix is provided by @wangyems. - [x] Update efficient attention to use new cutlass fmha interface. - [x] Patch cutlass to fix `hrsqrt` not found error for sm < 53. - [x] Disable TF32 Staged Accumulation to fix blkq4_fp16_gemm_sm80_test build error for cuda 11.8 to 12.3. - [x] Disable TRT 10 deprecate warnings. The following are not included in this PR: * TRT provider replaces the deprecated APIs. * Fix blkq4_fp16_gemm_sm80_test build error for cuda 12.4 or 12.5. This test is not built by default unless you add `--cmake_extra_defines onnxruntime_ENABLE_CUDA_EP_INTERNAL_TESTS=ON` in build command. To integrate to rel-1.18.1: Either bring in other changes (like onnx 1.16.1), or generate manifest and upload a new ONNX Runtime Build Time Deps artifact based on rel-1.18.1. ### Motivation and Context #19891 #20924 #20953
Use branch rel-1.18.1, and use |
Describe the issue
Unable to build ONNX Runtime against CUDA 12.5.
Urgency
It's quiet important as it may impact Triton 24.06 scope.
Target platform
Windows
Build branch
rel-1.18.0
Build script
Error / output
Visual Studio Version
No response
GCC / Compiler Version
No response
The text was updated successfully, but these errors were encountered: