diff --git a/sycl/doc/GetStartedGuide.md b/sycl/doc/GetStartedGuide.md
index 4758292c15543..f20daaabb7354 100644
--- a/sycl/doc/GetStartedGuide.md
+++ b/sycl/doc/GetStartedGuide.md
@@ -641,6 +641,17 @@ clang++ -fsycl -fsycl-targets=amdgcn-amd-amdhsa \
   simple-sycl-app.cpp -o simple-sycl-app-amd.exe
 ```
 
+The target architecture may also be specified for the CUDA backend, with
+`-Xsycl-target-backend --cuda-gpu-arch=`. Specifying the architecture is
+necessary if an application aims to use newer hardware features, such as
+native atomic operations or tensor core operations.
+
+```bash
+clang++ -fsycl -fsycl-targets=nvptx64-nvidia-cuda \
+  simple-sycl-app.cpp -o simple-sycl-app-cuda.exe \
+  -Xsycl-target-backend --cuda-gpu-arch=sm_80
+```
+
 To build simple-sycl-app ahead of time for GPU, CPU or Accelerator devices,
 specify the target architecture. The examples provided use a supported
 alias for the target, representing a full triple. Additional details can