GPU User Experience Improvements #1283

tbennun · 2023-06-23T18:44:43Z

mcopik · 2023-06-25T22:58:21Z

If I understand correctly, the current proposal is to check for errors after execution. I think it might be worth considering a case where we fail at the first CUDA error, not just check at the end - this will make debugging much easier and faster. We could make it an optional feature if there's any concern that an additional check of a return value from CUDA calls would add an unnecessary performance tax.

Happy to help with this feature by improving & merging my solution for this.

tbennun · 2023-06-26T03:17:23Z

We don’t want the overhead by default, that’s what syncdebug is for. It does what you proposed and more (and even more after this PR).

tbennun · 2023-06-27T14:21:14Z

@mcopik I followed your suggestion in the latest commit

…if extra dimensions specified

phschaad

LGTM, minor comments to consider.

dace/transformation/passes/fusion_inline.py

dace/runtime/include/dace/cuda/stream.cuh

Add post-SDFG error checks for GPU-enabled runs

2a492f8

tbennun self-assigned this Jun 23, 2023

Fix MPI dtor

ecbf806

tbennun added the no-ci Do not run any CI or actions for this PR label Jun 26, 2023

Add informative details on failed kernel launch

aa42eaa

tbennun removed the no-ci Do not run any CI or actions for this PR label Jun 26, 2023

tbennun and others added 5 commits June 26, 2023 17:07

Merge branch 'master' into gpu-ux

f8a98e8

Add Windows support for GPU runtime check, make optional

5c554f0

Add GPU launch bounds property

f221a26

Always check for GPU runtime errors if returned

b6c5f9e

Fix C++ issue

afac4d3

tbennun added 16 commits June 27, 2023 09:58

Check for and warn on empty GPU grids

8fc8412

Prettier code generation

5448343

Propagate GPU runtime error codes instead of throwing exceptions

bbd686b

Better handle casts and never-empty grids

84c668a

Add inaccessible memlet checks in validation

4484d4c

Fix test

73f2d57

Clarify error message

9f17d01

More information when saving invalid file

d7eafef

Minor typos

a72cfd1

Fix important typos and allow GPU transform without simplification

0add7d3

Fix symbolic test

33a96c2

Merge branch 'master' into gpu-ux

69836ca

Further correct symbolic test

1706566

Consider default schedule maps in access validation

8175e5b

Refine validation test on potential failure

d665d34

Check for extra arguments when calling dace.programs

edf0115

tbennun mentioned this pull request Jun 28, 2023

DaCe does not warn on excess arguments to dace.program #1263

Closed

tbennun added 18 commits June 28, 2023 01:27

Fix invalid tests

561f9fb

Fix more broken tests

8b9dae5

Place validation printout at the end of the exception

4c2c9f7

Add nested SDFG parent pointer validation

f1e0e1b

Fix parent-pointing bug in loop to map

86eea02

Fix potentially unbound local

29d89aa

Fix yet another LoopToMap bug

c30bf4d

GPU runtime: Use pre-installed CUB if exists

3c0f505

More informative block warnings and default block size linearization …

ff964a6

…if extra dimensions specified

Fix erroneously detecting 1d kernels as 0d

b302cea

Warn when multiple block sizes are used

3e02efb

Error when gpu_block_size and thread-block map sizes conflict

b870f96

Warnings and errors on mismatching block sizes

b8717a5

Errors for block sizes that are too large

52f95e5

Merge branch 'master' into gpu-ux

68e309b

Fix comments in test

9c61c9d

Fix tests

a6093b9

Run reference fix pass on SDFG after deepcopy

c942f4b

tbennun marked this pull request as ready for review June 29, 2023 03:37

tbennun requested a review from phschaad June 29, 2023 05:02

phschaad approved these changes Jun 29, 2023

View reviewed changes

dace/transformation/passes/fusion_inline.py Outdated Show resolved Hide resolved

dace/runtime/include/dace/cuda/stream.cuh Outdated Show resolved Hide resolved

tbennun added 2 commits June 28, 2023 23:51

Merge branch 'master' into gpu-ux

f4ece77

Apply review suggestions

5d2ce3e

tbennun enabled auto-merge June 29, 2023 06:55

tbennun merged commit 81b3e4e into master Jun 29, 2023
9 checks passed

tbennun deleted the gpu-ux branch June 29, 2023 10:09

tbennun mentioned this pull request Jun 30, 2023

Unsupported grid size in cuda kernels #1266

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GPU User Experience Improvements #1283

GPU User Experience Improvements #1283

tbennun commented Jun 23, 2023 •

edited

Loading

mcopik commented Jun 25, 2023 •

edited

Loading

tbennun commented Jun 26, 2023

tbennun commented Jun 27, 2023

phschaad left a comment

GPU User Experience Improvements #1283

GPU User Experience Improvements #1283

Conversation

tbennun commented Jun 23, 2023 • edited Loading

mcopik commented Jun 25, 2023 • edited Loading

tbennun commented Jun 26, 2023

tbennun commented Jun 27, 2023

phschaad left a comment

Choose a reason for hiding this comment

tbennun commented Jun 23, 2023 •

edited

Loading

mcopik commented Jun 25, 2023 •

edited

Loading