Use float64 add Atomics, Where Available #3172

njwhite · 2018-07-26T21:57:13Z

Compute Capability cards >= 6.0 support atomic double addition. Use this (instead of numba's CAS-spinning) if available).

@seibert you were right about numba caching LLVM IR globally in an AutoJitCUDAKernel - this needs to be per-compute capability as numba poly-fills functionality where not available.

codecov-io · 2018-07-27T02:54:14Z

Codecov Report

Merging #3172 into master will decrease coverage by 0.02%.
The diff coverage is n/a.

@@            Coverage Diff            @@
##           master   #3172      +/-   ##
=========================================
- Coverage   81.13%   81.1%   -0.03%     
=========================================
  Files         384     386       +2     
  Lines       75077   76398    +1321     
  Branches     8434    8590     +156     
=========================================
+ Hits        60915   61964    +1049     
- Misses      12874   13114     +240     
- Partials     1288    1320      +32

sklam · 2018-08-09T20:44:34Z

numba/cuda/compiler.py

@@ -812,7 +820,7 @@ def _rebuild(cls, func_reduced, bind, targetoptions, config):
    def __reduce__(self):
        """
        Reduce the instance for serialization.
-        Compiled definitions are serialized in PTX form.
+        Compiled definitions are discarded.


Good catch! opened #3210

sklam

The code looks good. Pending buildfarm to report back.

sklam

The following tests are failing:

test_polytyped (numba.cuda.tests.cudapy.test_inspect.TestInspect)
test_debuginfo_in_asm (numba.cuda.tests.cudapy.test_debuginfo.TestCudaDebugInfo)
test_environment_override (numba.cuda.tests.cudapy.test_debuginfo.TestCudaDebugInfo)

njwhite · 2018-08-10T20:43:33Z

@sklam oops, fixed

CAS operation work on bit types.

stuartarchibald · 2018-09-03T22:21:37Z

@sklam I pushed 9575027 merged to master through the smoke test, it passed.

Use float64 add Atomics, Where Available

b0b8902

stuartarchibald added the CUDA CUDA related issue/PR label Jul 30, 2018

seibert requested a review from sklam July 31, 2018 16:36

stuartarchibald added the 3 - Ready for Review label Aug 3, 2018

sklam reviewed Aug 9, 2018

View reviewed changes

sklam added the Pending BuildFarm For PRs that have been reviewed but pending a push through our buildfarm label Aug 9, 2018

sklam approved these changes Aug 9, 2018

View reviewed changes

sklam requested changes Aug 9, 2018

View reviewed changes

fix review feedback

5db9753

Fix test for CC<6.0

9575027

CAS operation work on bit types.

sklam approved these changes Aug 30, 2018

View reviewed changes

stuartarchibald added BuildFarm Passed For PRs that have been through the buildfarm and passed and removed Pending BuildFarm For PRs that have been reviewed but pending a push through our buildfarm labels Sep 3, 2018

seibert merged commit 62ec322 into numba:master Sep 6, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use float64 add Atomics, Where Available #3172

Use float64 add Atomics, Where Available #3172

njwhite commented Jul 26, 2018

codecov-io commented Jul 27, 2018 •

edited

Loading

sklam Aug 9, 2018

sklam left a comment

sklam left a comment

njwhite commented Aug 10, 2018

stuartarchibald commented Sep 3, 2018

Use float64 add Atomics, Where Available #3172

Use float64 add Atomics, Where Available #3172

Conversation

njwhite commented Jul 26, 2018

codecov-io commented Jul 27, 2018 • edited Loading

Codecov Report

sklam Aug 9, 2018

Choose a reason for hiding this comment

sklam left a comment

Choose a reason for hiding this comment

sklam left a comment

Choose a reason for hiding this comment

njwhite commented Aug 10, 2018

stuartarchibald commented Sep 3, 2018

codecov-io commented Jul 27, 2018 •

edited

Loading