[REVIEW] improve device atomic overloads for potential issues #1691

kovaltan · 2019-05-09T10:44:22Z

Update/improve device atomic overloads for potential issues, and simplify the implementation.
Then, it would be easy to implement when cudf changes the underlying type of the wrappers,
or cudf introduces a new data type.
And this PR also introduce cuda native atomicAdd support for signed int64_t. I didn't implemented it yet since cuda does not have signed long long int overload, however, two's complement representation of signed integer has the advantage that the fundamental arithmetic operations of addition, subtraction, and multiplication are identical to those for unsigned binary numbers. See also: https://en.wikipedia.org/wiki/Two%27s_complement

separate device binary operators definition into device_operators.cuh
move atomicCASImpl(int8 or int16) into typesAtomicCASImpl
device atomic overloads independent from the specific underlying type of the wrappers
remove genericAtomicOperationUnderlyingType
remove typesAtomicOperation32|64
add cudf::bool8 test cases
add cuda native atomicAdd support for signed int64_t

Closes #1398
Related #1685

split the file into `device_atomics.cuh` and `device_operators.cuh` separated the difinition of the device operators

move atomicCASImpl(int8 or int16) into typesAtomicCASImpl

simplify atomicMin, atomixMax add cudf::bool8 for atomic test case for atomicAdd,Min,Max add cudf::bool8 specialization for genericAtomicOperation

jrhemstad

I love these simplifications. Much cleaner and easier to understand.

cpp/src/utilities/device_atomics.cuh

…date

harrism

Looks good in general. One concern and one question.

cpp/src/utilities/device_atomics.cuh

Add '__forceinline__ __device__' to `W genericAtomicOperator(W)`

Add size check assert between `long long int` and `int64_t`

…date

remove redundant `sizeof(T)` when calling 'typesAtomicCASImpl`

remove redundant `sizeof(T)` when calling 'genericAtomicOperationImpl`

kovaltan · 2019-05-13T12:59:15Z

rerun tests

kovaltan · 2019-05-13T23:48:34Z

rerun tests

cpp/src/utilities/device_atomics.cuh

Add native atomicAdd(uint64_t) call for sint64_t

Add comment for `genericAtomicOperationImpl<int64_t, DeviceSum, 8>` why it uses atomicAdd(uint64) inside

Removed `genericAtomicOperation(W)` since it is not invoked for cudf::wrapper types. Merged it into `genericAtomicOperation(T)` Add size check assert at `type_reinterpret`.

jrhemstad · 2019-05-14T14:05:12Z

cpp/src/utilities/device_atomics.cuh

-}
-
-// specialization for cudf::detail::wrapper types
-template <typename T, gdf_dtype dtype, typename BinaryOp, typename W = cudf::detail::wrapper<T, dtype> >


This shouldn't be removed. I rely on it in #1478

It will need to be added back in a future PR.

kkraus14 · 2019-05-14T18:38:08Z

Fixed by #1735

kovaltan added 5 commits May 9, 2019 15:58

Split device_atomics.cuh file

6dfb591

split the file into `device_atomics.cuh` and `device_operators.cuh` separated the difinition of the device operators

Remove atomicCASImpl(int8 or int16)

24c42e2

move atomicCASImpl(int8 or int16) into typesAtomicCASImpl

Simplify atomicAdd

9914f20

Simplify atomicMin, atomixMax

d538af5

simplify atomicMin, atomixMax add cudf::bool8 for atomic test case for atomicAdd,Min,Max add cudf::bool8 specialization for genericAtomicOperation

Add more test coverage

c3ce65a

kovaltan requested a review from a team as a code owner May 9, 2019 10:44

Simplify atomicAnd/Or/Xor

52c715d

jrhemstad requested changes May 9, 2019

View reviewed changes

cpp/src/utilities/device_atomics.cuh Show resolved Hide resolved

kovaltan added 5 commits May 10, 2019 15:33

Removed genericAtomicOperationUnderlyingType

0993907

Remove typesAtomicOperation32|64

857c552

Update doxygen texts for atomics

76f30bd

Update changelog

59cacf8

Merge remote-tracking branch 'upstream/branch-0.8' into bug_atomic_up…

440d158

…date

kovaltan self-assigned this May 10, 2019

kovaltan changed the title ~~[WIP] improve device atomic overloads for potential issues~~ [REVIEW] improve device atomic overloads for potential issues May 10, 2019

kovaltan added 3 - Ready for Review Ready for review by team 4 - Needs Review Waiting for reviewer to review or respond libcudf Affects libcudf (C++/CUDA) code. labels May 10, 2019

kovaltan requested a review from jrhemstad May 10, 2019 07:56

harrism requested changes May 13, 2019

View reviewed changes

cpp/src/utilities/device_atomics.cuh Show resolved Hide resolved

cpp/src/utilities/device_atomics.cuh Outdated Show resolved Hide resolved

kovaltan added 5 commits May 13, 2019 13:43

Add '__forceinline__ __device__'

ab3c2da

Add '__forceinline__ __device__' to `W genericAtomicOperator(W)`

add static_assert for long long int size

f866f6e

Add size check assert between `long long int` and `int64_t`

Merge remote-tracking branch 'upstream/branch-0.8' into bug_atomic_up…

99bb770

…date

remove redundant sizeof(T) from CASImpl

fa9b617

remove redundant `sizeof(T)` when calling 'typesAtomicCASImpl`

remove redundant sizeof(T) from atomic op impl

586d208

remove redundant `sizeof(T)` when calling 'genericAtomicOperationImpl`

kovaltan requested a review from harrism May 13, 2019 13:03

jrhemstad requested changes May 14, 2019

View reviewed changes

cpp/src/utilities/device_atomics.cuh Show resolved Hide resolved

Add genericAtomicOperationImpl(int64_t, Sum)

d3e433c

Add native atomicAdd(uint64_t) call for sint64_t

harrism approved these changes May 14, 2019

View reviewed changes

harrism mentioned this pull request May 14, 2019

[REVIEW] Fix slow count groupby aggregations. #1730

Closed

kovaltan added 2 commits May 14, 2019 15:25

Add comment for impl of atomicAdd(int64_t)

96e1037

Add comment for `genericAtomicOperationImpl<int64_t, DeviceSum, 8>` why it uses atomicAdd(uint64) inside

Removed genericAtomicOperation(W)

edc14cb

Removed `genericAtomicOperation(W)` since it is not invoked for cudf::wrapper types. Merged it into `genericAtomicOperation(T)` Add size check assert at `type_reinterpret`.

kovaltan mentioned this pull request May 14, 2019

groupby() super slow on branch-0.7[BUG] #1685

Closed

jrhemstad changed the base branch from branch-0.8 to release-0.7.2 May 14, 2019 14:25

jrhemstad approved these changes May 14, 2019

View reviewed changes

jrhemstad requested review from a team as code owners May 14, 2019 14:29

jrhemstad changed the base branch from release-0.7.2 to branch-0.8 May 14, 2019 14:47

jrhemstad force-pushed the bug_atomic_update branch from 9a3a178 to edc14cb Compare May 14, 2019 14:49

kovaltan requested a review from a team May 14, 2019 14:49

jrhemstad mentioned this pull request May 14, 2019

[HOTFIX] Overload for atomicAdd on int64 #1735

Merged

kkraus14 closed this May 14, 2019

kovaltan mentioned this pull request May 17, 2019

[BUG] Issues with device atomic overloads for wrapper types #1398

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[REVIEW] improve device atomic overloads for potential issues #1691

[REVIEW] improve device atomic overloads for potential issues #1691

kovaltan commented May 9, 2019 •

edited

Loading

jrhemstad left a comment

harrism left a comment

kovaltan commented May 13, 2019

kovaltan commented May 13, 2019

jrhemstad May 14, 2019

kkraus14 commented May 14, 2019

[REVIEW] improve device atomic overloads for potential issues #1691

[REVIEW] improve device atomic overloads for potential issues #1691

Conversation

kovaltan commented May 9, 2019 • edited Loading

jrhemstad left a comment

Choose a reason for hiding this comment

harrism left a comment

Choose a reason for hiding this comment

kovaltan commented May 13, 2019

kovaltan commented May 13, 2019

jrhemstad May 14, 2019

Choose a reason for hiding this comment

kkraus14 commented May 14, 2019

kovaltan commented May 9, 2019 •

edited

Loading