[HOTFIX] Overload for atomicAdd on int64 #1735

jrhemstad · 2019-05-14T15:42:18Z

Supersedes #1691

Because #1691 was based off of branch-0.8, I had to cherry-pick it's commits onto a new branch based off of release-0.7.2.

In order to hotfix the abysmal performance of groupby when summing int64, provides an overload for atomicAdd int64 that simply casts to uint64. Because 2s complement is used, this is safe.

Compared to other systems for representing signed numbers (e.g., ones' complement), two's complement has the advantage that the fundamental arithmetic operations of addition, subtraction, and multiplication are identical to those for unsigned binary numbers (as long as the inputs are represented in the same number of bits - as the output, and any overflow beyond those bits is discarded from the result). This property makes the system simpler to implement, especially for higher-precision arithmetic.

https://en.wikipedia.org/wiki/Two%27s_complement

split the file into `device_atomics.cuh` and `device_operators.cuh` separated the difinition of the device operators

move atomicCASImpl(int8 or int16) into typesAtomicCASImpl

simplify atomicMin, atomixMax add cudf::bool8 for atomic test case for atomicAdd,Min,Max add cudf::bool8 specialization for genericAtomicOperation

Add '__forceinline__ __device__' to `W genericAtomicOperator(W)`

Add size check assert between `long long int` and `int64_t`

remove redundant `sizeof(T)` when calling 'typesAtomicCASImpl`

remove redundant `sizeof(T)` when calling 'genericAtomicOperationImpl`

Add native atomicAdd(uint64_t) call for sint64_t

Add comment for `genericAtomicOperationImpl<int64_t, DeviceSum, 8>` why it uses atomicAdd(uint64) inside

Removed `genericAtomicOperation(W)` since it is not invoked for cudf::wrapper types. Merged it into `genericAtomicOperation(T)` Add size check assert at `type_reinterpret`.

CHANGELOG.md

Co-Authored-By: Keith Kraus <keith.j.kraus@gmail.com>

kovaltan added 16 commits May 14, 2019 08:36

Split device_atomics.cuh file

ca2f015

split the file into `device_atomics.cuh` and `device_operators.cuh` separated the difinition of the device operators

Remove atomicCASImpl(int8 or int16)

52a4a83

move atomicCASImpl(int8 or int16) into typesAtomicCASImpl

Simplify atomicAdd

add8914

Simplify atomicMin, atomixMax

9982c45

simplify atomicMin, atomixMax add cudf::bool8 for atomic test case for atomicAdd,Min,Max add cudf::bool8 specialization for genericAtomicOperation

Add more test coverage

23f57bd

Simplify atomicAnd/Or/Xor

6ff0db1

Removed genericAtomicOperationUnderlyingType

3dff68e

Remove typesAtomicOperation32|64

702d6ac

Update doxygen texts for atomics

7aec18c

Add '__forceinline__ __device__'

44845a2

Add '__forceinline__ __device__' to `W genericAtomicOperator(W)`

add static_assert for long long int size

81cd5c3

Add size check assert between `long long int` and `int64_t`

remove redundant sizeof(T) from CASImpl

cd38704

remove redundant `sizeof(T)` when calling 'typesAtomicCASImpl`

remove redundant sizeof(T) from atomic op impl

3db11f5

remove redundant `sizeof(T)` when calling 'genericAtomicOperationImpl`

Add genericAtomicOperationImpl(int64_t, Sum)

7477c88

Add native atomicAdd(uint64_t) call for sint64_t

Add comment for impl of atomicAdd(int64_t)

f3cfe12

Add comment for `genericAtomicOperationImpl<int64_t, DeviceSum, 8>` why it uses atomicAdd(uint64) inside

Removed genericAtomicOperation(W)

e7754ac

Removed `genericAtomicOperation(W)` since it is not invoked for cudf::wrapper types. Merged it into `genericAtomicOperation(T)` Add size check assert at `type_reinterpret`.

jrhemstad requested a review from a team as a code owner May 14, 2019 15:42

jrhemstad added libcudf Affects libcudf (C++/CUDA) code. 4 - Needs Review Waiting for reviewer to review or respond labels May 14, 2019

jrhemstad added 2 commits May 14, 2019 08:43

CHANGELOG.

6e4984e

CHANGELOG.

1c70494

kkraus14 reviewed May 14, 2019

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

Update CHANGELOG.md

9b7a4fc

Co-Authored-By: Keith Kraus <keith.j.kraus@gmail.com>

kkraus14 approved these changes May 14, 2019

View reviewed changes

kkraus14 merged commit 6ef6b24 into rapidsai:release-0.7.2 May 14, 2019

kkraus14 mentioned this pull request May 14, 2019

[REVIEW] improve device atomic overloads for potential issues #1691

Closed

7 tasks

harrism mentioned this pull request May 17, 2019

[BUG] Issues with device atomic overloads for wrapper types #1398

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[HOTFIX] Overload for atomicAdd on int64 #1735

[HOTFIX] Overload for atomicAdd on int64 #1735

jrhemstad commented May 14, 2019

[HOTFIX] Overload for atomicAdd on int64 #1735

[HOTFIX] Overload for atomicAdd on int64 #1735

Conversation

jrhemstad commented May 14, 2019