Revert "Use cudf to compute exact hash join output row sizes (#3288)" #3657

jlowe · 2021-09-24T18:48:33Z

This reverts commit 25bad3d.

Fixes #3640. When we switched to building the hash table explicitly, we lost the ability to dynamically choose which table serves as the build side of an inner join. We can certainly do that selection ourselves, but doing it properly will be tricky because the join code assumes that the table designated as the build side is the one used to build the hash table and that the stream side is the only one that can be split.

Since we're in the process of finishing up 21.10, I think it's prudent to revert #3288 and tackle this in a future release.
#2354 tracks the real root issue: an inner join requires everything on an arbitrarily chosen build side to be pulled in at once. Ideally the join should examine both sides to choose the best build side, and have a fallback for when neither table can be pulled in completely to build the final hash table.
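
As a rough illustration of the "examine both sides" idea only (this is not code from this PR or from the plugin), the selection could look something like the Python sketch below. `SideEstimate`, `choose_build_side`, and `budget_bytes` are hypothetical names, and the sketch assumes a byte-size estimate is available for each side, which may not hold in practice.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SideEstimate:
    """Hypothetical size estimate for one input of an inner join."""
    name: str
    size_bytes: Optional[int]  # None when the size is unknown


def choose_build_side(left: SideEstimate,
                      right: SideEstimate,
                      budget_bytes: int) -> Optional[str]:
    """Pick the side to build the hash table from.

    Returns the name of the smaller side that is known to fit within
    budget_bytes, or None when neither side is known to fit, in which
    case the caller needs some fallback strategy.
    """
    candidates = [
        side for side in (left, right)
        if side.size_bytes is not None and side.size_bytes <= budget_bytes
    ]
    if not candidates:
        return None  # neither side can be pulled in completely
    # Prefer the smaller table; on a tie, min() keeps the left side.
    return min(candidates, key=lambda side: side.size_bytes).name
```

Returning `None` only marks the spot where a fallback would be needed; what that fallback should be is the open question tracked in #2354.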


jlowe · 2021-09-24T18:54:53Z

build

Revert "Use cudf to compute exact hash join output row sizes (NVIDIA#…

a25ba9c

…3288)" This reverts commit 25bad3d. Signed-off-by: Jason Lowe <jlowe@nvidia.com>

jlowe added this to the Sep 13 - Sep 24 milestone Sep 24, 2021

jlowe self-assigned this Sep 24, 2021

jlowe mentioned this pull request Sep 24, 2021

[FEA] Use CUDF API for getting join output size #2440

Open

revans2 approved these changes Sep 24, 2021


tgravescs merged commit f08ac9a into NVIDIA:branch-21.10 Sep 24, 2021

jlowe mentioned this pull request Oct 22, 2021

[FEA] AST enabled GpuBroadcastNestedLoopJoin left side can't be small #3832

Closed

sameerz added the task Work required that improves the product but is not user facing label Dec 4, 2021
