Don't do an extra shuffle in some TopN cases #2536

Merged: 1 commit into NVIDIA:branch-21.06 on Jun 1, 2021

revans2 (Collaborator) commented May 28, 2021

This is a very small performance improvement for TakeOrderedAndProject. It skips the extra shuffle when the input partitioning is already a single partition. This can happen when doing a window operation with no partitioning. It is not a common case, but it can help.
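The idea can be sketched with a toy model. This is not the plugin's code: it is plain Python with no Spark involved, the function name `top_n` and the list-of-lists representation of partitions are hypothetical, and it assumes the usual top-N shape of "local top-N per partition, shuffle the survivors to one partition, final top-N".

```python
import heapq
from typing import List, Tuple


def top_n(partitions: List[List[int]], n: int) -> Tuple[List[int], bool]:
    """Toy model of a top-N plan (hypothetical, not the plugin's API).

    Returns the n smallest values in ascending order, plus a flag saying
    whether the modeled plan needed a shuffle to a single partition.
    """
    if len(partitions) == 1:
        # Input is already a single partition: one sort-and-limit pass
        # gives the final answer, so the shuffle step is skipped.
        return heapq.nsmallest(n, partitions[0]), False

    # General case: take the local top-n in every partition first ...
    local = [heapq.nsmallest(n, part) for part in partitions]
    # ... then model the shuffle by gathering the survivors into one partition ...
    single = [row for part in local for row in part]
    # ... and apply the final top-n on that single partition.
    return heapq.nsmallest(n, single), True


rows, shuffled = top_n([[5, 1, 4, 2, 3]], 2)
print(rows, shuffled)
```

Both paths return the same rows; the single-partition path just avoids moving data that is already in one place.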

Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
revans2 added the performance label (A performance related task/issue) May 28, 2021
revans2 added this to the May 24 - Jun 4 milestone May 28, 2021
revans2 self-assigned this May 28, 2021
revans2 (Collaborator, Author) commented May 28, 2021

build

revans2 merged commit 061095f into NVIDIA:branch-21.06 Jun 1, 2021
revans2 deleted the take_ordered_perf branch June 1, 2021 12:51
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
Signed-off-by: Robert (Bobby) Evans <bobby@apache.org>
Labels
performance A performance related task/issue