[FEA] Consider creating combined GpuCoalesceBatches and GpuShuffleExchange operator #719

andygrove · 2020-09-10T15:33:24Z

Is your feature request related to a problem? Please describe.
When AQE is enabled and we are planning a new query stage, we must return an operator that implements ShuffleExchangeLike (since Spark 3.0.1) so we remove any GpuCoalesceBatches operator and insert it later around the GpuCustomShuffleReader that will read the shuffle output.

I think it is worth exploring an alternate approach where instead of removing the GpuCoalesceBatches operator, we create a new operator that combines GpuCoalesceBatches and GpuShuffleExchangeExec and returns that as the new query stage.

The benefit of this approach if it works is that it makes the AQE and non-AQE plans more consistent and removes some complexity. It may also result in improved performance if it means that the shuffle reader is now reading coalesced batches, but I'm not 100% sure if I am understanding this correctly, so could do with a second opinion on this.

Describe the solution you'd like
See the previous section.

Describe alternatives you've considered
The alternative is the current design of coalescing after the shuffle reader.

Additional context
N/A

The text was updated successfully, but these errors were encountered:

JustPlay · 2020-09-11T01:59:02Z

It may also result in improved performance if it means that the shuffle reader is now reading coalesced batches

Will It may also result in improved performance if it means that the shuffle reader is now reading coalesced batches be helpful for this #679 (the gpu semaphore limit cpu concurrency)

revans2 · 2020-09-11T16:13:15Z

Will It may also result in improved performance if it means that the shuffle reader is now reading coalesced batches be helpful for this #679 (the gpu semaphore limit cpu concurrency)

I personally don't think the shuffle reader will be able to read coalesced batches. The reason the code is doing a shuffle is because the data needs to move from one location to another so that the desired data is all in the same location for processing. Even with AQE where on the reading side we might end up in some cases reading more batches then we originally thought, the data has already been partitioned N ways assuming that it will be read N ways. With how the data is laid out we will still need to combine the data together at some point.

The only real advantage of putting them together for #679 would be in batching and signalling, like I talked about. It would let us bypass some of the complexity in trying to coordinate between the shuffle and coalesce.

andygrove · 2020-11-06T00:07:20Z

I am closing this now that I understand this better and I agree with @revans2 comment that the shuffle reader would not be able to read coalesced batches.

…IDIA#719) Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com> Signed-off-by: spark-rapids automation <70000568+nvauto@users.noreply.github.com>

andygrove added feature request New feature or request ? - Needs Triage Need team to review and classify labels Sep 10, 2020

sameerz added P2 Not required for release performance A performance related task/issue and removed ? - Needs Triage Need team to review and classify labels Sep 15, 2020

andygrove closed this as completed Nov 6, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEA] Consider creating combined GpuCoalesceBatches and GpuShuffleExchange operator #719

[FEA] Consider creating combined GpuCoalesceBatches and GpuShuffleExchange operator #719

andygrove commented Sep 10, 2020

JustPlay commented Sep 11, 2020 •

edited

Loading

revans2 commented Sep 11, 2020

andygrove commented Nov 6, 2020

[FEA] Consider creating combined GpuCoalesceBatches and GpuShuffleExchange operator #719

[FEA] Consider creating combined GpuCoalesceBatches and GpuShuffleExchange operator #719

Comments

andygrove commented Sep 10, 2020

JustPlay commented Sep 11, 2020 • edited Loading

revans2 commented Sep 11, 2020

andygrove commented Nov 6, 2020

JustPlay commented Sep 11, 2020 •

edited

Loading