[BUG] Why is the hash aggregate not handling empty result expressions #4017

abellina · 2021-11-03T16:39:33Z

The hash aggregate is currently disabled for cases where there are no resultExpressions. This is something that was missed and has been in the hash aggregate code for a long time.

https://github.com/NVIDIA/spark-rapids/blob/branch-21.12/sql-plugin/src/main/scala/com/nvidia/spark/rapids/aggregate.scala#L872

Prior issues reference the mortgage test as a potential repro case. I can imagine that without much effort this can be fixed and we can remove this special case.

The text was updated successfully, but these errors were encountered:

revans2 · 2021-11-03T16:45:00Z

spark.sql("SELECT c_birth_month, SUM(c_birth_year) as sum_year from customer where c_salutation rlike 'Mr.' group by c_birth_month").count()

Shows this on the TPCDS dataset. Although it is really just a bogus query. Essentially it looks like in this situation there is a count as the output of the query so they don't want to materialize the output. Just get the number of rows of output there would be. So output a batch with no columns, just rows.

abellina · 2021-11-03T16:49:40Z

@revans2 thanks, the example reproduces locally

viadea · 2021-11-03T17:35:33Z

Thanks Bobby for the mini repro. Let me put the interested Spark driver log message here for future search:

!Exec <HashAggregateExec> cannot run on GPU because result expressions is empty

abellina added bug Something isn't working ? - Needs Triage Need team to review and classify labels Nov 3, 2021

abellina self-assigned this Nov 3, 2021

sameerz added this to the Nov 1 - Nov 12 milestone Nov 3, 2021

abellina mentioned this issue Nov 4, 2021

Hash aggregate fix empty resultExpressions [databricks] #4035

Merged

abellina closed this as completed in #4035 Nov 5, 2021

sameerz removed the ? - Needs Triage Need team to review and classify label Nov 9, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] Why is the hash aggregate not handling empty result expressions #4017

[BUG] Why is the hash aggregate not handling empty result expressions #4017

abellina commented Nov 3, 2021 •

edited

Loading

revans2 commented Nov 3, 2021

abellina commented Nov 3, 2021

viadea commented Nov 3, 2021

[BUG] Why is the hash aggregate not handling empty result expressions #4017

[BUG] Why is the hash aggregate not handling empty result expressions #4017

Comments

abellina commented Nov 3, 2021 • edited Loading

revans2 commented Nov 3, 2021

abellina commented Nov 3, 2021

viadea commented Nov 3, 2021

abellina commented Nov 3, 2021 •

edited

Loading