skip test_window_aggs_for_rows_lead_lag_on_arrays #2471

Merged
merged 1 commit into from
May 21, 2021


wbo4958
Collaborator

@wbo4958 wbo4958 commented May 21, 2021

If some rows have equal values in the order-by columns (`a`, `b`, `c` in the test),
the test may fail because neither the CPU nor the GPU guarantees the relative order
of such tied rows, while the result of lead/lag depends on row order.

The fix would be to add the aggregation column `d` and the default column
`d_default` to the order-by to guarantee a deterministic order. However,
sorting on arrays is not supported yet, see
#2470.
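
A minimal pure-Python sketch of the ambiguity described above, not the actual test code. The rows and the helpers `valid_orders`, `lag`, and `distinct_lag_results` are hypothetical, and Python's built-in list comparison stands in for array sorting, which the plugin does not support yet per #2470.

```python
from itertools import permutations

# Hypothetical rows of one window partition; ids 0 and 1 tie on (a, b, c).
ROWS = [
    {"id": 0, "a": 1, "b": 1, "c": 1, "d": [10]},
    {"id": 1, "a": 1, "b": 1, "c": 1, "d": [20]},
    {"id": 2, "a": 1, "b": 2, "c": 0, "d": [30]},
]


def valid_orders(rows, key):
    """Return every arrangement of rows that is correctly sorted by key."""
    return [
        list(p)
        for p in permutations(rows)
        if all(key(p[i]) <= key(p[i + 1]) for i in range(len(p) - 1))
    ]


def lag(ordered, default=None):
    """Map each row id to the d value of the row just before it."""
    return {
        row["id"]: ordered[i - 1]["d"] if i > 0 else default
        for i, row in enumerate(ordered)
    }


def distinct_lag_results(rows, key):
    """Collect the different lag outputs that valid sort orders can produce."""
    results = []
    for ordered in valid_orders(rows, key):
        result = lag(ordered)
        if result not in results:
            results.append(result)
    return results


# Ordering by (a, b, c) alone leaves the tie open: two valid outputs.
ambiguous = distinct_lag_results(ROWS, lambda r: (r["a"], r["b"], r["c"]))

# Adding d as a tie-breaker makes the order total: one valid output.
stable = distinct_lag_results(ROWS, lambda r: (r["a"], r["b"], r["c"], r["d"]))

print(len(ambiguous), len(stable))
```

Both arrangements of the tied rows are legal sort results, so two engines can disagree on lag without either being wrong. The tie-breaker only helps when the extra columns actually differ between tied rows.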

So this PR just skips the test.

Signed-off-by: Bobby Wang <wbo4958@gmail.com>

@wbo4958 wbo4958 self-assigned this May 21, 2021
@wbo4958 wbo4958 requested a review from jlowe May 21, 2021 07:47
@wbo4958
Collaborator Author

wbo4958 commented May 21, 2021

build

@pxLi pxLi merged commit ee4b369 into NVIDIA:branch-21.06 May 21, 2021
@revans2
Collaborator

revans2 commented May 21, 2021

So fix this test instead of skipping it! There are ways to generate data that has no ambiguity in the order.
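
One possible way to generate data with no ambiguity in the order, sketched in plain Python rather than with the project's own data generators: give every row a distinct order-by value so that ties cannot occur. The names `gen_rows` and `lag` and all parameters here are hypothetical.

```python
import random


def gen_rows(n, seed=0):
    """Generate n rows whose order-by column b has no repeated values."""
    rng = random.Random(seed)
    # sample() draws without replacement, so every b value is unique.
    b_values = rng.sample(range(n * 10), n)
    return [
        {"a": 1, "b": b, "d": [rng.randint(0, 100)]}
        for b in b_values
    ]


def lag(rows, default=None):
    """Sort by b, then pair each b with the previous row's d."""
    ordered = sorted(rows, key=lambda r: r["b"])
    return [
        (row["b"], ordered[i - 1]["d"] if i > 0 else default)
        for i, row in enumerate(ordered)
    ]


rows = gen_rows(8)
shuffled = rows[:]
random.Random(1).shuffle(shuffled)

# With unique keys the result no longer depends on the input row order.
assert lag(rows) == lag(shuffled)
```

This avoids sorting on the array column entirely, at the cost of no longer exercising tied order-by values.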

nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
@wbo4958 wbo4958 deleted the skip_lead_lag_test branch June 15, 2021 02:24
Labels
test Only impacts tests