Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix reserialization of broadcasted tables #3504

Merged
merged 3 commits into from
Sep 20, 2021

Conversation

jlowe
Copy link
Member

@jlowe jlowe commented Sep 15, 2021

Fixes #3266.

Re-serializing a broadcast caused issues because it purposely deserializes into a different state on the executor than it does on the driver, so serializing on the executor due to memory pressure after being deserialized could cause an NPE.

This updates the broadcast to always deserialize only to memory but then throw away the host buffers once the GPU batch is requested at which point it creates it.

Signed-off-by: Jason Lowe <jlowe@nvidia.com>
@jlowe jlowe added this to the Sep 13 - Sep 24 milestone Sep 15, 2021
@jlowe jlowe self-assigned this Sep 15, 2021
@jlowe
Copy link
Member Author

jlowe commented Sep 15, 2021

build

@jlowe
Copy link
Member Author

jlowe commented Sep 15, 2021

build

1 similar comment
@pxLi
Copy link
Collaborator

pxLi commented Sep 16, 2021

build

revans2
revans2 previously approved these changes Sep 16, 2021
@revans2
Copy link
Collaborator

revans2 commented Sep 16, 2021

build

@sameerz sameerz added the bug Something isn't working label Sep 16, 2021
@tgravescs
Copy link
Collaborator

looks like a lot of test failures and might be related to changes

@jlowe
Copy link
Member Author

jlowe commented Sep 20, 2021

Test failures were related to row-count-only tables. I updated to match the old behavior where we don't try to serialize or deserialize the types if there's only rows. I could simplify this code by serializing the datatypes in all cases to avoid the special-case scenarios at the cost of a bit bigger serialization data for the row-count-only cases. Let me know if that's desirable.

@jlowe
Copy link
Member Author

jlowe commented Sep 20, 2021

build

@jlowe jlowe merged commit 26c62f4 into NVIDIA:branch-21.10 Sep 20, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

Successfully merging this pull request may close these issues.

CDP - Flakiness in JoinSuite in Integration tests
5 participants