-
Notifications
You must be signed in to change notification settings - Fork 237
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix reserialization of broadcasted tables #3504
Fix reserialization of broadcasted tables #3504
Conversation
Signed-off-by: Jason Lowe <jlowe@nvidia.com>
build |
build |
1 similar comment
build |
build |
looks like a lot of test failures and might be related to changes |
Test failures were related to row-count-only tables. I updated to match the old behavior where we don't try to serialize or deserialize the types if there's only rows. I could simplify this code by serializing the datatypes in all cases to avoid the special-case scenarios at the cost of a bit bigger serialization data for the row-count-only cases. Let me know if that's desirable. |
build |
Fixes #3266.
Re-serializing a broadcast caused issues because it purposely deserializes into a different state on the executor than it does on the driver, so serializing on the executor due to memory pressure after being deserialized could cause an NPE.
This updates the broadcast to always deserialize only to memory but then throw away the host buffers once the GPU batch is requested at which point it creates it.