Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: Specify schema when calling readJSON in GpuJsonToStructs #10326

Closed

Conversation

andygrove
Copy link
Contributor

@andygrove andygrove commented Jan 30, 2024

Draft PR for discussion.

Related to #8558

Status: prototype code, exploring how we can specify the schema rather than have cuDF infer it in GpuJsontoStructs.

Benefits:

  • Performance improvement because we only read the columns we need
  • Makes code more consistent with GpuJsonScan and reads primitives as strings and then we cast to the expected type in the plugin for compatibility with Spark

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant