-
Notifications
You must be signed in to change notification settings - Fork 237
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Improve date support in JSON and CSV readers #4853
Conversation
Signed-off-by: Andy Grove <andygrove@nvidia.com>
Signed-off-by: Andy Grove <andygrove@nvidia.com>
'spark.rapids.sql.csv.read.bool.enabled': 'true', | ||
'spark.rapids.sql.csv.read.date.enabled': 'true', | ||
'spark.rapids.sql.csv.read.byte.enabled': 'true', | ||
'spark.rapids.sql.csv.read.short.enabled': 'true', | ||
'spark.rapids.sql.csv.read.integer.enabled': 'true', | ||
'spark.rapids.sql.csv.read.long.enabled': 'true', | ||
'spark.rapids.sql.csv.read.float.enabled': 'true', | ||
'spark.rapids.sql.csv.read.double.enabled': 'true', |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Most of these options were removed from the configs in earlier PRs related to JSON support
build |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Mostly nits outside of the documentation needing to be updated.
sql-plugin/src/main/scala/com/nvidia/spark/rapids/DateUtils.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuBatchScanExec.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuTextBasedPartitionReader.scala
Outdated
Show resolved
Hide resolved
sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuTextBasedPartitionReader.scala
Show resolved
Hide resolved
@@ -150,7 +150,6 @@ trait SparkQueryCompareTestSuite extends FunSuite with Arm { | |||
|
|||
def enableCsvConf(): SparkConf = { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should we just delete this?
It also looks like some versions of Spark do not include a dateFormat option. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The nits I have left are small enough I am happy to merge this in as is.
build |
build |
Build failed with Spark 3.3.0
I will need to do some shim work next |
build |
build |
build |
@revans2 I fixed the issues with 3.3.0 and tests are now passing |
Closes #4849 and #1091 and partially fixes #1111 and #123
Adds support for reading date columns from JSON and CSV, respecting
timeParserPolicy
.Status: