-
Notifications
You must be signed in to change notification settings - Fork 237
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Qualification tool: Report write data format and nested types #3263
Conversation
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
build |
tools/src/main/scala/com/nvidia/spark/rapids/tool/qualification/QualOutputWriter.scala
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
build |
@tgravescs, I think I have addressed all the review comments. PTAL. |
build |
tools/src/main/scala/com/nvidia/spark/rapids/tool/qualification/PluginTypeChecker.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/com/nvidia/spark/rapids/tool/qualification/PluginTypeChecker.scala
Outdated
Show resolved
Hide resolved
val source = Source.fromResource(DEFAULT_DS_FILE) | ||
val supportedWriteFormat = ArrayBuffer[String]() | ||
try { | ||
val fileContents = source.getLines().toSeq |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we are parsing file twice, I'd say just parse it once and split read/writes into different datastructures. so remove this and update readSupportedTypeForPlugin
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
build |
there are some unit tests failing please take a look @nartal1 04:17:51 �[32mPluginTypeCheckerSuite:�[0m |
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
build |
@@ -2019,10 +2019,12 @@ object SupportedOpsForTools { | |||
val conf = new RapidsConf(Map.empty[String, String]) | |||
val types = TypeEnum.values.toSeq | |||
val header = Seq("Format", "Direction") ++ types | |||
val writeOps: Array[String] = Array.fill(18)("NA") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
18 is very specific here, seems easily broken if format of that file changes.
tools/src/main/scala/com/nvidia/spark/rapids/tool/qualification/PluginTypeChecker.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/com/nvidia/spark/rapids/tool/qualification/PluginTypeChecker.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
build |
tools/src/main/scala/org/apache/spark/sql/rapids/tool/qualification/QualAppInfo.scala
Outdated
Show resolved
Hide resolved
tools/src/test/scala/com/nvidia/spark/rapids/tool/qualification/QualificationSuite.scala
Show resolved
Hide resolved
build |
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
build |
tools/src/test/scala/com/nvidia/spark/rapids/tool/qualification/QualificationSuite.scala
Outdated
Show resolved
Hide resolved
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
Signed-off-by: Niranjan Artal <nartal@nvidia.com>
build |
This closes #3045 and closes #3044.
For write data format, unsupported write data formats are reported in the csv.
For nested type, it parses the ReadSchema and identifies the nested types in the schema which is reported in the csv.