Change Cast.toString as "cast" instead of "ansi_cast" under ANSI mode #5047

HaoYang670 · 2022-03-25T07:33:32Z

Signed-off-by: remzi 13716567376yh@gmail.com

Signed-off-by: remzi <13716567376yh@gmail.com>

HaoYang670 · 2022-03-25T07:47:41Z

build

andygrove · 2022-03-25T23:07:19Z

sql-plugin/src/main/scala/com/nvidia/spark/rapids/GpuCast.scala

@@ -1544,11 +1544,7 @@ case class GpuCast(
    }
  }

-  override def toString: String = if (ansiMode) {


Can we add a test for this? I think this is something that needs to go into the shim layer since it varies by Spark version.

This is the Spark 3.1.x implementation:

override def toString: String = { val ansi = if (ansiEnabled) "ansi_" else "" s"${ansi}cast($child as ${dataType.simpleString})" }

andygrove

It looks like the name change only impacts Spark 3.3.0 and later so we need to introduce shim code to determine the name for this expression

revans2 · 2022-03-28T15:03:56Z

It looks like the name change only impacts Spark 3.3.0 and later so we need to introduce shim code to determine the name for this expression

The change to make it ansi vs not in the text was introduced in 3.0.0 and reverted in 3.3.0. For me it is so minor of a change that I just don't think it is worth shimming it/testing it. But I could be convinced if someone is actually parsing the text returned by this.

andygrove · 2022-03-28T15:29:00Z

I also doubt that users would be relying on the name of the expression. At some point, it would be nice if our tests compared the schema of results from GPU and CPU and not just the results themselves. That would automate some of our audit work. I'll remove the change request.

revans2 · 2022-03-28T15:50:21Z

@andygrove

At some point, it would be nice if our tests compared the schema of results from GPU and CPU and not just the results themselves.

That is a great point and is something that should be super simpler to add into asserts.py because we are passing in the dataframe. For the most part there is a clear mapping between the SQL type and a corresponding python type that is returned, but in a few cases it is ambiguous and we really should fix those cases. If you want to file an issue for that please do it. If not I will. You cannot see it, but I am face palming after I looked and realized that I forgot to add that into the integration tests when I first wrote them.

andygrove · 2022-03-28T15:57:28Z

Thanks. I filed #5072 for improving the tests

replace ansi_cast by cast

4d878ff

Signed-off-by: remzi <13716567376yh@gmail.com>

HaoYang670 linked an issue Mar 25, 2022 that may be closed by this pull request

[FEA][Audit] [SPARK-38251][SQL]- Change Cast.toString as "cast" instead of "ansi_cast" under ANSI mode #4870

Closed

HaoYang670 added the audit_3.3.0 Audit related tasks for 3.3.0 label Mar 25, 2022

revans2 approved these changes Mar 25, 2022

View reviewed changes

andygrove reviewed Mar 25, 2022

View reviewed changes

sameerz added this to the Mar 21 - Apr 1 milestone Mar 26, 2022

andygrove suggested changes Mar 28, 2022

View reviewed changes

andygrove approved these changes Mar 28, 2022

View reviewed changes

andygrove merged commit 3d1084c into NVIDIA:branch-22.06 Apr 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change Cast.toString as "cast" instead of "ansi_cast" under ANSI mode #5047

Change Cast.toString as "cast" instead of "ansi_cast" under ANSI mode #5047

HaoYang670 commented Mar 25, 2022

HaoYang670 commented Mar 25, 2022

andygrove Mar 25, 2022

andygrove left a comment

revans2 commented Mar 28, 2022

andygrove commented Mar 28, 2022

revans2 commented Mar 28, 2022

andygrove commented Mar 28, 2022

Change Cast.toString as "cast" instead of "ansi_cast" under ANSI mode #5047

Change Cast.toString as "cast" instead of "ansi_cast" under ANSI mode #5047

Conversation

HaoYang670 commented Mar 25, 2022

HaoYang670 commented Mar 25, 2022

andygrove Mar 25, 2022

Choose a reason for hiding this comment

andygrove left a comment

Choose a reason for hiding this comment

revans2 commented Mar 28, 2022

andygrove commented Mar 28, 2022

revans2 commented Mar 28, 2022

andygrove commented Mar 28, 2022