-
Notifications
You must be signed in to change notification settings - Fork 442
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[GLUTEN-4835][CORE] Match metric names with Spark #4834
Conversation
Thanks for opening a pull request! Could you open an issue for this pull request on Github Issues? https://github.com/oap-project/gluten/issues Then could you also rename commit message and pull request title in the following format?
See also: |
Run Gluten Clickhouse CI |
Thanks, could you open an issue for this, to describe why need this change. |
I'll create one. Here's the short answer - it breaks some Spark internal things due to the mismatch of metric names. |
Run Gluten Clickhouse CI |
gluten-ut/spark34/src/test/scala/org/apache/spark/sql/GlutenSQLQuerySuite.scala
Outdated
Show resolved
Hide resolved
Run Gluten Clickhouse CI |
Run Gluten Clickhouse CI |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks.
backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala
Outdated
Show resolved
Hide resolved
backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala
Outdated
Show resolved
Hide resolved
gluten-ut/spark34/src/test/scala/org/apache/spark/sql/GlutenSQLQuerySuite.scala
Show resolved
Hide resolved
Run Gluten Clickhouse CI |
Run Gluten Clickhouse CI |
The patch looks good to me. Indeed we didnt pay much attention on the Spark built metrics/instrumentation system. |
@FelixYBW Do you think this change makes sense? Thanks. |
Yes, looks the names are inherited from Gazelle. Not sure why we don't use spark standard name |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
+1, lgtm
backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala
Show resolved
Hide resolved
backends-velox/src/main/scala/io/glutenproject/backendsapi/velox/MetricsApiImpl.scala
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@zzcclp Please help confirm on CH backend, thanks!
===== Performance report for TPCH SF2000 with Velox backend, for reference only ====
|
What changes were proposed in this pull request?
Rename metrics "inputRows" and "outputRows" to "numInputRows" and "numOutputRows" respectively, to match the names used in Spark.
(Fixes: #4835)
How was this patch tested?
Added a test case under GlutenSQLQuerySuite