Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[VL] Add RowToColumnarExec to FakeRowAdaptor when use velox native parquet write #2799

Closed
wants to merge 15 commits into from

Conversation

WolverineJiang
Copy link
Contributor

@WolverineJiang WolverineJiang commented Aug 17, 2023

What changes were proposed in this pull request?

Some problems bring in by patch #1925 . This patch will resolve it.

How was this patch tested?

Run UT by set spark.gluten.sql.native.writer.enabled=true

Run TPCDS by set spark.gluten.sql.native.parquet.writer.enabled=true && spark.sql.sources.useV1SourceList=parquet

jianghaonan and others added 15 commits August 11, 2023 18:49
…velox native parquet write

Change-Id: If330f096a64a7127887733c71e39a370a185b62e
…velox native parquet write

Change-Id: If2b813993e6bf245d9bb981b3c6a01951a630da4
…velox native parquet write

Change-Id: I284e42f027273528077f5f6f1918c59fc625c691
…velox native parquet write

Change-Id: I5addffdede5ab8c3d111dfc6b9e067177c7ef384
…velox native parquet write

Change-Id: I8c8af6d1dffca25d1b87a0b47a2ee46ddd35c0a5
…velox native parquet write

Change-Id: I8937f81767d343acb2e4dde14ada25b4ab1d5270
…velox native parquet write

Change-Id: I4a9afa718dace1a6ccac221cae204065391ae968
…velox native parquet write

Change-Id: I168510d8a9cf98dde8f044e96693fece54bc1313
…velox native parquet write

Change-Id: I4ec20640bf2e7a3259ec2fa367927432bb02aadc
…velox native parquet write

Change-Id: I546013b04d54b9f3e75fb183cba75dd3d8019480
…rquet write

Change-Id: I3b23628f134515cb6f3da4128024566a98100312
@github-actions
Copy link

Thanks for opening a pull request!

Could you open an issue for this pull request on Github Issues?

https://github.com/oap-project/gluten/issues

Then could you also rename commit message and pull request title in the following format?

[GLUTEN-${ISSUES_ID}][COMPONENT]feat/fix: ${detailed message}

See also:

@github-actions
Copy link

Run Gluten Clickhouse CI

rc.withNewChildren(
Array(
FakeRowAdaptor(
BackendsApiManager.getSparkPlanExecApiInstance.genRowToColumnarExec(other))))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your PR!
I note doExecuteColumnar at FakeRowAdaptor has a similar handling for row-based child operator. Doesn't that piece of code work?
If there is any issue found in your test, could you please add some UTs? Thanks!

case other => rc.withNewChildren(Array(FakeRowAdaptor(other)))
case other =>
if (other.supportsColumnar) {
rc.withNewChildren(Array(FakeRowAdaptor(other)))
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please add some UTs to show why we should do this and confirm problem fixed.

@jackylee-ch
Copy link
Contributor

@PHILO-HE I just checked the problem @WolverineJiang met. it has nothing to do with the current pr, he did not override the ParquetFileFormat through extraClassPath. Thus I will close this pr now.

@PHILO-HE
Copy link
Contributor

@PHILO-HE I just checked the problem @WolverineJiang met. it has nothing to do with the current pr, he did not override the ParquetFileFormat through extraClassPath. Thus I will close this pr now.

No problem. Thanks for your clarification!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants