Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Prepare for 3.2.1 shim, various shim build fixes and improvements [databricks] #3923

Merged
merged 31 commits into from
Oct 29, 2021

Conversation

gerashegalov
Copy link
Collaborator

@gerashegalov gerashegalov commented Oct 26, 2021

Signed-off-by: Gera Shegalov gera@apache.org

Contributes to #3470
Closes #3915

  • Remove unused dependency-reduced-pom-generation from agg pom
  • Eliminate double invocation of the shade plugin taking extra 6 seconds by delegating artifact attachment to the build helper plugin
  • Improve clean by customizing maven-clean-plugin
  • Enable *WithDatabricks profiles in buildall
  • Generate SparkShimServiceProvider discovery files on the fly
  • Refactor dist/pom.xml to reduce copy/paste code by 50%, delegate to default plugins when possible by customization
  • Disable doc generation automatically if the profile does not contain 301 to avoid inadvertent diffs on CI
  • Fix typo in verify_same_sha_for_unshimmed
  • Remove redundant SparkShims.getRapidsShuffleManagerClass
  • Consolidate most of Spark320 shim code into Spark32xShims.

@gerashegalov
Copy link
Collaborator Author

build

Signed-off-by: Gera Shegalov <gera@apache.org>
@gerashegalov
Copy link
Collaborator Author

build

@gerashegalov
Copy link
Collaborator Author

build

@gerashegalov
Copy link
Collaborator Author

Personally I would have preferred this pr be broken up into build changes and 3.2.1 separately for easier review and figure out what changes went with what.

I took 3.2.1 out of this PR

Copy link
Collaborator

@tgravescs tgravescs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

mostly nits, love that this cleans up all this hardcoded pom file stuff!

aggregator/pom.xml Outdated Show resolved Hide resolved
aggregator/pom.xml Outdated Show resolved Hide resolved
build/buildall Outdated Show resolved Hide resolved
build/buildall Outdated Show resolved Hide resolved
build/buildall Show resolved Hide resolved
dist/maven-antrun/build-parallel-worlds.xml Outdated Show resolved Hide resolved
dist/maven-antrun/build-parallel-worlds.xml Show resolved Hide resolved
pom.xml Show resolved Hide resolved
process-classes is too late

Signed-off-by: Gera Shegalov <gera@apache.org>
@gerashegalov gerashegalov changed the title Prepare for 3.2.1 shim, various shim build fixes and improvements Prepare for 3.2.1 shim, various shim build fixes and improvements [databricks] Oct 28, 2021
@gerashegalov
Copy link
Collaborator Author

build

tgravescs
tgravescs previously approved these changes Oct 29, 2021
@@ -42,35 +42,21 @@
<artifactId>rapids-4-spark-aggregator_${scala.binary.version}</artifactId>
<version>${project.version}</version>
</dependency>
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like this needs upmerged to latest changes where we marked provided? or maybe it doesn't matter because we remove the dependencies in reduced pom

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

upmerge fixed it , and thanks for pointing out inconsistent use of spark-hive deps

@tgravescs
Copy link
Collaborator

guess this needs to be upmerged, I can look again once that is done

@gerashegalov
Copy link
Collaborator Author

build

than antrun

Signed-off-by: Gera Shegalov <gera@apache.org>
@gerashegalov
Copy link
Collaborator Author

build

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
build Related to CI / CD or cleanly building Spark 3.2+ tech debt
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[BUG] typo in verify_same_sha_for_unshimmed prevents the offending class file name from being logged.
2 participants