Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support Databricks 3.0.1 with new build profiles #3356

Merged
merged 169 commits into from
Aug 31, 2021
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
169 commits
Select commit Hold shift + click to select a range
a02f789
initial pom files
tgravescs Aug 16, 2021
02aae7b
Copy code from 3.0.1 shim to 3.1.1
tgravescs Aug 16, 2021
b99369d
fixes
tgravescs Aug 16, 2021
d15a5b5
Add Quatenary
tgravescs Aug 16, 2021
0a5ef7f
Merge branch 'build301' of github.com:tgravescs/spark-rapids into bui…
tgravescs Aug 16, 2021
47befb9
checkpoint
tgravescs Aug 16, 2021
7946e70
checkpoint
tgravescs Aug 16, 2021
e7f49aa
checkpoint
tgravescs Aug 16, 2021
158f41e
working build both
tgravescs Aug 17, 2021
c3a4d82
working include in jars
tgravescs Aug 17, 2021
e9a18fb
java trying to work
tgravescs Aug 17, 2021
f304417
checkpoint
tgravescs Aug 17, 2021
ec591cb
checkpoint
tgravescs Aug 17, 2021
9af5f62
revert poms to oroginal and add classifiers
tgravescs Aug 18, 2021
5f3e16e
Working build
tgravescs Aug 18, 2021
24b1e6c
cleanup
tgravescs Aug 18, 2021
d073913
Merge remote-tracking branch 'origin/branch-21.10' into buildperspark
tgravescs Aug 18, 2021
de3bb0c
upmerge
tgravescs Aug 18, 2021
5e0b745
cleanup
tgravescs Aug 18, 2021
4f44dc0
upmerge and cleanup
tgravescs Aug 18, 2021
6e9f929
move thigns to be inside profiles
tgravescs Aug 18, 2021
b29f19b
fix dependencies default
tgravescs Aug 18, 2021
ba214ea
clean
tgravescs Aug 18, 2021
2bcc995
work around dependency missing
tgravescs Aug 18, 2021
f8c625e
move dependencies inside profile to package properly
tgravescs Aug 18, 2021
8849b6a
revert 311 changes
tgravescs Aug 18, 2021
1e8da74
remove release 311 profiles
tgravescs Aug 18, 2021
4a9cdd5
fix shim dependencies
tgravescs Aug 18, 2021
5129b25
revert change for 3.1.1
tgravescs Aug 18, 2021
7f8ba94
Merge remote-tracking branch 'origin/branch-21.10' into buildperspark
tgravescs Aug 19, 2021
3641c45
change name of integration tests jar
tgravescs Aug 19, 2021
84b6945
change to use with-classifier profile
tgravescs Aug 19, 2021
ced230e
move dependencies back to wwhere they were
tgravescs Aug 19, 2021
7d251d9
fix copyright
tgravescs Aug 19, 2021
a16a567
fix spacing
tgravescs Aug 19, 2021
274300b
First 303 support
tgravescs Aug 19, 2021
bf0d47e
fix missed 301
tgravescs Aug 19, 2021
8b10fb8
Added spark 304
tgravescs Aug 19, 2021
702c845
add 311
tgravescs Aug 19, 2021
8549b3e
remove extras 311
tgravescs Aug 19, 2021
34e353b
remove newlines
tgravescs Aug 19, 2021
77f4ab7
split spark 3.1.1 into a base shim to easily copy to other 3.1.x
tgravescs Aug 19, 2021
25e025a
add spark 3.1.2
tgravescs Aug 19, 2021
a1729e0
add poms 311
tgravescs Aug 19, 2021
29c4216
fixes
tgravescs Aug 19, 2021
7434503
spark 3.1.3
tgravescs Aug 19, 2021
7648605
311cdh
tgravescs Aug 19, 2021
53b3cf6
update package
tgravescs Aug 19, 2021
3eda953
poms cdh3
tgravescs Aug 19, 2021
a9f48d9
cdh changes
tgravescs Aug 20, 2021
f7ab378
Fix dependencies on dist pre-merge profile
tgravescs Aug 20, 2021
ba4ea8e
Fix dependency
tgravescs Aug 20, 2021
d5f5914
remove comment out
tgravescs Aug 20, 2021
19baa0f
Fix integration test depdnencies
tgravescs Aug 20, 2021
71f10bb
Fix cdh param
tgravescs Aug 20, 2021
2735226
update poms for cdh
tgravescs Aug 20, 2021
88a8c81
copy files for emr but not tested
tgravescs Aug 20, 2021
6192734
fix emr
tgravescs Aug 20, 2021
6c48775
Add property to set 2 profiles, address review comments
tgravescs Aug 20, 2021
84f9ef2
Merge remote-tracking branch 'tgravescs/buildperspark' into buildpers…
tgravescs Aug 20, 2021
c0c89ec
update for properties
tgravescs Aug 20, 2021
4717169
minimize code changes to shims pom files
tgravescs Aug 20, 2021
3dca739
debug premerge failures print active profiles
tgravescs Aug 20, 2021
c37f29a
disable mvn_verify
tgravescs Aug 20, 2021
854e093
enable mvn debug
tgravescs Aug 20, 2021
9f99ae8
revert premerge build script changes
tgravescs Aug 20, 2021
d8ec7c1
Merge remote-tracking branch 'origin/branch-21.10' into buildperspark
tgravescs Aug 20, 2021
7c786ba
Update 302 to match changes to 301 shim
tgravescs Aug 20, 2021
3b9e4f1
Update missed 301 dep
tgravescs Aug 20, 2021
d240ff0
Merge remote-tracking branch 'tgravescs/buildperspark' into buildpers…
tgravescs Aug 20, 2021
abd67ff
Update shims for changes to base versions
tgravescs Aug 20, 2021
0d599ea
update 301emr
tgravescs Aug 20, 2021
b5eccff
Shim fixes
tgravescs Aug 23, 2021
2b18890
updates for cdh
tgravescs Aug 23, 2021
4a871b2
Merge remote-tracking branch 'origin/branch-21.10' into buildperspark2
tgravescs Aug 23, 2021
bbd8883
fixes
tgravescs Aug 23, 2021
384fabe
update auto generated docs
tgravescs Aug 23, 2021
9c0015c
move under generated sources for tests 310+
tgravescs Aug 23, 2021
f065444
only run spark310+ tests for 3.1.1 for now
tgravescs Aug 23, 2021
74fecb9
databricks initial
tgravescs Aug 24, 2021
9914014
more deps
tgravescs Aug 24, 2021
cc462da
add more shims
tgravescs Aug 24, 2021
f627daf
make db shims base shim
tgravescs Aug 24, 2021
958089d
copy files to 311db
tgravescs Aug 24, 2021
3383646
more databricks generic fixes, sql building
tgravescs Aug 24, 2021
343c555
move Arrow python to shim
tgravescs Aug 24, 2021
a1d390b
move more to shims
tgravescs Aug 24, 2021
869c58f
GpuFlatMap to shim
tgravescs Aug 24, 2021
637f651
move GpuAggregate pandas
tgravescs Aug 24, 2021
9381f17
Gpu map in pandas
tgravescs Aug 24, 2021
f3d8797
move gpu window in pandas
tgravescs Aug 24, 2021
0f131f4
changes
tgravescs Aug 24, 2021
892a4ae
more changes
tgravescs Aug 24, 2021
734a36d
Databricks 311 building
tgravescs Aug 25, 2021
20ad10b
udpate dependencies 301
tgravescs Aug 25, 2021
5617de3
add more deps
tgravescs Aug 25, 2021
41568f2
Merge branch 'buildperspark-db' of github.com:tgravescs/spark-rapids …
tgravescs Aug 25, 2021
12abf6f
building all
tgravescs Aug 25, 2021
3c7635e
change everything provided
tgravescs Aug 25, 2021
1513a34
Fix using GpuFileScanRDD
tgravescs Aug 25, 2021
99d2734
Merge branch 'buildperspark-db' of github.com:tgravescs/spark-rapids …
tgravescs Aug 25, 2021
2ec85ef
Merge remote-tracking branch 'origin/branch-21.10' into buildperspark…
tgravescs Aug 25, 2021
dc8d576
fix multiple profiles
tgravescs Aug 25, 2021
0a188c7
misc fixes
tgravescs Aug 25, 2021
843cb02
more updates
tgravescs Aug 25, 2021
a83bd95
updates
tgravescs Aug 25, 2021
82d69ce
fix shim layers
tgravescs Aug 25, 2021
a0ec6ee
more shim fixes
tgravescs Aug 25, 2021
222d560
building tests now
tgravescs Aug 25, 2021
a2802df
Fix arrow dep on shim
tgravescs Aug 25, 2021
e2a9a07
more
tgravescs Aug 25, 2021
e6859f8
add missing file
tgravescs Aug 25, 2021
c4ad00e
Merge remote-tracking branch 'origin/branch-21.10' into buildperspark…
tgravescs Aug 26, 2021
bae3d27
try shim module name
tgravescs Aug 26, 2021
40e77a4
fix modules and rename 311db dir
tgravescs Aug 26, 2021
b849cd0
clean up versions
tgravescs Aug 26, 2021
8bdc911
Merge branch 'buildperspark-db-upmerge' of github.com:tgravescs/spark…
tgravescs Aug 26, 2021
4c06607
update to use versions
tgravescs Aug 26, 2021
1b378ac
install more and fix version
tgravescs Aug 26, 2021
114c674
Merge branch 'buildperspark-db-upmerge' of https://github.com/tgraves…
tgravescs Aug 26, 2021
e19a5d5
working again
tgravescs Aug 26, 2021
e8c69bc
fix spacing
tgravescs Aug 26, 2021
5a64cdb
cleanup
tgravescs Aug 26, 2021
c90f9d9
build script support both
tgravescs Aug 26, 2021
a0039ba
build script changes
tgravescs Aug 26, 2021
2f36b45
fix shim
tgravescs Aug 26, 2021
b83b971
Merge branch 'buildperspark-db-upmerge' of github.com:tgravescs/spark…
tgravescs Aug 26, 2021
f227879
changes
tgravescs Aug 26, 2021
62c5d7f
Merge branch 'buildperspark-db-upmerge' of https://github.com/tgraves…
tgravescs Aug 26, 2021
28536f7
fix line
tgravescs Aug 26, 2021
16a8dde
Fix db311
tgravescs Aug 27, 2021
3d645c3
name files similar
tgravescs Aug 27, 2021
5133137
Add in 301db files
tgravescs Aug 27, 2021
81c234e
more shim files
tgravescs Aug 27, 2021
c53b786
more 301db changes
tgravescs Aug 27, 2021
b4df2ee
301db compiling
tgravescs Aug 27, 2021
f8dfd3e
Merge remote-tracking branch 'origin/branch-21.10' into buildperspark…
tgravescs Aug 27, 2021
0551e9c
revert build script changes
tgravescs Aug 27, 2021
1afc827
move to common
tgravescs Aug 27, 2021
cae26db
update for diffs
tgravescs Aug 27, 2021
21f278f
Merge remote-tracking branch 'origin/branch-21.10' into buildperspark…
tgravescs Aug 30, 2021
d2a922b
Fix issues with missing parameter columnar to row
tgravescs Aug 30, 2021
a00f03b
Fix import order
tgravescs Aug 30, 2021
ca1e658
Update build script to work with Databricks 3.0.1
tgravescs Aug 30, 2021
c9e9de8
Review comments
tgravescs Aug 30, 2021
2e20480
Cleanup extra dependencies and rename dependencies
tgravescs Aug 30, 2021
6c35bf4
remove spark301emr shim
tgravescs Aug 30, 2021
2da96bf
move trampoline and fix 304 compilation
tgravescs Aug 30, 2021
6cd162c
Revert "remove spark301emr shim"
tgravescs Aug 30, 2021
3c47239
revert build change
tgravescs Aug 30, 2021
32bdae9
pom
tgravescs Aug 30, 2021
3754b63
Merge branch 'buildperspark-db-upmerge2-test' into buildsparkdb301
tgravescs Aug 30, 2021
de0c030
301db building
tgravescs Aug 30, 2021
c7ec343
cleanup
tgravescs Aug 30, 2021
d76c8a8
Merge remote-tracking branch 'origin/branch-21.10' into buildsparkdb301
tgravescs Aug 30, 2021
a8a3bd6
Fix spacing and put 301emr back
tgravescs Aug 30, 2021
f5d987b
comment and rename profile
tgravescs Aug 30, 2021
ba75337
Downgrade scala maven plugin version
tgravescs Aug 31, 2021
d7584ff
revert build.sh change
tgravescs Aug 31, 2021
4519b6d
Fixes for 301
tgravescs Aug 31, 2021
dcbce01
Merge branch 'buildsparkdb301' of https://github.com/tgravescs/spark-…
tgravescs Aug 31, 2021
5cef60f
fix build script
tgravescs Aug 31, 2021
0240779
cleanup warnings
tgravescs Aug 31, 2021
81ce609
Fix formatting of pom files
tgravescs Aug 31, 2021
37d5b90
revert finalname
tgravescs Aug 31, 2021
2e2a647
revert build script
tgravescs Aug 31, 2021
1acc092
Comment using spark version for all Databricks dependencies
tgravescs Aug 31, 2021
f0ce057
Merge branch 'buildsparkdb301' of github.com:tgravescs/spark-rapids i…
tgravescs Aug 31, 2021
25ba10a
add comment to shim files
tgravescs Aug 31, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
43 changes: 25 additions & 18 deletions integration_tests/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -94,7 +94,9 @@
<profile>
<id>with-classifier</id>
<activation>
<property><name>buildver</name></property>
<property>
<name>buildver</name>
</property>
</activation>
<dependencies>
<dependency>
Expand Down Expand Up @@ -166,36 +168,41 @@
</dependencies>
</profile>
<profile>
<id>release311db</id>
<!--
Note that we are using the Spark version for all of the Databricks dependencies as well.
The jenkins/databricks/build.sh script handles installing the jars as maven artifacts.
This is to make it easier and not have to change version numbers for each individual dependency
and deal with differences between Databricks versions
-->
<id>dbdeps</id>
<activation>
<property>
<name>buildver</name>
<value>311db</value>
<name>databricks</name>
</property>
</activation>
<dependencies>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-sql_${scala.binary.version}</artifactId>
<version>${spark311db.version}</version>
<version>${spark.version}</version>
<scope>provided</scope>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-core_${scala.binary.version}</artifactId>
<version>${spark311db.version}</version>
<version>${spark.version}</version>
<scope>provided</scope>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-annotation_${scala.binary.version}</artifactId>
<version>${spark311db.version}</version>
<version>${spark.version}</version>
<scope>provided</scope>
</dependency>
<dependency>
<groupId>org.apache.spark</groupId>
<artifactId>spark-catalyst_${scala.binary.version}</artifactId>
<version>${spark311db.version}</version>
<version>${spark.version}</version>
<scope>provided</scope>
</dependency>
<dependency>
Expand All @@ -206,27 +213,27 @@
</dependency>
<dependency>
<groupId>org.apache.arrow</groupId>
<artifactId>arrow-format</artifactId>
<version>${spark311db.version}</version>
<artifactId>arrow-format</artifactId>
<version>${spark.version}</version>
<scope>compile</scope>
</dependency>
<dependency>
<groupId>org.apache.arrow</groupId>
<artifactId>arrow-memory</artifactId>
<version>${spark311db.version}</version>
<artifactId>arrow-memory</artifactId>
<version>${spark.version}</version>
<scope>compile</scope>
</dependency>
<dependency>
<groupId>org.apache.arrow</groupId>
<artifactId>arrow-memory</artifactId>
<version>${spark311db.version}</version>
<artifactId>arrow-vector</artifactId>
<version>${spark.version}</version>
<scope>compile</scope>
</dependency>
<dependency>
<groupId>org.apache.arrow</groupId>
<artifactId>arrow-vector</artifactId>
<version>${spark311db.version}</version>
<scope>compile</scope>
<groupId>com.fasterxml.jackson.core</groupId>
<artifactId>jackson-annotations</artifactId>
<version>${spark.version}</version>
jlowe marked this conversation as resolved.
Show resolved Hide resolved
<scope>provided</scope>
</dependency>
</dependencies>
</profile>
Expand Down
62 changes: 39 additions & 23 deletions jenkins/databricks/build.sh
Original file line number Diff line number Diff line change
Expand Up @@ -94,31 +94,47 @@ PARQUETCOMMONJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-
PARQUETCOLUMNJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.parquet--parquet-column--org.apache.parquet__parquet-column__1.10.1-databricks6.jar
PARQUETFORMATJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.parquet--parquet-format--org.apache.parquet__parquet-format__2.4.0.jar

ARROWFORMATJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-format--org.apache.arrow__arrow-format__2.0.0.jar
ARROWMEMORYJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-memory-core--org.apache.arrow__arrow-memory-core__2.0.0.jar
ARROWMEMORYNETTYJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-memory-netty--org.apache.arrow__arrow-memory-netty__2.0.0.jar
ARROWVECTORJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-vector--org.apache.arrow__arrow-vector__2.0.0.jar

NETWORKCOMMON=----workspace_${SPARK_MAJOR_VERSION_STRING}--common--network-common--network-common-hive-2.3__hadoop-2.7_2.12_deploy.jar
COMMONUNSAFE=----workspace_${SPARK_MAJOR_VERSION_STRING}--common--unsafe--unsafe-hive-2.3__hadoop-2.7_2.12_deploy.jar
LAUNCHER=----workspace_${SPARK_MAJOR_VERSION_STRING}--launcher--launcher-hive-2.3__hadoop-2.7_2.12_deploy.jar

KRYO=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--com.esotericsoftware--kryo-shaded--com.esotericsoftware__kryo-shaded__4.0.2.jar

APACHECOMMONS=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--commons-io--commons-io--commons-io__commons-io__2.4.jar
APACHECOMMONSLANG3=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.commons--commons-lang3--org.apache.commons__commons-lang3__3.10.jar

JSON4S=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.json4s--json4s-ast_2.12--org.json4s__json4s-ast_2.12__3.7.0-M5.jar
if [[ $BASE_SPARK_VERSION == "3.0.1" ]]
then
JSON4S=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.json4s--json4s-ast_2.12--org.json4s__json4s-ast_2.12__3.6.6.jar
APACHECOMMONSLANG3=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.commons--commons-lang3--org.apache.commons__commons-lang3__3.9.jar
HIVESTORAGE=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.hive--hive-storage-api--org.apache.hive__hive-storage-api__2.7.1.jar
ARROWFORMATJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-format--org.apache.arrow__arrow-format__0.15.1.jar
ARROWMEMORYJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-memory--org.apache.arrow__arrow-memory__0.15.1.jar
ARROWVECTORJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-vector--org.apache.arrow__arrow-vector__0.15.1.jar
HIVEEXECJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--patched-hive-with-glue--hive-exec-core_shaded.jar
else
JSON4S=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.json4s--json4s-ast_2.12--org.json4s__json4s-ast_2.12__3.7.0-M5.jar
APACHECOMMONSLANG3=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.commons--commons-lang3--org.apache.commons__commons-lang3__3.10.jar
HIVESTORAGE=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.hive--hive-storage-api--org.apache.hive__hive-storage-api__2.7.2.jar
HIVEEXECJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.hive--hive-exec-core--org.apache.hive__hive-exec-core__2.3.7.jar
ARROWFORMATJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-format--org.apache.arrow__arrow-format__2.0.0.jar
ARROWMEMORYJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-memory-core--org.apache.arrow__arrow-memory-core__2.0.0.jar
ARROWMEMORYNETTYJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-memory-netty--org.apache.arrow__arrow-memory-netty__2.0.0.jar
ARROWVECTORJAR=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.arrow--arrow-vector--org.apache.arrow__arrow-vector__2.0.0.jar
fi

JAVAASSIST=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.javassist--javassist--org.javassist__javassist__3.25.0-GA.jar

PROTOBUFJAVA=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--com.google.protobuf--protobuf-java--com.google.protobuf__protobuf-java__2.6.1.jar

JACKSONCORE=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--com.fasterxml.jackson.core--jackson-databind--com.fasterxml.jackson.core__jackson-databind__2.10.0.jar
JACKSONANNOTATION=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--com.fasterxml.jackson.core--jackson-annotations--com.fasterxml.jackson.core__jackson-annotations__2.10.0.jar

HADOOPCOMMON=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.hadoop--hadoop-common--org.apache.hadoop__hadoop-common__2.7.4.jar
HADOOPMAPRED=----workspace_${SPARK_MAJOR_VERSION_STRING}--maven-trees--hive-2.3__hadoop-2.7--org.apache.hadoop--hadoop-mapreduce-client-core--org.apache.hadoop__hadoop-mapreduce-client-core__2.7.4.jar

# Please note we are installing all of these dependencies using the Spark version (SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS) to make it easier
# to specify the dependencies in the pom files

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$CATALYSTJAR \
Expand Down Expand Up @@ -161,6 +177,14 @@ then

else

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$HIVEEXECJAR \
-DgroupId=org.apache.hive \
-DartifactId=hive-exec \
-Dversion=$SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS \
-Dpackaging=jar

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$COREJAR \
Expand All @@ -177,6 +201,14 @@ else
-Dversion=$SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS \
-Dpackaging=jar

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$JACKSONANNOTATION\
-DgroupId=com.fasterxml.jackson.core \
-DartifactId=jackson-annotations \
-Dversion=$SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS \
-Dpackaging=jar

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$PROTOBUFJAVA \
Expand Down Expand Up @@ -265,14 +297,6 @@ else
-Dversion=$SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS \
-Dpackaging=jar

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$HIVEEXECJAR \
-DgroupId=org.apache.hive \
-DartifactId=hive-exec \
-Dversion=$SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS \
-Dpackaging=jar

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$HIVESERDEJAR \
Expand Down Expand Up @@ -329,14 +353,6 @@ else
-Dversion=$SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS \
-Dpackaging=jar

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$ARROWMEMORYNETTYJAR \
-DgroupId=org.apache.arrow \
-DartifactId=arrow-memory-netty \
-Dversion=$SPARK_VERSION_TO_INSTALL_DATABRICKS_JARS \
-Dpackaging=jar

mvn -B install:install-file \
-Dmaven.repo.local=$M2DIR \
-Dfile=$JARDIR/$ARROWVECTORJAR \
Expand Down
36 changes: 36 additions & 0 deletions pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -193,6 +193,34 @@
</modules>
</profile>
<profile>
<!-- Note databricks requires 2 properties -Ddatabricks and -Dbuildver=[301db,311db] -->
<id>release301db</id>
<activation>
<property>
<name>buildver</name>
<value>301db</value>
</property>
</activation>
<properties>
<!-- Downgrade scala plugin version due to: https://github.com/sbt/sbt/issues/4305 -->
<scala.plugin.version>3.4.4</scala.plugin.version>
<shim.module.name>spark301db</shim.module.name>
<spark.version.classifier>spark301-databricks</spark.version.classifier>
<!--
Note that we are using the Spark version for all of the Databricks dependencies as well.
The jenkins/databricks/build.sh script handles installing the jars as maven artifacts.
This is to make it easier and not have to change version numbers for each individual dependency
and deal with differences between Databricks versions
-->
<spark.version>${spark301db.version}</spark.version>
<spark.test.version>${spark301db.version}</spark.test.version>
<rat.consoleOutput>true</rat.consoleOutput>
<hive.storage.api.version>${spark301db.version}</hive.storage.api.version>
<protobuf.java.version>${spark301db.version}</protobuf.java.version>
</properties>
</profile>
<profile>
<!-- Note databricks requires 2 properties -Ddatabricks and -Dbuildver=[301db,311db] -->
<id>release311db</id>
<activation>
<property>
Expand All @@ -201,8 +229,16 @@
</property>
</activation>
<properties>
<!-- Downgrade scala plugin version due to: https://github.com/sbt/sbt/issues/4305 -->
<scala.plugin.version>3.4.4</scala.plugin.version>
<shim.module.name>spark311db</shim.module.name>
<spark.version.classifier>spark311-databricks</spark.version.classifier>
<!--
Note that we are using the Spark version for all of the Databricks dependencies as well.
The jenkins/databricks/build.sh script handles installing the jars as maven artifacts.
This is to make it easier and not have to change version numbers for each individual dependency
and deal with differences between Databricks versions
-->
<spark.version>${spark311db.version}</spark.version>
<spark.test.version>${spark311db.version}</spark.test.version>
<rat.consoleOutput>true</rat.consoleOutput>
Expand Down
Loading