Update docs for 21.08 release #3080
Merged
Changes from 1 commit
Commits (12):
0386f7d Update docs for 21.08 release (sameerz)
758bd01 Modify "or higher" -> "or later" when referring to versions (sameerz)
2921966 Update docs/download.md (sameerz)
9d085fe Move AQE for developers doc into development section of web site (sameerz)
eebda32 Update a change made in the remote repository based on PR feedback (sameerz)
92e4914 Add 21.06.1 release information to the download page (sameerz)
dc7d303 Merge branch 'branch-21.08' into branch-21.08-doc-updates (sameerz)
46c7e90 Add information about kryoserialization buffer overflow (sameerz)
c3290d3 Remove SNAPSHOT from getting-started-on-prem.md (sameerz)
4a54003 Update README.md for integration tests to remove SNAPSHOT (sameerz)
d4b3ef5 Fix table in supported ops doc issue #2779 (sameerz)
1264395 Fix typo (Abillity -> Ability) (sameerz)
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1 @@ | ||
{"cells":[{"cell_type":"code","source":["dbutils.fs.mkdirs(\"dbfs:/databricks/init_scripts/\")\n \ndbutils.fs.put(\"/databricks/init_scripts/init.sh\",\"\"\"\n#!/bin/bash\nsudo wget -O /databricks/jars/rapids-4-spark_2.12-21.06.0.jar https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark_2.12/21.06.0/rapids-4-spark_2.12-21.06.0.jar\nsudo wget -O /databricks/jars/cudf-21.06.1-cuda11.jar https://repo1.maven.org/maven2/ai/rapids/cudf/21.06.1/cudf-21.06.1-cuda11.jar\n\nsudo wget -O /etc/apt/preferences.d/cuda-repository-pin-600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-ubuntu1804.pin\nsudo wget -O ~/cuda-repo-ubuntu1804-11-0-local_11.0.3-450.51.06-1_amd64.deb https://developer.download.nvidia.com/compute/cuda/11.0.3/local_installers/cuda-repo-ubuntu1804-11-0-local_11.0.3-450.51.06-1_amd64.deb\nsudo dpkg -i ~/cuda-repo-ubuntu1804-11-0-local_11.0.3-450.51.06-1_amd64.deb\nsudo apt-key add /var/cuda-repo-ubuntu1804-11-0-local/7fa2af80.pub\nsudo apt-get update\nsudo apt -y install cuda-toolkit-11-0\"\"\", True)"],"metadata":{},"outputs":[],"execution_count":1},{"cell_type":"code","source":["%sh\ncd ../../dbfs/databricks/init_scripts\npwd\nls -ltr\ncat init.sh"],"metadata":{},"outputs":[],"execution_count":2},{"cell_type":"code","source":[""],"metadata":{},"outputs":[],"execution_count":3}],"metadata":{"name":"generate-init-script","notebookId":2645746662301564},"nbformat":4,"nbformat_minor":0} | ||
{"cells":[{"cell_type":"code","source":["dbutils.fs.mkdirs(\"dbfs:/databricks/init_scripts/\")\n \ndbutils.fs.put(\"/databricks/init_scripts/init.sh\",\"\"\"\n#!/bin/bash\nsudo wget -O /databricks/jars/rapids-4-spark_2.12-21.08.0.jar https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark_2.12/21.08.0/rapids-4-spark_2.12-21.08.0.jar\nsudo wget -O /databricks/jars/cudf-21.08.0-cuda11.jar https://repo1.maven.org/maven2/ai/rapids/cudf/21.08.0/cudf-21.08.0-cuda11.jar\n\nsudo wget -O /etc/apt/preferences.d/cuda-repository-pin-600 https://developer.download.nvidia.com/compute/cuda/repos/ubuntu1804/x86_64/cuda-ubuntu1804.pin\nsudo wget -O ~/cuda-repo-ubuntu1804-11-0-local_11.0.3-450.51.06-1_amd64.deb https://developer.download.nvidia.com/compute/cuda/11.0.3/local_installers/cuda-repo-ubuntu1804-11-0-local_11.0.3-450.51.06-1_amd64.deb\nsudo dpkg -i ~/cuda-repo-ubuntu1804-11-0-local_11.0.3-450.51.06-1_amd64.deb\nsudo apt-key add /var/cuda-repo-ubuntu1804-11-0-local/7fa2af80.pub\nsudo apt-get update\nsudo apt -y install cuda-toolkit-11-0\"\"\", True)"],"metadata":{},"outputs":[],"execution_count":1},{"cell_type":"code","source":["%sh\ncd ../../dbfs/databricks/init_scripts\npwd\nls -ltr\ncat init.sh"],"metadata":{},"outputs":[],"execution_count":2},{"cell_type":"code","source":[""],"metadata":{},"outputs":[],"execution_count":3}],"metadata":{"name":"generate-init-script","notebookId":2645746662301564},"nbformat":4,"nbformat_minor":0} |
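The notebook cell above hard-codes the jar versions into the init script text, which is why each release needs a find-and-replace. A minimal sketch of generating the same download lines from version strings is shown below. The helper names `jar_urls` and `build_init_script` are hypothetical and not part of the repository; the Maven Central URL layout is copied from the diff. Unlike the notebook, the sketch places the shebang on the first line of the script.

```python
# Hypothetical helpers: build the init-script download lines from versions.
MAVEN_BASE = "https://repo1.maven.org/maven2"


def jar_urls(plugin_version, cudf_version, cuda="cuda11"):
    """Return {jar file name: Maven Central URL} for the plugin and cuDF jars."""
    plugin_jar = f"rapids-4-spark_2.12-{plugin_version}.jar"
    cudf_jar = f"cudf-{cudf_version}-{cuda}.jar"
    return {
        plugin_jar: f"{MAVEN_BASE}/com/nvidia/rapids-4-spark_2.12/{plugin_version}/{plugin_jar}",
        cudf_jar: f"{MAVEN_BASE}/ai/rapids/cudf/{cudf_version}/{cudf_jar}",
    }


def build_init_script(plugin_version, cudf_version, jar_dir="/databricks/jars"):
    """Return the text of an init script that downloads both jars with wget."""
    lines = ["#!/bin/bash"]
    for name, url in jar_urls(plugin_version, cudf_version).items():
        lines.append(f"sudo wget -O {jar_dir}/{name} {url}")
    return "\n".join(lines) + "\n"


script = build_init_script("21.08.0", "21.08.0")
print(script)
# In a Databricks notebook the text could then be written out with:
# dbutils.fs.put("/databricks/init_scripts/init.sh", script, True)
```

Passing `("21.06.0", "21.06.1")` instead reproduces the jar names on the removed side of the diff, where the plugin and cuDF versions differed.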
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1 +1 @@ | ||
{"cells":[{"cell_type":"code","source":["dbutils.fs.mkdirs(\"dbfs:/databricks/init_scripts/\")\n \ndbutils.fs.put(\"/databricks/init_scripts/init.sh\",\"\"\"\n#!/bin/bash\nsudo wget -O /databricks/jars/rapids-4-spark_2.12-21.06.0.jar https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark_2.12/21.06.0/rapids-4-spark_2.12-21.06.0.jar\nsudo wget -O /databricks/jars/cudf-21.06.1-cuda11.jar https://repo1.maven.org/maven2/ai/rapids/cudf/21.06.1/cudf-21.06.1-cuda11.jar\"\"\", True)"],"metadata":{},"outputs":[],"execution_count":1},{"cell_type":"code","source":["%sh\ncd ../../dbfs/databricks/init_scripts\npwd\nls -ltr\ncat init.sh"],"metadata":{},"outputs":[],"execution_count":2},{"cell_type":"code","source":[""],"metadata":{},"outputs":[],"execution_count":3}],"metadata":{"name":"generate-init-script","notebookId":2645746662301564},"nbformat":4,"nbformat_minor":0} | ||
{"cells":[{"cell_type":"code","source":["dbutils.fs.mkdirs(\"dbfs:/databricks/init_scripts/\")\n \ndbutils.fs.put(\"/databricks/init_scripts/init.sh\",\"\"\"\n#!/bin/bash\nsudo wget -O /databricks/jars/rapids-4-spark_2.12-21.08.0.jar https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark_2.12/21.08.0/rapids-4-spark_2.12-21.08.0.jar\nsudo wget -O /databricks/jars/cudf-21.08.0-cuda11.jar https://repo1.maven.org/maven2/ai/rapids/cudf/21.08.0/cudf-21.08.0-cuda11.jar\"\"\", True)"],"metadata":{},"outputs":[],"execution_count":1},{"cell_type":"code","source":["%sh\ncd ../../dbfs/databricks/init_scripts\npwd\nls -ltr\ncat init.sh"],"metadata":{},"outputs":[],"execution_count":2},{"cell_type":"code","source":[""],"metadata":{},"outputs":[],"execution_count":3}],"metadata":{"name":"generate-init-script","notebookId":2645746662301564},"nbformat":4,"nbformat_minor":0} |
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
|
@@ -18,6 +18,61 @@ cuDF jar, that is either preinstalled in the Spark classpath on all nodes or sub | |||||
that uses the RAPIDS Accelerator For Apache Spark. See the [getting-started | ||||||
guide](https://nvidia.github.io/spark-rapids/Getting-Started/) for more details. | ||||||
|
||||||
## Release v21.08.0 | ||||||
Hardware Requirements: | ||||||
|
||||||
The plugin is tested on the following architectures: | ||||||
|
||||||
GPU Architecture: NVIDIA V100, T4 and A10/A30/A100 GPUs | ||||||
|
||||||
Software Requirements: | ||||||
|
||||||
OS: Ubuntu 18.04, Ubuntu 20.04 or CentOS 7, CentOS 8 | ||||||
|
||||||
CUDA & Nvidia Drivers*: 11.0-11.4 & v450.80.02+ | ||||||
|
||||||
Apache Spark 3.0.1, 3.0.2, 3.0.3, 3.1.1, 3.1.2, Cloudera CDP 7.1.6, 7.1.7, Databricks 7.3 ML LTS or 8.2 ML Runtime, and GCP Dataproc 2.0 | ||||||
|
||||||
Apache Hadoop 2.10+ or 3.1.1+ (3.1.1 for nvidia-docker version 2) | ||||||
|
||||||
Python 3.6+, Scala 2.12, Java 8 | ||||||
|
||||||
*Some hardware may require a minimum driver version later than v450.80.02. Check the GPU spec sheet | ||||||
for your hardware's minimum driver version. | ||||||
|
||||||
### Download v21.08.0 | ||||||
|
||||||
* Download the [RAPIDS | ||||||
Accelerator for Apache Spark 21.08.0 jar](https://repo1.maven.org/maven2/com/nvidia/rapids-4-spark_2.12/21.08.0/rapids-4-spark_2.12-21.08.0.jar) | ||||||
* Download the [RAPIDS cuDF 21.08.0 jar](https://repo1.maven.org/maven2/ai/rapids/cudf/21.08.0/cudf-21.08.0-cuda11.jar) | ||||||
|
||||||
|
||||||
This package is built against CUDA 11.2 and has [CUDA forward | ||||||
compatibility](https://docs.nvidia.com/deploy/cuda-compatibility/index.html) enabled. It is tested | ||||||
on V100, T4, A30 and A100 GPUs with CUDA 11.0-11.4. For those using other types of GPUs which | ||||||
do not have CUDA forward compatibility (for example, GeForce), CUDA 11.2 is required. Users will | ||||||
need to ensure the minimum driver (450.80.02) and CUDA toolkit are installed on each Spark node. | ||||||
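As a rough illustration of the minimum-driver requirement above, the sketch below compares a driver version string against 450.80.02. The function names are hypothetical. Obtaining the installed version is left to the caller; on a node it could be read with `nvidia-smi --query-gpu=driver_version --format=csv,noheader`, and the sketch assumes the result is a plain dotted version string.

```python
# Hypothetical check of an installed driver version against the documented minimum.
MIN_DRIVER = "450.80.02"


def parse_version(text):
    """Turn a dotted version such as '450.80.02' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


def driver_ok(installed, minimum=MIN_DRIVER):
    """Return True when the installed driver is at least the minimum version."""
    return parse_version(installed) >= parse_version(minimum)


for version in ("450.51.06", "450.80.02", "470.57.02"):
    status = "ok" if driver_ok(version) else "too old"
    print(f"{version}: {status}")
```

Comparing integer tuples rather than raw strings avoids ordering mistakes when components have different digit counts. The check covers only the driver, not the CUDA toolkit, which the text above also requires on each node.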
|
||||||
### Release Notes | ||||||
New functionality and performance improvements for this release include: | ||||||
* Handling data sets that spill out of GPU memory for group by and windowing operations | ||||||
* Running window rank and dense rank operations on the GPU | ||||||
* Support for the `LEGACY` timestamp | ||||||
* Unioning of nested structs | ||||||
* Adoption of UCX 1.11 for improved error handling for RAPIDS Spark Accelerated Shuffle | ||||||
* Ability to read cached data from the GPU on the supported Databricks runtimes | ||||||
* Enabling Parquet writing of array data types from the GPU | ||||||
* Optimized reads for small files for ORC | ||||||
* Spark Qualification and Profiling Tools | ||||||
* Additional filtering capabilities | ||||||
* Reporting on data types | ||||||
* Reporting on read data formats | ||||||
* Ability to run the qualification tool on Spark 2.x logs | ||||||
* Ability to run the tool on Apache Spark 3.x, AWS EMR 6.3.0, Dataproc 2.0, Microsoft Azure, and | ||||||
Databricks 7.3 and 8.2 logs | ||||||
* Improved qualification tool performance | ||||||
|
||||||
For a detailed list of changes, please refer to the | ||||||
[CHANGELOG](https://github.com/NVIDIA/spark-rapids/blob/main/CHANGELOG.md). | ||||||
|
||||||
|
||||||
## Release v21.06.0 | ||||||
Starting with release 21.06.0, the project is moving to calendar versioning, with the first two | ||||||
digits representing the year, the second two digits representing the month, and the last digit | ||||||
|
@@ -35,7 +90,7 @@ Software Requirements: | |||||
|
||||||
CUDA & Nvidia Drivers*: 11.0 or 11.2 & v450.80.02+ | ||||||
|
||||||
Apache Spark 3.0.1, 3.0.2, 3.1.1, 3.1.2, Cloudera CDP 7.1.7, Databricks 7.3 ML LTS or 8.2 ML Runtime, and GCP Dataproc 2.0 | ||||||
Apache Spark 3.0.1, 3.0.2, 3.1.1, 3.1.2, Cloudera CDP 7.1.6, 7.1.7, Databricks 7.3 ML LTS or 8.2 ML Runtime, and GCP Dataproc 2.0 | ||||||
|
||||||
Apache Hadoop 2.10+ or 3.1.1+ (3.1.1 for nvidia-docker version 2) | ||||||
|
||||||
|
@@ -57,7 +112,7 @@ need to ensure the minimum driver (450.80.02) and CUDA toolkit are installed on | |||||
|
||||||
### Release Notes | ||||||
New functionality for this release includes: | ||||||
* Support for running on Cloudera CDP 7.1.7 and Databricks 8.2 ML | ||||||
* Support for running on Cloudera CDP 7.1.6, CDP 7.1.7 and Databricks 8.2 ML | ||||||
* New functionality related to arrays: | ||||||
* Concatenation of array columns | ||||||
* Casting arrays of floats to arrays of doubles | ||||||
|
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
|
@@ -50,8 +50,8 @@ COPY spark-3.0.2-bin-hadoop3.2/examples /opt/spark/examples | |||||
COPY spark-3.0.2-bin-hadoop3.2/kubernetes/tests /opt/spark/tests | ||||||
COPY spark-3.0.2-bin-hadoop3.2/data /opt/spark/data | ||||||
|
||||||
COPY cudf-21.08.0-SNAPSHOT-cuda11.jar /opt/sparkRapidsPlugin | ||||||
COPY rapids-4-spark_2.12-21.08.0-SNAPSHOT.jar /opt/sparkRapidsPlugin | ||||||
COPY cudf-21.08.0-cuda11.jar /opt/sparkRapidsPlugin | ||||||
|
||||||
COPY rapids-4-spark_2.12-21.08.0.jar /opt/sparkRapidsPlugin | ||||||
COPY getGpusResources.sh /opt/sparkRapidsPlugin | ||||||
|
||||||
RUN mkdir /opt/spark/python | ||||||
|
Original file line number | Diff line number | Diff line change | ||||
---|---|---|---|---|---|---|
|
@@ -1280,7 +1280,7 @@ object RapidsConf { | |||||
|On startup use: `--conf [conf key]=[conf value]`. For example: | ||||||
| | ||||||
|``` | ||||||
|${SPARK_HOME}/bin/spark --jars 'rapids-4-spark_2.12-21.08.0-SNAPSHOT.jar,cudf-21.08.0-SNAPSHOT-cuda11.jar' \ | ||||||
|${SPARK_HOME}/bin/spark --jars 'rapids-4-spark_2.12-21.08.0.jar,cudf-21.08.0-cuda11.jar' \ | ||||||
|
||||||
|--conf spark.plugins=com.nvidia.spark.SQLPlugin \ | ||||||
|--conf spark.rapids.sql.incompatibleOps.enabled=true | ||||||
|``` | ||||||
|
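The `--conf [conf key]=[conf value]` pattern in the hunk above can also be assembled programmatically. The sketch below is an illustration only: `spark_args` is a hypothetical helper, not part of the plugin, and it builds just the argument list without launching Spark. The jar names and configuration keys are the ones shown in the diff.

```python
# Hypothetical helper: build --jars / --conf arguments for a Spark launcher.
import shlex


def spark_args(jars, confs):
    """Return launcher arguments: one --jars entry plus one --conf per setting."""
    args = ["--jars", ",".join(jars)]
    for key, value in confs.items():
        args += ["--conf", f"{key}={value}"]
    return args


args = spark_args(
    ["rapids-4-spark_2.12-21.08.0.jar", "cudf-21.08.0-cuda11.jar"],
    {
        "spark.plugins": "com.nvidia.spark.SQLPlugin",
        "spark.rapids.sql.incompatibleOps.enabled": "true",
    },
)
# shlex.quote keeps each argument safe to paste into a shell command line.
print(" ".join(shlex.quote(a) for a in args))
```

Keeping the settings in a dictionary means the jar versions and configuration keys live in one place, so a release bump like the one in this PR touches a single line.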