Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Added a getting started guide for Synapse[skip ci] #4187

Merged
merged 6 commits into from
Nov 23, 2021
Merged
Show file tree
Hide file tree
Changes from 4 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
1 change: 1 addition & 0 deletions docs/FAQ.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,6 +22,7 @@ The RAPIDS Accelerator for Apache Spark officially supports:
- [AWS EMR 6.2+](get-started/getting-started-aws-emr.md)
- [Databricks Runtime 7.3, 9.1](get-started/getting-started-databricks.md)
- [Google Cloud Dataproc 2.0](get-started/getting-started-gcp.md)
- [Azure Synapse](get-started/getting-started-azure-synapse-analytics.md)

Most distributions based on a supported Apache Spark version should work, but because the plugin
replaces parts of the physical plan that Apache Spark considers to be internal the code for those
Expand Down
2 changes: 1 addition & 1 deletion docs/download.md
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ Software Requirements:

CUDA & NVIDIA Drivers*: 11.0-11.4 & v450.80.02+

Apache Spark 3.0.1, 3.0.2, 3.0.3, 3.1.1, 3.1.2, 3.2.0, Cloudera CDP 7.1.6, 7.1.7, Databricks 7.3 ML LTS or 8.2 ML Runtime, and GCP Dataproc 2.0
Apache Spark 3.0.1, 3.0.2, 3.0.3, 3.1.1, 3.1.2, 3.2.0, Cloudera CDP 7.1.6, 7.1.7, Databricks 7.3 ML LTS or 8.2 ML Runtime, GCP Dataproc 2.0 and Azure Synapse
viadea marked this conversation as resolved.
Show resolved Hide resolved

Apache Hadoop 2.10+ or 3.1.1+ (3.1.1 for nvidia-docker version 2)

Expand Down
2 changes: 1 addition & 1 deletion docs/get-started/getting-started-alluxio.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
---
layout: page
title: Alluxio
nav_order: 6
nav_order: 7
parent: Getting-Started
---
# Getting Started with RAPIDS and Alluxio
Expand Down
19 changes: 19 additions & 0 deletions docs/get-started/getting-started-azure-synapse-analytics.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,19 @@
---
layout: page
title: Azure Synapse Analytics
nav_order: 5
parent: Getting-Started
---
# Getting started with RAPIDS Accelerator on Azure Synapse Analytics
[Azure Synapse](https://docs.microsoft.com/en-us/azure/synapse-analytics/) is a limitless analytics service that brings
viadea marked this conversation as resolved.
Show resolved Hide resolved
together enterprise data warehousing and Big Data analytics.

Synapse now offers the ability to create Apache Spark pools that use GPUs on the backend to run your Spark workloads on
GPUs for accelerated processing. This is called as

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

These are called GPU-accelerated Apache Spark pools.

[GPU-accelerated Apache Spark pools](https://docs.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-gpu-concept).
Currently it ships with the RAPIDS Accelerator for Apache Spark version 21.10.

Please follow below 2 quickstart guides:

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please follow the below quickstart guides to learn how to create and use GPU pools in Azure Synapse:

1. [Quickstart: Create an Apache Spark GPU-enabled Pool in Azure Synapse Analytics using the Azure portal](https://docs.microsoft.com/en-us/azure/synapse-analytics/quickstart-create-apache-gpu-pool-portal)
2. [Quickstart: Create an Apache Spark notebook to run on a GPU pool](https://docs.microsoft.com/en-us/azure/synapse-analytics/spark/apache-spark-rapids-gpu)

2 changes: 1 addition & 1 deletion docs/get-started/getting-started-kubernetes.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
---
layout: page
title: Kubernetes
nav_order: 5
nav_order: 6
parent: Getting-Started
---

Expand Down
2 changes: 1 addition & 1 deletion docs/get-started/getting-started-workload-qualification.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@
---
layout: page
title: Workload Qualification
nav_order: 7
nav_order: 8
parent: Getting-Started
---
# Getting Started on Spark workload qualification
Expand Down