-
Notifications
You must be signed in to change notification settings - Fork 14.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GCP SparkR Example #8240
GCP SparkR Example #8240
Conversation
Allows you to schedule R, and sparkR jobs on a dataproc cluster. The functionality to run that kind of job is already in dataproc, but it was not so clear how to do that from Airflow.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you please check the R code of the job? Suggested changes allowed me to run this example. I think it also may be necessary to update the dataproc client to google-cloud-dataproc==0.7.0
.
airflow/providers/google/cloud/example_dags/example_dataproc.py
Outdated
Show resolved
Hide resolved
Codecov Report
@@ Coverage Diff @@
## master #8240 +/- ##
==========================================
- Coverage 88.33% 87.68% -0.66%
==========================================
Files 936 940 +4
Lines 45319 45359 +40
==========================================
- Hits 40034 39774 -260
- Misses 5285 5585 +300
Continue to review full report at Codecov.
|
Co-Authored-By: Tomek Urbaszek <turbaszek@gmail.com>
Awesome work, congrats on your first merged pull request! |
Show how to schedule R, and sparkR jobs on a dataproc cluster.
The functionality to run that kind of job is already in dataproc,
but it was not so clear how to do that from Airflow.
This is a replacement for #7864
Make sure to mark the boxes below before creating PR: [x]
In case of fundamental code change, Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in UPDATING.md.
Read the Pull Request Guidelines for more information.