Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Correct an issue for README for tools and also correct s3 solution in Args.scala #2618

Merged
merged 4 commits into from
Jun 7, 2021

Conversation

viadea
Copy link
Collaborator

@viadea viadea commented Jun 7, 2021

Correct an issue for README for tools and also correct s3 solution in Args.scala

Correct an issue for README for tools and also correct s3 solution in Args.scala
@viadea viadea added the documentation Improvements or additions to documentation label Jun 7, 2021
@viadea viadea requested review from tgravescs and nartal1 June 7, 2021 17:49
tgravescs
tgravescs previously approved these changes Jun 7, 2021
Copy link
Collaborator

@tgravescs tgravescs left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

changes look fine, will like have conflicts when #2612 is merged

export AWS_ACCESS_KEY_ID=xxx
export AWS_SECRET_ACCESS_KEY=xxx
# Need to download hadoop-aws-<version>.jar and aws-java-sdk-<version>.jar firstly,
# and then put fs.s3a.access.key and fs.s3a.secret.key in $SPARK_HOME/conf/hdfs-site.xml.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

S3a is a separate filesystem from HDFS. It's typically stored in core-site.xml. But you can also just configure in spark via spark.hadoop.fs.s3a... settings. Environment variables should work too if EnvironmentVariableCredentialsProvider is configured.

I suggest that we don't go into details in our repo, and simply defer/link Hadoop/Spark docs for this.

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@gerashegalov I just added a doc link in README and removed the instructions in 2 Args.scala files.
How is now?thx

@tgravescs
Copy link
Collaborator

build

@tgravescs tgravescs merged commit a7a3d63 into NVIDIA:branch-21.06 Jun 7, 2021
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
… Args.scala (NVIDIA#2618)

* Signed-off-by: Hao Zhu <hazhu@nvidia.com>

Correct an issue for README for tools and also correct s3 solution in Args.scala

* Add doc link for Hadoop-AWS module

* Remove S3 instructions.

* Remove S3 instructions from QualificationArgs
nartal1 pushed a commit to nartal1/spark-rapids that referenced this pull request Jun 9, 2021
… Args.scala (NVIDIA#2618)

* Signed-off-by: Hao Zhu <hazhu@nvidia.com>

Correct an issue for README for tools and also correct s3 solution in Args.scala

* Add doc link for Hadoop-AWS module

* Remove S3 instructions.

* Remove S3 instructions from QualificationArgs
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants