Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use staging buckets by default #57

Merged
merged 5 commits into from
Jun 4, 2024
Merged

Conversation

jashapiro
Copy link
Member

Closes #44

Here I am adding use of the staging buckets that were created in https://github.com/AlexsLemonade/OpenScPCA-infra/pull/33 to the workflow.

The idea is that most runs initiated through workflow triggers will use the staging buckets, with the production buckets reserved for tagged releases. I am not currently enforcing that: setting a run to prod is still possible from the GHA, though I wonder if I should instead make it only run on version tags.

I moved the setting of the simulated data bucket out to a more global param to make it more visible and consistent, but note that I left Docker image definitions with the code that they are used for.

I also updated the readme to cover these options and to describe which buckets are being used for the action outputs.

Copy link
Member

@allyhawkins allyhawkins left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, I would just add a small note that the prod profile is meant to be used for releases.

The run modes available are:

- `test`: runs only a simple test workflow to check configuration
- `simulated`: runs the workflow using simulated data
- `scpca`: runs the workflow using the current ScPCA data release
- `full`: simulates data based on the current ScPCA data release, then runs the workflow using the simulated data and current ScPCA data release (this is same as the behavior of the automatic release workflow)

By default, the output mode will be set to `staging`, so all outputs will be saved to S3 buckets that are not shared with users and can not overwrite current production data.
With the `prod` output mode, results will be accessible visible to users.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
With the `prod` output mode, results will be accessible visible to users.
With the `prod` output mode, results will be accessible to users.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe add a note that prod should only be used for releases?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How's this?

Suggested change
With the `prod` output mode, results will be accessible visible to users.
With the `prod` output mode, results will be accessible to users.
`prod` output mode should used for versioned releases of the workflow, and when running on new ScPCA data releases.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good!

@jashapiro jashapiro merged commit d06b1bc into main Jun 4, 2024
2 checks passed
@jashapiro jashapiro deleted the jashapiro/staging-defaults branch June 4, 2024 20:02
@jashapiro jashapiro restored the jashapiro/staging-defaults branch June 4, 2024 20:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add testing/staging output buckets
2 participants