Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add example of building Slurm on top of Rocky 8 #2155

Merged

Conversation

nick-stroud
Copy link
Collaborator

@nick-stroud nick-stroud commented Jan 22, 2024

Builds the latest Slurm V6 image on top of HPC Rocky 8 image and deploys a simple cluster using the image. This example is meant to be a demonstrator to show how it is possible to build Slurm.

Notes:

  • This calls ansible directly instead of using Slurm Packer module to avoid complexity of importing external Packer module.
  • This does custom ansible install instead of using ansible runner as python upgrade breaks selinux bindings when done from within a virtual environment.

Tested:

  • Manually run blueprint and test how cluster (srun hostname).
  • Added integration test that does deploy/destroy

Submission Checklist

Please take the following actions before submitting this pull request.

  • Fork your PR branch from the Toolkit "develop" branch (not main)
  • Test all changes with pre-commit in a local branch #
  • Confirm that "make tests" passes all tests
  • Add or modify unit tests to cover code changes
  • Ensure that unit test coverage remains above 80%
  • Update all applicable documentation
  • Follow Cloud HPC Toolkit Contribution guidelines #

@nick-stroud
Copy link
Collaborator Author

To Reviewer: I will add documentation and integration test once you have given a first pass approval of the methodology of the blueprint.

@nick-stroud nick-stroud added the release-key-new-features Added to release notes under the "Key New Features" heading. label Jan 22, 2024
@nick-stroud nick-stroud force-pushed the slurm6-build-image-example branch 2 times, most recently from a9e8421 to 76172aa Compare January 22, 2024 19:19
Tom provided an example blueprint that demonstrated this methodology.

Co-authored-by: Tom Downes <tpdownes@google.com>
@tpdownes tpdownes assigned nick-stroud and unassigned tpdownes Jan 23, 2024
@nick-stroud
Copy link
Collaborator Author

I submitted test manually (build 4c971653-eee2-45f3-8d9c-1cb7c2634145) and it is passing.
I will update triggers once it is submitted.

@nick-stroud nick-stroud assigned tpdownes and unassigned nick-stroud Jan 24, 2024
@nick-stroud nick-stroud merged commit 136d0d9 into GoogleCloudPlatform:develop Jan 24, 2024
7 of 33 checks passed
@nick-stroud nick-stroud assigned nick-stroud and unassigned tpdownes Jan 24, 2024
@mr0re1 mr0re1 mentioned this pull request Feb 12, 2024
@nick-stroud nick-stroud mentioned this pull request Feb 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
release-key-new-features Added to release notes under the "Key New Features" heading.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants