
Add local ssd RAID0 startup script #2720

Merged

Conversation


@alyssa-sm alyssa-sm commented Jun 26, 2024

This PR adds a startup script that checks for existing local SSDs and creates a RAID0 array if two or more are present.

The script has been tested manually with Slurm-GCP v6 and vm-instance on CentOS, Debian, Rocky, and Ubuntu.

If at least two local SSDs are attached, a RAID device is created for them. Running lsblk -o NAME,FSTYPE,LABEL,UUID,MOUNTPOINT shows the following.

NAME    FSTYPE            LABEL               UUID                                 MOUNTPOINT
sda                                                                                
├─sda1  ext4                                  7be27af0-f080-46f6-bdf8-bc495c775f18 /
├─sda14                                                                            
└─sda15 vfat                                  4FFB-6B16                            /boot/efi
nvme0n1 linux_raid_member ssd-test-debian-0:0 0823949c-a4f7-b097-8712-cbbb5a50066a 
└─md0   ext4              LOCALSSD            6d3ba06d-b59c-4df4-ae5e-29f49ccc5e7a /mnt/localssd
nvme0n2 linux_raid_member ssd-test-debian-0:0 0823949c-a4f7-b097-8712-cbbb5a50066a 
└─md0   ext4              LOCALSSD            6d3ba06d-b59c-4df4-ae5e-29f49ccc5e7a /mnt/localssd

When zero or one local SSDs are present, the script exits early:

Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: TASK [Exit if less than 2 local ssd] *******************************************
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script:
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: PLAY RECAP *********************************************************************
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: localhost                  : ok=6    changed=2    unreachable=0    failed=0    skipped=2    rescued=0    ignored=0
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script:
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: stderr=Wed Jun 26 00:21:26 +0000 2024 Info [2370]: === start executing runner: 99-running-script-warning.sh-54ee ===
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: Wed Jun 26 00:21:26 +0000 2024 Info [2370]: === 99-running-script-warning.sh-54ee finished with exit_code=0 ===
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: Wed Jun 26 00:21:29 +0000 2024 Info [2370]: === start executing runner: install_ansible_automatic.sh-c9b1 ===
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: Wed Jun 26 00:23:28 +0000 2024 Info [2370]: === install_ansible_automatic.sh-c9b1 finished with exit_code=0 ===
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: Wed Jun 26 00:23:31 +0000 2024 Info [2370]: === start executing runner: setup-raid.yml-4df9 ===
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: Wed Jun 26 00:23:37 +0000 2024 Info [2370]: === setup-raid.yml-4df9 finished with exit_code=0 ===
Jun 26 00:23:37 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script:
Jun 26 00:23:38 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: INFO: Check status of cluster services
Jun 26 00:23:39 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script: INFO: Done setting up compute
Jun 26 00:23:39 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: startup-script exit status 0
Jun 26 00:23:39 ssdtest-computenodeset-0 google_metadata_script_runner[1555]: Finished running startup scripts.
Jun 26 00:23:39 ssdtest-computenodeset-0 systemd[1]: google-startup-scripts.service: Succeeded.
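
The early-exit behavior shown in the log might be implemented with Ansible tasks along these lines (a sketch only, not the exact contents of setup-raid.yml; the device pattern and variable names are illustrative assumptions):

```yaml
# Illustrative sketch; task and variable names are hypothetical.
- name: Find local SSD devices
  ansible.builtin.find:
    paths: /dev/disk/by-id
    patterns: "google-local-nvme-ssd-*"
    file_type: link
  register: local_ssds

- name: Exit if less than 2 local ssd
  ansible.builtin.meta: end_play
  when: local_ssds.files | length < 2
```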

Manual Testing for rebooting.

  • Deployed a blueprint that boots up a vm-instance for each OS as well as a Slurm-GCP v6 controller and login node.
  • After the instances were set up, verified that the RAID0 was set up with lsblk -o NAME,FSTYPE,LABEL,UUID,MOUNTPOINT and saved the output.
  • Rebooted each instance with sudo reboot.
  • When the instances finished setting back up, verified that the RAID0 was re-established with lsblk -o NAME,FSTYPE,LABEL,UUID,MOUNTPOINT and compared the UUIDs of the RAID0 local SSDs before and after reboot.

A reboot is successful if the RAID device is recreated from the local SSDs that were attached prior to reboot. Rebooting was successful for all the instances.
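
Re-mounting after reboot works because the filesystem is identified by label rather than by a numeric device name; a hedged sketch of such a mount task (the path and label match the lsblk output above, but the task itself is illustrative, not copied from the playbook):

```yaml
# Sketch only; not the exact task from setup-raid.yml.
- name: Mount the RAID array by filesystem label
  ansible.posix.mount:
    path: /mnt/localssd
    src: LABEL=LOCALSSD
    fstype: ext4
    opts: defaults,nofail
    state: mounted
```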

Manual Testing for restarting.

  • Deployed a blueprint that boots up a vm-instance for each OS as well as a Slurm-GCP v6 controller and login node.
  • After the instances were set up, verified that the RAID0 was set up with lsblk -o NAME,FSTYPE,LABEL,UUID,MOUNTPOINT.
  • Manually shut down the instances and then started them back up.
  • When the instances finished setting back up, verified that the RAID0 was created with lsblk -o NAME,FSTYPE,LABEL,UUID,MOUNTPOINT.

A restart is successful if the RAID device is set up with the local SSDs. Restarting was successful for all the instances except the Slurm-GCP instances. The current problem is that the Slurm setup detects that Slurm is already installed and exits early, so the startup script never gets executed.

Cases covered by this PR:

  • CentOS, Rocky, Ubuntu, and Debian VM instances: reboot and restart
  • Slurm-GCP reboot

Cases that require more thought:

  • Slurm-GCP restart

@alyssa-sm alyssa-sm self-assigned this Jun 26, 2024
@alyssa-sm alyssa-sm requested a review from tpdownes June 26, 2024 17:22
@alyssa-sm alyssa-sm marked this pull request as ready for review July 3, 2024 05:47
@alyssa-sm alyssa-sm assigned tpdownes and unassigned alyssa-sm Jul 3, 2024
@tpdownes tpdownes left a comment


I want to review the implementation further to refine it. I have tested the following blueprint:

# Copyright 2022 Google LLC
#
# Licensed under the Apache License, Version 2.0 (the "License");
# you may not use this file except in compliance with the License.
# You may obtain a copy of the License at
#
#      http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.

---
blueprint_name: startup-vm-instance

vars:
  project_id:  ## Set project id here
  deployment_name: testfix
  region: asia-southeast1
  zone: asia-southeast1-b
  bandwidth_tier: gvnic_enabled

deployment_groups:
- group: first
  modules:
  - id: network1
    source: modules/network/vpc
  - id: script
    source: modules/scripts/startup-script
    settings:
      setup_raid: true
  - id: vm1
    source: modules/compute/vm-instance
    use:
    - network1
    - script
    settings:
      machine_type: g2-standard-24
      local_ssd_count: 2

This blueprint runs CentOS 7. I need to test other OSes that are not EOL, and I understand there may be an issue with Debian behavior upon reboot. I have successfully tested this blueprint with:

  • rebooting: data is preserved and the array is remounted ("WAI")
  • powering off: data is lost, but new disks are formatted and mounted ("WAI")
  • powering off with the preview flag to preserve data: data is preserved and the array is remounted ("WAI")

@tpdownes tpdownes left a comment


I did some research into the behavior of mdadm on modern Linux systems. Arrays will be auto-assembled with a predictable name at /dev/md/raid_name if you supply a name during creation. Additionally, the hostname of the machine must match the hostname the array was created on. In the real world, I believe there are boot-time race conditions that prevent hostname matching from being 100% reliable even on the same host. We can work around this by supplying --homehost=any during RAID creation.

If you follow these "rules", you can rely on the system to assemble the RAID array at "/dev/md/raid_name" and ignore the numeric device names. If that file does not exist, the RAID array must be created. This occurs when the VM is booted for the first time, or powered off and on without disabling local SSD discard.

This won't handle a Slurm-GCP power off/on cycle, but it's a good first step.
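
Under these rules, the creation step can be made idempotent by keying on /dev/md/raid_name. A sketch of what that might look like as an Ansible task (raid_name and the device list variable are illustrative assumptions, not the playbook's actual names):

```yaml
# Sketch; creates the array only when it was not auto-assembled at boot.
- name: Create RAID0 array with a stable name and homehost
  ansible.builtin.command:
    cmd: >-
      mdadm --create /dev/md/{{ raid_name }}
      --name={{ raid_name }} --homehost=any
      --level=0 --raid-devices={{ local_ssd_devices | length }}
      {{ local_ssd_devices | join(' ') }}
    creates: /dev/md/{{ raid_name }}
```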

@tpdownes tpdownes removed their assignment Jul 22, 2024
@alyssa-sm alyssa-sm force-pushed the add-raid-startup-script branch 4 times, most recently from fa3467d to 31bf852 on July 29, 2024 06:28
@alyssa-sm alyssa-sm requested a review from tpdownes July 29, 2024 06:35
@alyssa-sm alyssa-sm assigned tpdownes and unassigned alyssa-sm Jul 29, 2024

@tpdownes tpdownes left a comment


Four very small changes that I'd like to see. I'm going to begin testing with those changes in place.

@tpdownes tpdownes assigned alyssa-sm and unassigned tpdownes Jul 29, 2024
@alyssa-sm alyssa-sm added the release-key-new-features Added to release notes under the "Key New Features" heading. label Jul 29, 2024
@alyssa-sm alyssa-sm force-pushed the add-raid-startup-script branch 2 times, most recently from 7b122c0 to a7148b0 on July 30, 2024 01:25
@alyssa-sm alyssa-sm requested a review from tpdownes July 30, 2024 01:29
@alyssa-sm alyssa-sm removed their assignment Jul 30, 2024

@tpdownes tpdownes left a comment


LGTM. I want @nick-stroud to weigh in on input variable format and then done.


@tpdownes tpdownes left a comment


We missed a change that causes the accidental inclusion of the runner in all invocations of startup-script.

@tpdownes tpdownes assigned alyssa-sm and unassigned tpdownes Jul 31, 2024
@tpdownes tpdownes self-requested a review July 31, 2024 19:01
Co-authored-by: Tom Downes <tpdownes@users.noreply.github.com>
@tpdownes tpdownes self-requested a review July 31, 2024 19:49