itsbilal opened this issue on Apr 25, 2024 · 0 comments · Fixed by #123506
Labels:
A-storage: Relating to our storage engine (Pebble) on-disk storage.
C-bug: Code not up to spec/doc, specs & docs deemed correct. Solution expected to change code/behavior.
T-storage: Storage Team
123506: roachtestutil: dynamically determine block device to stall r=RaduBerinde a=itsbilal
Previously, we hardcoded the block device on which to run the disk-stalled* roachtests and the disk-stall operation. This was flaky because a node sometimes used a local SSD as its block device, which has very different device numbers than a Google persistent disk.
This change updates the cgroup disk staller to programmatically determine the major/minor device numbers for the block device to stall (the one mounted at /mnt/data1). It also updates the dmsetup disk staller to dynamically determine the block device name mounted at /mnt/data1.
Fixes #123080, #123054.
Epic: none
Release note: None
Co-authored-by: Bilal Akhtar <bilal@cockroachlabs.com>
Currently, the disk-stall roachtest hardcodes the block device major:minor numbers to pass into the cgroup controller:
cockroach/pkg/cmd/roachtest/roachtestutil/disk_stall.go, line 104 in 7c9bab9
This works for GCE nodes with persistent disks (PD), but with local SSDs there is no 8:16 device; the /mnt/data1 volume is on a different device.
As part of this change, programmatically determine the major:minor block device numbers of the device where cockroach is running within the roachtest, and use that to stall the appropriate device.
See #121912.
Jira issue: CRDB-38192