
Align partition end on 1 MiB boundary #589

Closed · wants to merge 1 commit

Conversation

@nedbass (Contributor) commented Feb 29, 2012

Some devices have exhibited sensitivity to the ending alignment of
partitions. In particular, even if the first partition begins at 1
MiB, we have seen many sd driver task abort errors with certain SSDs
if the first partition doesn't end on a 1 MiB boundary. This occurs
when the vdev label is read during pool creation or importation and
causes a delay of about 30 seconds per device. It can also be
simulated with dd when the pool isn't imported:

dd if=/dev/sda1 of=/dev/null bs=262144 count=1

For the record, this problem was observed with SMARTMOD
SG9XCA2E200GE01 200GB SSDs. Unfortunately I don't have a good
explanation for this behavior. It seems to have something to do with
highly fragmented single-sector requests being issued to the device,
which it may not support. With end-aligned partitions at least
page-sized requests were queued and issued to the driver according
to blktrace. In any case, aligning the partition end is a fairly
innocuous work-around, wasting at most 1 MiB of space.

Issue #574
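
As a rough illustration of the work-around (a hand-written sketch, not the patch attached to this pull request; the macro and function names are illustrative), rounding a partition size down to the next 1 MiB boundary in 512-byte sectors amounts to:

#include <stdint.h>
#include <stdio.h>

/* 1 MiB expressed in 512-byte sectors. */
#define PARTITION_END_ALIGNMENT (1024 * 1024 / 512)    /* 2048 sectors */

/* Round a partition size (in sectors) down to a 1 MiB multiple. */
static uint64_t
align_partition_size(uint64_t sectors)
{
        return (sectors - (sectors % PARTITION_END_ALIGNMENT));
}

int
main(void)
{
        /* Arbitrary example size in 512-byte sectors. */
        uint64_t sectors = 390721968;

        /* Prints: 390721968 -> 390721536 sectors (432 sectors dropped) */
        printf("%llu -> %llu sectors\n",
            (unsigned long long)sectors,
            (unsigned long long)align_partition_size(sectors));
        return (0);
}

Since the rounding only ever removes the trailing remainder, at most 2047 sectors (just under 1 MiB) are lost per device.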

@Rudd-O (Contributor) commented Mar 1, 2012

I second this idea.

@behlendorf (Contributor) commented:

Merged as commit 613d88e

@behlendorf behlendorf closed this Mar 5, 2012
behlendorf added a commit to behlendorf/zfs that referenced this pull request Dec 15, 2016
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Requires-spl: refs/pull/589/head
behlendorf added a commit to behlendorf/zfs that referenced this pull request Dec 19, 2016
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Requires-spl: refs/pull/589/head
behlendorf pushed a commit to behlendorf/zfs that referenced this pull request May 21, 2018
Commit f58040c should have removed this comment, which is no longer relevant.

Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Signed-off-by: Clemens Fruhwirth <clemens@endorphin.org>
Issue openzfs#589
behlendorf pushed a commit to behlendorf/zfs that referenced this pull request May 21, 2018
The main complication from the RT patch set is that the RW semaphore
locks change such that read locks on an rwsem can be taken only by
a single thread.  All other threads are locked out. This single
thread can take a read lock multiple times though. The underlying
implementation changes to a mutex with an additional read_depth
count.

The implementation can be best understood by inspecting the RT
patch.  rwsem_rt.h and rt.c give the best insight into how RT
rwsem works. My implementation for rwsem_tryupgrade is basically
an inversion of rt_downgrade_write found in rt.c. Please see the
comments in the code.

Unfortunately, I have to drop SPLAT rwlock test4 completely as this
test tries to take multiple locks from different threads, which RT
rwsems do not support.  Otherwise SPLAT, zconfig.sh, zpios-sanity.sh
and zfs-tests.sh pass on my Debian-testing VM with the kernel
linux-image-4.8.0-1-rt-amd64.

Tested-by: kernelOfTruth <kerneloftruth@gmail.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Signed-off-by: Clemens Fruhwirth <clemens@endorphin.org>
Closes openzfs#5491
Closes openzfs#589
Closes openzfs#308
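
For anyone trying to follow the rwsem_tryupgrade change described above, here is a stripped-down user-space model of the idea (the struct, field, and function names here are illustrative, not the actual RT patch or SPL code): under RT an rwsem reduces to a mutex plus a per-owner read_depth, so a tryupgrade only has to confirm the caller is the sole, non-nested reader and clear the read count while keeping the underlying mutex held.

#include <pthread.h>
#include <stdio.h>

/* Simplified model of an RT rwsem: one mutex plus a read-depth count. */
struct rt_rwsem_model {
        pthread_mutex_t lock;   /* the single underlying mutex */
        int read_depth;         /* recursive read count of the owning thread */
};

/* Take a read lock: only one thread may hold it, but it may nest. */
static void
model_read_lock(struct rt_rwsem_model *rwsem)
{
        if (rwsem->read_depth == 0)
                pthread_mutex_lock(&rwsem->lock);
        rwsem->read_depth++;
}

/*
 * Try to upgrade the read lock to a write lock.  The caller already owns
 * the mutex, so the upgrade is just checking that no nested read locks
 * are held and dropping read_depth to zero; failure leaves the read lock
 * untouched.  (Roughly the inverse of a downgrade, which would set
 * read_depth back to one.)
 */
static int
model_tryupgrade(struct rt_rwsem_model *rwsem)
{
        if (rwsem->read_depth != 1)
                return (0);     /* nested readers: cannot upgrade */
        rwsem->read_depth = 0;  /* now held as a write lock */
        return (1);
}

int
main(void)
{
        struct rt_rwsem_model rwsem = { PTHREAD_MUTEX_INITIALIZER, 0 };

        model_read_lock(&rwsem);
        printf("upgrade: %d\n", model_tryupgrade(&rwsem));     /* prints 1 */
        pthread_mutex_unlock(&rwsem.lock);      /* release as a write unlock */
        return (0);
}

This also makes clear why SPLAT rwlock test4 had to be dropped: with a single underlying mutex there is no way for a second thread to take a concurrent read lock.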