Fail point-in-time WAL recovery upon IOError reading WAL #6963

riversand963 · 2020-06-11T04:46:24Z

If options.wal_recovery_mode == WALRecoveryMode::kPointInTimeRecovery, RocksDB stops replaying WAL once hitting an error and discards the rest of the WAL. This can lead to data loss if the error occurs at an offset smaller than the last sync'ed offset.
Ideally, RocksDB point-in-time recovery should permit recovery if the error occurs after last synced offset while fail recovery if error occurs before the last synced offset. However, RocksDB does not track the synced offset of WALs. Consequently, RocksDB does not know whether an error occurs before or after the last synced offset. An error can be one of the following.

WAL record checksum mismatch. This can result from both corruption of synced data and dropping of unsynced data during shutdown. We cannot be sure which one. In order not to defeat the original motivation to permit the latter case, we keep the original behavior of point-in-time WAL recovery.
IOError. This means the WAL can be bad, an indicator of whole file becoming unavailable, not to mention synced part of the WAL. Therefore, we choose to modify the behavior of point-in-time recovery and fail the database recovery.

Test plan (devserver):
make check

ajkr

LGTM!

I wonder if we should reject Open() for all errors returned by the underlying Env. It's difficult right now because we can't distinguish where status->IsCorruption() originates -- it could either be from the underlying Env or from RocksDB detecting checksum mismatch. In any case I think this PR is a step forward and should be landed.

riversand963 · 2020-06-11T23:48:00Z

LGTM!

I wonder if we should reject Open() for all errors returned by the underlying Env. It's difficult right now because we can't distinguish where status->IsCorruption() originates -- it could either be from the underlying Env or from RocksDB detecting checksum mismatch. In any case I think this PR is a step forward and should be landed.

In other places such as https://github.com/facebook/rocksdb/blob/master/db/version_set.h#L1162, RocksDB uses a separate status variable to keep track of IO errors. We probably can follow similar approach by adding a member variable to log::Reader::Reporter.

facebook-github-bot

@riversand963 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

facebook-github-bot · 2020-06-12T02:38:27Z

@riversand963 merged this pull request in 717749f.

yiwu-arbug · 2020-06-15T03:05:39Z

Thanks for the improvement!

facebook-github-bot added the CLA Signed label Jun 11, 2020

riversand963 requested review from ajkr and yiwu-arbug June 11, 2020 04:46

riversand963 linked an issue Jun 11, 2020 that may be closed by this pull request

Point-in-time recovery should fail on IO error when reading WAL #6288

Closed

ajkr approved these changes Jun 11, 2020

View reviewed changes

riversand963 added 2 commits June 11, 2020 17:00

init check in

b003ced

update history

847f618

riversand963 force-pushed the wal-replay-ioerror branch from 97e18e7 to 847f618 Compare June 12, 2020 00:11

facebook-github-bot reviewed Jun 12, 2020

View reviewed changes

facebook-github-bot closed this in 717749f Jun 12, 2020

facebook-github-bot added the Merged label Jun 12, 2020

riversand963 deleted the wal-replay-ioerror branch June 12, 2020 03:19

yiwu-arbug mentioned this pull request Jun 28, 2020

Fail DB open when Point-in-time recovery encounter IO error when reading WAL tikv/rocksdb#142

Open

2 tasks

riversand963 mentioned this pull request Mar 22, 2021

Point-in-time recovery should fail on IO error when reading WAL #6288

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fail point-in-time WAL recovery upon IOError reading WAL #6963

Fail point-in-time WAL recovery upon IOError reading WAL #6963

riversand963 commented Jun 11, 2020

ajkr left a comment •

edited

Loading

riversand963 commented Jun 11, 2020

facebook-github-bot left a comment

facebook-github-bot commented Jun 12, 2020

yiwu-arbug commented Jun 15, 2020

Fail point-in-time WAL recovery upon IOError reading WAL #6963

Fail point-in-time WAL recovery upon IOError reading WAL #6963

Conversation

riversand963 commented Jun 11, 2020

ajkr left a comment • edited Loading

Choose a reason for hiding this comment

riversand963 commented Jun 11, 2020

facebook-github-bot left a comment

Choose a reason for hiding this comment

facebook-github-bot commented Jun 12, 2020

yiwu-arbug commented Jun 15, 2020

ajkr left a comment •

edited

Loading