Commit log bootstrapper (placed before peers bootstrapper) is not reliable post topology change UNTIL a snapshot has taken place #900
Labels
C: Bootstrap
C: Commit Log
C: Snapshotting
C: Storage
G: Data Integrity
Guarantees about data integrity
T: Reliability
Imagine an M3DB cluster running with the bootstrapper configuration (filesystem,commitlog,peers,uninitialized) and the following topology (RF=1 for simplicity):
Topology change removes host 4, making the new topology:
For non-active blocks, Host 1 will have incrementally flushed all the data for shard 4. However, for the active block of Shard 4 the data will still be in-memory and not yet flushed to disk. The data will also not be in any snapshot files or commit log files. If the node goes down at this point, the commit log bootstrapper will succeed the bootstrap but data will have been lost.
Potential solutions:
The text was updated successfully, but these errors were encountered: