
Fault-tolerant Sharded Key-Value Storage service

Built as part of the MIT 6.824 Distributed Systems Labs

  • Lab 1: MapReduce (Warmup / Practice exercise)
  • Lab 2: Raft Protocol
  • Lab 3: Fault-tolerant Key/Value Service
  • Lab 4: Sharded Key/Value Service

Key Features

  • Put / Get / Append calls (see the sketch below)
  • Replicated & Fault Tolerant: Able to serve requests as long as a majority of servers are up and can communicate, despite other failures or network partitions.
  • Linearizable: Users can assume they are talking to a single machine and that all requests are processed in a single global order. A call also observes the effects of every call that completed before it started.
  • Scalable: Supports dynamically adding/reconfiguring servers and shards to boost performance, with zero downtime, i.e. requests on unaffected keys keep flowing during a reconfiguration.
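
A minimal sketch of the Put/Get/Append semantics. The `KV` type and its in-memory map are purely illustrative stand-ins for the replicated store; the real client issues these operations as RPCs to the replica group that owns the key's shard.

```go
package main

import "fmt"

// KV is an illustrative in-memory stand-in for the replicated store.
type KV struct {
	data map[string]string
}

func NewKV() *KV { return &KV{data: make(map[string]string)} }

// Put installs or replaces the value for key.
func (kv *KV) Put(key, value string) { kv.data[key] = value }

// Append concatenates value onto the current value (a missing key acts as "").
func (kv *KV) Append(key, value string) { kv.data[key] += value }

// Get returns the current value, or "" if the key does not exist.
func (kv *KV) Get(key string) string { return kv.data[key] }

func main() {
	kv := NewKV()
	kv.Put("x", "1")
	kv.Append("x", "2")
	fmt.Println(kv.Get("x")) // prints "12"
}
```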

Implementation

RAFT Consensus Algorithm

[Raft overview diagram]

The Key Value Service uses the RAFT Consensus Algorithm to maintain a replicated, fault-tolerant, and consistent state across peers.
As described in the RAFT Paper, we implement leader election and a replicated log. The log and some other necessary state are persisted to handle fail-overs and maintain consistency.
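
For reference, the state a RAFT peer maintains, taken from Figure 2 of the Raft paper; the persistent portion is what gets written to stable storage before responding to RPCs. Field names follow the paper, and the sketch omits locking, election timers, and the channel that delivers committed entries to the Key/Value layer.

```go
package raft

// LogEntry is one replicated log entry.
type LogEntry struct {
	Term    int         // term in which the leader received the entry
	Command interface{} // state-machine command (e.g. a KV Put/Get/Append op)
}

// Persistent state on all servers: written to stable storage before
// responding to RPCs, so it survives crashes and restarts.
type PersistentState struct {
	CurrentTerm int        // latest term this server has seen
	VotedFor    int        // candidate voted for in CurrentTerm, or -1 for none
	Log         []LogEntry // the replicated log
}

// Volatile state on all servers.
type VolatileState struct {
	CommitIndex int // highest log index known to be committed
	LastApplied int // highest log index applied to the state machine
}

// Volatile state on leaders, reinitialized after each election win.
type LeaderState struct {
	NextIndex  []int // per peer: index of the next log entry to send
	MatchIndex []int // per peer: highest index known to be replicated there
}
```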

As an optimization to improve memory utilization and recovery times, we implement a snapshotting and log-compaction mechanism, all while keeping the state consistent across fail-overs.
We also implement optimizations such as accelerated log-conflict backtracking (mentioned in the Extended RAFT paper), which reduces the number of AppendEntries RPCs needed to bring a lagging follower back in sync.
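
A sketch of what the accelerated backtracking can look like, under assumed names (ConflictTerm, ConflictIndex, lastIndexOfTerm) that the paper does not prescribe: a follower that rejects AppendEntries reports where the conflict begins, so the leader can skip back over a whole term per rejected RPC instead of decrementing nextIndex one entry at a time.

```go
package raft

type LogEntry struct {
	Term    int
	Command interface{}
}

type Raft struct {
	log       []LogEntry
	nextIndex []int // per peer: index of the next log entry the leader will send
}

type AppendEntriesReply struct {
	Term    int  // follower's current term, so a stale leader can step down
	Success bool // true if the follower matched prevLogIndex/prevLogTerm

	// Conflict hints, only meaningful when Success is false.
	ConflictTerm  int // term of the conflicting entry, or -1 if the follower's log is too short
	ConflictIndex int // first index the follower holds for ConflictTerm, or its log length if too short
}

// lastIndexOfTerm returns the leader's last log index holding the given
// term, or -1 if it has no entry from that term.
func (rf *Raft) lastIndexOfTerm(term int) int {
	for i := len(rf.log) - 1; i >= 0; i-- {
		if rf.log[i].Term == term {
			return i
		}
	}
	return -1
}

// onReject jumps nextIndex back over a whole term at once instead of
// decrementing it by one per rejected AppendEntries RPC.
func (rf *Raft) onReject(peer int, reply *AppendEntriesReply) {
	if reply.ConflictTerm == -1 {
		rf.nextIndex[peer] = reply.ConflictIndex // follower's log is too short
	} else if last := rf.lastIndexOfTerm(reply.ConflictTerm); last != -1 {
		rf.nextIndex[peer] = last + 1 // leader has entries from that term
	} else {
		rf.nextIndex[peer] = reply.ConflictIndex // leader never saw that term
	}
}
```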

Key Value Service

Next, on top of RAFT we build a simple Key Value storage service that supports Put/Get/Append calls and triggers snapshotting and log compaction whenever the memory used by the log approaches a threshold.
The service is consistent and gracefully handles failures and client retries. Client retries are handled idempotently: the service ensures that duplicate requests are executed only once, even across server failures and snapshots!
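
A sketch of the duplicate-detection idea, with illustrative names only (Op, ClientId, SeqNum, lastSeq): each client tags every request with a client ID and a per-client sequence number, and the apply loop consults a table before executing a command. Both the store and the table are included in snapshots, so exactly-once behavior survives crashes and log compaction.

```go
package kvraft

// Op is the command the KV layer submits to Raft.
type Op struct {
	ClientId int64  // unique per client
	SeqNum   int64  // increases by one for each request the client issues
	Kind     string // "Put", "Append", or "Get"
	Key      string
	Value    string
}

// KVServer holds the replicated state machine; both maps are part of every
// snapshot, so duplicate detection survives crashes and log compaction.
type KVServer struct {
	store   map[string]string // the key/value data
	lastSeq map[int64]int64   // highest SeqNum already applied, per client
}

// apply is invoked once per committed Raft log entry, in log order.
func (kv *KVServer) apply(op Op) {
	if op.SeqNum <= kv.lastSeq[op.ClientId] {
		return // a retry of an already-applied request: execute nothing twice
	}
	switch op.Kind {
	case "Put":
		kv.store[op.Key] = op.Value
	case "Append":
		kv.store[op.Key] += op.Value
	}
	kv.lastSeq[op.ClientId] = op.SeqNum
}
```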

[Key/Value service diagram]

Sharding the Key Value Store

Since Linearizability is local/composable (Ref: Section 3.3), we can scale the system by sharding the key space across independent groups of RAFT peers.
We partition the servers into replica groups of RAFT peers, where each group handles a set of shards and executes independently. This lets us scale throughput by adding more groups/servers.
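
For concreteness, a sketch of a key-to-shard mapping. The fixed shard count and first-byte hash follow the 6.824 lab skeleton; any deterministic function works as long as every client and server agrees on it.

```go
package shardkv

// NShards is the fixed number of shards; it only needs to be larger than
// the number of replica groups you expect to run.
const NShards = 10

// key2shard maps a key to the shard that owns it.
func key2shard(key string) int {
	shard := 0
	if len(key) > 0 {
		shard = int(key[0])
	}
	return shard % NShards
}
```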

We implement a Shard Controller service that manages the sharding configuration, i.e. the mapping of shards to replica groups. The controller is again built on top of RAFT to provide high availability and fault tolerance. It supports Join and Leave RPCs to add/remove replica groups, a Move RPC to migrate a shard from one group to another, and a Query RPC to fetch a configuration.
Join and Leave distribute the shards evenly across groups.
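
The configuration the controller replicates through RAFT has the shape used in the 6.824 labs, sketched below: a numbered record of which replica group owns each shard and which servers make up each group. Join/Leave produce the next Config with shards rebalanced evenly, Move edits a single shard assignment, and Query returns a stored config.

```go
package shardctrler

const NShards = 10

// Config is one numbered sharding configuration.
type Config struct {
	Num    int              // monotonically increasing configuration number
	Shards [NShards]int     // Shards[s] = GID of the replica group owning shard s
	Groups map[int][]string // GID -> names of the servers in that replica group
}
```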

The service as a whole provides linearizability to the clients even while shards are being moved around and reconfigured.
During a reconfiguration, clients perceive no downtime for the unaffected keys, as the sketch below illustrates.
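
One way to see why unaffected keys keep working, sketched with illustrative names (ownedShard, shardReady, ErrWrongGroup): a replica group serves a shard only if the current config assigns it that shard and the shard's data has finished migrating in. Shards untouched by the ongoing reconfiguration stay owned and ready, so their requests proceed normally, while requests for moving shards are rejected and retried by the client against the new owner.

```go
package shardkv

const NShards = 10

type Err string

const (
	OK            Err = "OK"
	ErrWrongGroup Err = "ErrWrongGroup" // shard not owned here (or not yet migrated)
)

// ShardKV is a single replica-group server; only the fields needed for the
// ownership check are shown.
type ShardKV struct {
	ownedShard [NShards]bool // shards assigned to this group by the current config
	shardReady [NShards]bool // shards whose data has finished migrating in
}

// serveShard reports whether this group may serve the given shard right now.
// Shards untouched by the ongoing reconfiguration remain both owned and
// ready, so their requests see no interruption.
func (kv *ShardKV) serveShard(shard int) Err {
	if !kv.ownedShard[shard] || !kv.shardReady[shard] {
		return ErrWrongGroup
	}
	return OK
}
```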
