Partition-mapping routing in the gateway to avoid selecting a wrong replica in a multi-replica scenario #35

DawidNiezgodka · 2022-06-14T13:22:36Z

When we have a single mirror, then the situation is clear. All partitions are located on this particular mirror, and when there is a request for the value of a given key, it is retrieved seamlessly.
The problem arises when we have more than one replica of a mirror:

Scenario: Mirror1 stores partitions 1 and 4, and Mirror2 stores partitions 2 and 3. Let's say that we want to get data for the key="x". Our hashing function h says that h("x") = 1 which means that the value for the key "x" is stored in the first partition, which is located in Mirror1.
Currently, a request from gateway goes to the (Kubernetes?) service that chooses a replica in a round-robin fashion. This means that it is statistically half of the time wrong. When this happens, the request has to be redirected to the other replica.

Because of the fact that a mirror knows which partitions it stores (it has a mapping between a partitions and a host), we can use this information to introduce routing based on the partition mapping.

Questions:

How do we expose this info to the gateway?
In which part of the gateway should we put this functionality?

Extra ideas:

Caching
Mirror could announce itself by the gateway (observer pattern?)

The text was updated successfully, but these errors were encountered:

DawidNiezgodka added type/enhancement New feature or request component/gateway labels Jun 14, 2022

DawidNiezgodka self-assigned this Jun 24, 2022

DawidNiezgodka changed the title ~~Partition-mapping routing in the gateway to avoid selecting a wrong mirror in a multi-mirror scenario~~ Partition-mapping routing in the gateway to avoid selecting a wrong replica in a multi-replica scenario Jun 24, 2022

DawidNiezgodka mentioned this issue Jun 28, 2022

Introduces routing based on partition-host mapping #38

Merged

DawidNiezgodka closed this as completed in #38 Jul 28, 2022

torbsto added this to the Improved Gateway performance milestone Aug 1, 2022

torbsto added this to Quick Aug 1, 2022

torbsto moved this to Done in Quick Aug 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Partition-mapping routing in the gateway to avoid selecting a wrong replica in a multi-replica scenario #35

Partition-mapping routing in the gateway to avoid selecting a wrong replica in a multi-replica scenario #35

DawidNiezgodka commented Jun 14, 2022 •

edited

Loading

Partition-mapping routing in the gateway to avoid selecting a wrong replica in a multi-replica scenario #35

Partition-mapping routing in the gateway to avoid selecting a wrong replica in a multi-replica scenario #35

Comments

DawidNiezgodka commented Jun 14, 2022 • edited Loading

DawidNiezgodka commented Jun 14, 2022 •

edited

Loading