Feature Request: VReplication Sequence Initialization #13685

mattlord · 2023-08-02T01:32:45Z

Feature Description

Introduction

This feature request proposes additional automation or protections around Vitess sequences, specifically during a MoveTables workflow.

Context

When you import/move tables from an unsharded keyspace to a sharded one as part of a MoveTables workflow, you need to be careful to initialize any sequences used with a next_id higher than the highest value in the source table's auto_increment column.

This can end up being a bit of a foot gun when you have many sharded tables with auto_increment columns, as it requires updating the sequence for each table and leaving a significant gap in next_id values so that you can run SwitchTraffic before the source tables' still-incrementing auto_inc values exceed the sequence's next_id. (This is also a moving target unless writes are stopped on the source.)
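To make the "significant gap" concrete, here is a minimal sketch of how an operator might pick a starting next_id by hand today. The function name, the insert rate, the time window, and the safety factor are all hypothetical inputs chosen for illustration; nothing here is part of Vitess.

```python
import math


def next_id_with_headroom(current_max_id, inserts_per_second,
                          seconds_until_switch, safety_factor=2.0):
    """Estimate a next_id that should stay ahead of the source table.

    Hypothetical helper: all inputs are operator estimates, not values
    that Vitess provides.
    """
    if min(current_max_id, inserts_per_second, seconds_until_switch) < 0:
        raise ValueError("inputs must be non-negative")
    if safety_factor < 1:
        raise ValueError("safety_factor must be at least 1")
    # Rows the source is expected to add before SwitchTraffic runs,
    # padded by the safety factor because the rate is only an estimate.
    expected_growth = math.ceil(
        inserts_per_second * seconds_until_switch * safety_factor
    )
    # next_id must be strictly higher than any id the source may reach.
    return current_max_id + expected_growth + 1


# Example: max id is 1000 now, ~10 inserts/s, switch expected within an hour.
print(next_id_with_headroom(1000, 10, 3600))
```

The estimate has to be repeated for every table with an auto_increment column, and it is only as good as the guessed rate and window, which is why this is easy to get wrong by hand.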

Goal of the Feature

Make MoveTables aware of any Vitess sequences being used by tables that are being moved from an unsharded keyspace to a sharded one. Then add a flag to either MoveTables or, more specifically, to the SwitchTraffic sub-command which will either:

Validate that each sequence referenced in the vschema has a next_id higher than the current highest id used in the auto_inc column on the source table. If this is not already the case, print an error and do not switch traffic.

OR

Enable SwitchTraffic to manage the initialization of sequence tables with an appropriate next_id. In this scenario the operator creates the backing sequence tables in an unsharded keyspace, but leaves them empty/uninitialized and lets SwitchTraffic initialize the sequences, inserting appropriate (id, next_id, cache) values.
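The two options above can be sketched roughly as follows. This is an illustration of the proposed behavior only, not the implementation that later shipped: the `SequenceState` type, the function names, and the default cache size are hypothetical, and the single row with id 0 follows the usual convention for Vitess sequence tables (treated here as an assumption).

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class SequenceState:
    table: str               # table being moved
    sequence: str            # backing sequence table named in the vschema
    source_max_id: int       # highest auto_inc value on the source table
    next_id: Optional[int]   # None if the sequence table is still empty


def validate(states: List[SequenceState]) -> List[str]:
    """Option 1: return one error per sequence that is not safely ahead."""
    errors = []
    for s in states:
        if s.next_id is None:
            errors.append(f"{s.sequence}: not initialized (used by {s.table})")
        elif s.next_id <= s.source_max_id:
            errors.append(
                f"{s.sequence}: next_id {s.next_id} is not higher than "
                f"max id {s.source_max_id} in {s.table}"
            )
    return errors


def init_statement(s: SequenceState, cache: int = 1000) -> str:
    """Option 2: build the statement that seeds an empty sequence table.

    Assumes the sequence name comes from a trusted vschema; real code
    would quote or escape the identifier.
    """
    return (
        f"INSERT INTO {s.sequence} (id, next_id, cache) "
        f"VALUES (0, {s.source_max_id + 1}, {cache})"
    )


states = [
    SequenceState("customer", "customer_seq", 100, 101),
    SequenceState("orders", "orders_seq", 100, None),
]
for err in validate(states):
    print("refusing to switch traffic:", err)
print(init_statement(states[1]))
```

Seeding with `source_max_id + 1` is only safe if the maximum is read while writes to the source are stopped; otherwise the moving-target problem described in the Context section still applies.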

Use Case(s)

It's common for Vitess users to import external datasets that are not currently sharded and shard them on import, or to move some tables out of unsharded keyspaces to sharded keyspaces as they grow in size. As it stands now, sequences can be a major pitfall in this process. Having some built-in checks/protections around this would remove a rough edge.

mattlord added Type: Feature Component: VReplication labels Aug 2, 2023

mattlord self-assigned this Aug 2, 2023

mattlord added this to the v18.0.0 milestone Aug 2, 2023

mattlord mentioned this issue Aug 2, 2023

VReplication: Initialize Sequence Tables Used By Tables Being Moved #13656

Merged

4 tasks

mattlord closed this as completed in #13656 Aug 9, 2023

mattlord mentioned this issue Apr 8, 2024

Feature Request: Drop auto_increment from sharded column definitions on MoveTables Create #15682

Closed
