Sequence Tables
(Updated )
In MySQL, it is common to have an integer primary key that uses the AUTO_INCREMENT
feature for assigning IDs. However, if you have a horizontally-sharded table, you will not be able to use AUTO_INCREMENT
for your ID. In such a setup, the rows of the table are distributed across many instances of MySQL. The separate instances of MySQL do not have a built-in way to coordinate which IDs are in use and which are not. Instead, you will need to use a sequence table.
A sequence table is a special table that contains metadata for managing the incrementing ID values for the column of a horizontally sharded table. Each time you create a new horizontally sharded table, you should create the corresponding sequence table and update the VSchema.
Creating a horizontally-sharded table
We recommend keeping safe migrations enabled for all production databases. Thus, the first step to make schema modifications is to create a new branch, and connect to it via the command line.
Next, to create a horizontally-sharded table, switch to your desired sharded keyspace. Create a new table in this keyspace, and do not use AUTO_INCREMENT
for your ID column. For example, to create a table in the test_sharded
keyspace, run:
USE test_sharded; CREATE TABLE test(id BIGINT UNSIGNED PRIMARY KEY, data JSON);
Next, switch over to the unsharded keyspace that you want to use for sequence tables. Here, you'll create a sequence table. It is good practice to use the same name as the sharded table with _seq
or _sequence
appended. Being consistent with this naming will help maintain a clear association between your data tables and sequence tables.
USE test_unsharded; CREATE TABLE test_seq(id bigint, next_id bigint, cache bigint, primary key(id)) comment 'vitess_sequence';
We also need to update the VSchema of our database. We need to tell Vitess about this new SEQUENCE
, let it know to use the id
column as the shard key, and tell it to use the test_seq
table for fetching auto incrementing IDs.
ALTER VSCHEMA ADD SEQUENCE `test_unsharded`.`test_seq`; ALTER VSCHEMA ON `test_sharded`.`test` ADD VINDEX hash(id) USING hash; ALTER VSCHEMA ON `test_sharded`.`test` ADD auto_increment id USING `test_unsharded`.`test_seq`;
When you are comfortable with your schema changes, create a deploy request and merge.
Sequence table values
Unlike vanilla Vitess, PlanetScale will automatically populate the single required row into any sequence table created with the above steps. After merging your deploy request, you should be able to query the sequence table as follows:
SELECT * FROM test_unsharded.test_seq; +----+---------+-------+ | id | next_id | cache | +----+---------+-------+ | 0 | 1 | 1000 | +----+---------+-------+ 1 row in set (0.04 sec)
id
Should always be 0.next_id
represents the next ID in the sequence to be fetched. You typically want this to start as 1.cache
represents the number of IDs that can be fetched and cached by a VTTablet. For good performance, this should be set to a large number like 1000 or more.
We can check that the sequence table is working in assigning IDs by inserting a new row and then querying for the row with ID 1
.
INSERT INTO test (data) VALUES ('{"errors": [{"message": "Error message", "code": 10}]}'); Query OK, 1 row affected (0.06 sec) SELECT data FROM test WHERE id=1; +--------------------------------------------------------+ | data | +--------------------------------------------------------+ | {"errors": [{"message": "Error message", "code": 10}]} | +--------------------------------------------------------+ 1 row in set (0.05 sec)
Note
Check out the Vitess documentation on sequences for more information.
Need help?
Get help from the PlanetScale Support team, or join our GitHub discussion board to see how others are using PlanetScale.