kv: add ResolveTimestampRequest #73399

Closed
ajwerner opened this issue Dec 2, 2021 · 4 comments
Labels
C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception) no-issue-activity T-kv KV Team X-stale

Comments


ajwerner commented Dec 2, 2021

Is your feature request related to a problem? Please describe.

We have these RangeFeeds which send a stream of events and periodically send checkpoints telling the client that they've seen all events over a span up to some timestamp. The checkpoints trail the present by some amount, a good while (3s), for reasons that arguably relate to CockroachDB's lack of buffered writes (#72614) and read pessimism (#52768, though we could do it in-memory with broadcasted verification and 2PC like Spanner).
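
For illustration, here is a minimal sketch of the stream shape being described, with made-up Go types standing in for the real rangefeed protos:

```go
// Illustrative stand-ins for the rangefeed stream described above; these are
// not the real CockroachDB rangefeed types.
package rangefeedsketch

// Timestamp stands in for an HLC timestamp.
type Timestamp int64

// WriteEvent reports a single committed write at an MVCC timestamp.
type WriteEvent struct {
	Key string
	TS  Timestamp
}

// Checkpoint promises that all write events over Span with TS at or below
// ResolvedTS have already been delivered, i.e. from the watcher's point of
// view the span's history at or below ResolvedTS is now immutable.
// ResolvedTS trails the present, typically by a few seconds.
type Checkpoint struct {
	Span       string
	ResolvedTS Timestamp
}
```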

Given that the closed timestamp doesn't track the present, it's easy to have a scenario where a write at timestamp t2 commits and the event is sent to the watching RangeFeed, and then a separate transaction commits several seconds later at an earlier timestamp t1. Until the timestamp is resolved, history is mutable. However, if a transactional Scan occurs over that keyspan, then, for all intents and purposes other than the RangeFeed, the timestamp is effectively resolved up to the timestamp of the Scan. That all happens via the TimestampCache.

Sometimes it'd be really nice if the committing of a transaction were immediately followed by the resolving of some key spans. In the multi-tenant zone config design (RFC #66348, which will hopefully merge soon) we have an asynchronous task which reconciles changes to descriptors and zone configs into the system tenant. In the current implementation, reconciliation occurs only after all of the watched data has been checkpointed (this lets us simplify a bunch of stuff).

It's not hard to imagine why the SQL statements which change these configurations would want to know when the reconciliation has actually happened. Here are a couple of reasons:

  1. In a serverless setting, there's a risk that the pod will scale down before reconciliation happens.
  2. If we're trying to protect data using protected timestamps, the invariants all depend on when the protected timestamp actually makes it to the host cluster. If the operation issued by the client has no direct relationship to the reconciliation, it's hard to say much of anything about protected timestamps actually working.

Describe the solution you'd like

This issue proposes that we add a new non-transactional KV request, ResolveTimestampRequest, which takes a span and a timestamp. The operation is a writing request which scans the entire span of any range it overlaps (the reason for resolving the whole range is that we don't have a finer-grained notion of a closed timestamp, and that seems fine for my purposes). Semantically, the request would operate mostly like an MVCC scan at its request timestamp that throws away all of the data it reads, followed by a replicated command which moves the closed timestamp up to the request timestamp. The systems we have in place for concurrency control should take care of the rest of the semantics.
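
Very roughly, the request shape being proposed might look like the sketch below. The field and type names here are hypothetical stand-ins; the real definition would presumably be a protobuf message alongside the other KV requests in roachpb.

```go
// Hypothetical sketch of the proposed request; not actual CockroachDB code.
package kvproposal

// Timestamp stands in for hlc.Timestamp.
type Timestamp struct {
	WallTime int64
	Logical  int32
}

// Span stands in for roachpb.Span.
type Span struct {
	Key, EndKey []byte
}

// ResolveTimestampRequest asks each overlapping range to behave roughly like
// an MVCC scan at Timestamp that discards what it reads (so that concurrency
// control treats it like a scan over the span), and then to replicate a
// command that bumps the range's closed timestamp up to Timestamp.
type ResolveTimestampRequest struct {
	Span      Span      // keyspan whose history should become immutable
	Timestamp Timestamp // resolve (close) the span up to this timestamp
}
```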

The downstream effect of such a request is that all listening RangeFeeds will end up getting a checkpoint.
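
Continuing the illustrative types from the sketch above, a watcher that wants to act only once a span is resolved up to some timestamp could simply block on the checkpoint stream (again, made-up names, not the real rangefeed client API):

```go
package rangefeedsketch

// WaitForResolved returns once a checkpoint at or above target arrives,
// meaning no further events at or below target can show up for the span.
// Checkpoint and Timestamp are the illustrative types sketched earlier.
func WaitForResolved(checkpoints <-chan Checkpoint, target Timestamp) {
	for cp := range checkpoints {
		if cp.ResolvedTS >= target {
			return
		}
	}
}
```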

Describe alternatives you've considered

We could alternatively express the desire to resolve timestamps as a zone config on ranges, such that all writes to some of these ranges auto-resolve up to the present. That seems worse and too tightly coupled.

Additional context

The proposal, then, is that we'd have the schema changes which are intended to lead to zone config changes issue such a request immediately upon committing (this can be parallelized with the wait-for-version checks if you want) and then expect reconciliation to occur promptly. We'll probably need to build a mechanism to determine the implied zone config changes of a SQL statement to make any limits work happen anyway.
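
A sketch of how that sequencing might look from the schema-change side, with entirely hypothetical helper names standing in for the real transaction, KV, and reconciliation plumbing:

```go
// Hypothetical sequencing sketch; none of these helpers exist in the tree.
package reconcilesketch

import (
	"context"

	"golang.org/x/sync/errgroup"
)

// Timestamp stands in for hlc.Timestamp.
type Timestamp int64

// commitAndResolve commits the schema change, then issues the proposed
// ResolveTimestamp request over the affected config spans in parallel with
// the usual wait-for-version checks, and finally waits for the reconciler to
// report that it has caught up to the commit timestamp.
func commitAndResolve(
	ctx context.Context,
	commit func(context.Context) (Timestamp, error),
	resolveTimestamp func(context.Context, Timestamp) error, // the proposed KV request
	waitForVersion func(context.Context) error,
	waitForReconciliation func(context.Context, Timestamp) error,
) error {
	commitTS, err := commit(ctx)
	if err != nil {
		return err
	}
	g, gCtx := errgroup.WithContext(ctx)
	g.Go(func() error { return resolveTimestamp(gCtx, commitTS) })
	g.Go(func() error { return waitForVersion(gCtx) })
	if err := g.Wait(); err != nil {
		return err
	}
	return waitForReconciliation(ctx, commitTS)
}
```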

Jira issue: CRDB-11574

@ajwerner ajwerner added the C-enhancement Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception) label Dec 2, 2021
@blathers-crl blathers-crl bot added the T-kv KV Team label Dec 2, 2021

ajwerner commented Feb 2, 2022

The request approach is nice because, if the data spans multiple ranges, the client knows that. The downside is that if there are multiple clients, they might race; that could be fine, especially if they all use the same timestamp. Another approach not mentioned above is to have the rangefeed itself ask for the range to be closed immediately after an event is published. This might well be the best approach, because we can make sure there's at most one such request per replica (there's at most one rangefeed processor per replica) and it's the most dynamic. Internally it'd probably end up using exactly this request though, so we probably need something like this regardless.


ajwerner commented Feb 2, 2022

The above was a little silly. The problem with the rangefeed issuing it is that it doesn't know whether the client is watching more ranges in its span than just the current range. In order to have an event on one range lead to a checkpoint over the whole span, we'd either need to have the client send the request or have the server side know about the whole span (or spans). The latter seems bad.

@ajwerner

This proposal is particularly fitting for cluster settings where we don't support writing to the table in a transaction. Though, I suppose, you could just as well set the closed timestamp interval on that table to 0.

@github-actions

We have marked this issue as stale because it has been inactive for 18 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to CockroachDB!
