TiCDC owner availability check #3354
Labels
area/ticdc
Issues or PRs related to TiCDC.
component/metrics-logging
Metrics and logging component.
subject/new-feature
Denotes an issue or pull request adding a new feature.
Is your feature request related to a problem?
Such as https://github.com/pingcap/ticdc/issues/3352 and https://github.com/pingcap/ticdc/issues/3331, ticdc owner could be stuck and the whole TiCDC cluster will be abnormal. Note the owner may be fallen into some long time wait condition, including IO wait, some logic wait condition etc, instead of etcd session lost.
Describe the feature you'd like
Add a background goroutine to detect the availability of cdc owner, it can check either the metric counter of
ticdc_owner_ownership_counter
or something else, to ensure TiCDC owner works well. If the checker finds TiCDC owner is not available, it could do following thingsDescribe alternatives you've considered
No response
Teachability, Documentation, Adoption, Migration Strategy
No response
The text was updated successfully, but these errors were encountered: