etcd client outCh blocking too long when pause changefeed #10629

Closed
CharlesCheung96 opened this issue Feb 20, 2024 · 1 comment · Fixed by #10686
Labels
affects-7.5 area/ticdc Issues or PRs related to TiCDC. severity/moderate type/bug The issue is confirmed as a bug.

Comments

@CharlesCheung96
Contributor

CharlesCheung96 commented Feb 20, 2024

What did you do?

  1. Create changefeed with pulsar sink
  2. Stop the pulsar server
  3. Delete some data from upstream
  4. Pause the changefeed

What did you expect to see?

No response

What did you see instead?

[2024/02/20 20:21:00.048 +08:00] [WARN] [client.go:272] ["etcd client outCh blocking too long, the etcdWorker may be stuck"] [duration=14m2.000284532s] [role=processor]
[2024/02/20 20:21:00.406 +08:00] [WARN] [client.go:272] ["etcd client outCh blocking too long, the etcdWorker may be stuck"] [duration=8m16.000233667s] [role=owner]

ticdc.log

goroutine.log

Versions of the cluster

Upstream TiDB cluster version (execute SELECT tidb_version(); in a MySQL client):

(paste TiDB cluster version here)

Upstream TiKV version (execute tikv-server --version):

(paste TiKV version here)

TiCDC version (execute cdc version):

2eadc08f4bd64d00250e9ce6f7c69eda5498464c

CharlesCheung96 added the type/bug and area/ticdc labels on Feb 20, 2024
@asddongmen
Contributor

This happens because the Close methods of the Pulsar client and the Pulsar producer block when the downstream Pulsar server is down.
A possible fix is to close the Pulsar client and producer in a separate goroutine, so that the caller is not blocked.
