-
Notifications
You must be signed in to change notification settings - Fork 9.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
corruption alarm #8420
corruption alarm #8420
Conversation
0ca2287
to
14f0c2f
Compare
Would this cover #8313 as well? |
14f0c2f
to
8e1b0ca
Compare
@heyitsanthony We probably should put this under an alpha feature flag initially? |
etcdserver/apply.go
Outdated
@@ -577,9 +577,11 @@ func (a *applierV3backend) Alarm(ar *pb.AlarmRequest) (*pb.AlarmResponse, error) | |||
break | |||
} | |||
|
|||
plog.Warningf("alarm %v raised by peer %+v", m.Alarm, types.ID(m.MemberID)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should just be %s
? Becaue types.ID
implements String
method?
etcdserver/corrupt.go
Outdated
for _, c := range cli.Endpoints() { | ||
ctx, cancel := context.WithTimeout(context.Background(), s.Cfg.ReqTimeout()) | ||
resp, herr := cli.HashKV(ctx, c, rev) | ||
fmt.Printf("got %+v %v\n", resp, err) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Log instead of print? Or just delete?
13588d7
to
b0dfe76
Compare
Modest thrashing on this one. Putting cmux on the peer port on the integration package caused trouble due to connections hanging in the matcher phase on shutdown. I vendored the mainline cmux since it has a read timeout which eventually cancels the read. |
@@ -216,6 +216,9 @@ func newConfig() *config { | |||
// auth | |||
fs.StringVar(&cfg.AuthToken, "auth-token", cfg.AuthToken, "Specify auth token specific options.") | |||
|
|||
// experimental | |||
fs.DurationVar(&cfg.ExperimentalCorruptCheckTime, "experimental-corrupt-check-time", cfg.ExperimentalCorruptCheckTime, "Duration of time between cluster corruption check passes.") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Also add this to https://github.com/coreos/etcd/blob/master/etcdmain/help.go?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm. thanks.
b0dfe76
to
603f84b
Compare
Has fixes not in fork. Includes SetReadTimeout.
Official release is ahead of the fork.
Adds grpc to peers to get the hashkv rpc.
Fixes #7125