[orchagent] Orchagent keeps flooding ERROR log after teamd restarted #5971
Comments
@bingwang-ms When the teamd docker goes down, it brings down both the syncd and swss dockers along with it. The error message is expected and should stop when swss goes down. Later, when swss, syncd, and teamd are all up, the portchannels should be back to normal.
@judyjoseph
Please refer to the attached log.
@judyjoseph please update.
Hi @judyjoseph
@judyjoseph please update.
@bingwang-ms, @yxieca I tried this earlier today. On the old version SONiC.master.491-af654944, I also see the problem of swss/syncd not restarting when the teamd docker goes away, and the "python3 /usr/bin/docker-wait-any -s swss -d syncd teamd" wait process does not exit either. Could this be an intermediate build from when we were converting from python2 to python3? If I move to the latest sonic master buildimage, I don't see this problem anymore: teamd, syncd, and swss all go down, and the error message "swss#orchagent: :- removeLag: Failed to remove ref count" stops when the swss docker goes away. This is the correct behaviour. Note: I also observed a different behavior in the latest master build, where Port Channels are not cleaned up correctly when the teamd docker goes away. I will follow up on that, to see what changed, as part of the other issue #6199.
@bingwang-ms, if you could confirm similar behavior with the latest master image, we could close this case and follow up on issue #6199.
I tried to repro the issue on the latest image (SONiC.HEAD.215-c4156b87), and it seems that the issue has been addressed. The error message was gone after swss restarted, so I think we can close this issue now. But do you have any idea which PR fixed it?
Thanks @bingwang-ms. This ERROR message was verified to be fixed with #5628. I feel the image with the issue was an intermediate image made during the py2 to py3 conversion, where the docker-wait script had to be fixed.
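For readers unfamiliar with the wait process mentioned above, here is a rough sketch of the "wait for any" behaviour the thread expects: the wait returns as soon as one of swss, syncd, or teamd exits, so the others can be brought down too. This is not the actual docker-wait-any script. The `wait_any` function and the per-container blocking callables are hypothetical stand-ins for whatever the real script asks Docker.

```python
import queue
import threading


def wait_any(waiters):
    """Block until any one waiter returns, then return its name.

    `waiters` maps a container name to a zero-argument callable that blocks
    until that container has exited. The callables are hypothetical; a real
    script would query Docker instead.
    """
    finished = queue.Queue()

    def run(name, block_until_exit):
        block_until_exit()
        finished.put(name)

    for name, block_until_exit in waiters.items():
        # Daemon threads so still-blocked waiters never hold up the caller's exit.
        threading.Thread(target=run, args=(name, block_until_exit), daemon=True).start()

    # The first name placed on the queue is the first container seen to exit.
    return finished.get()
```

If the wait never returns after teamd exits, as reported for SONiC.master.491-af654944, swss and syncd stay up and orchagent keeps logging the removeLag error.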
Description
I noticed that orchagent kept flooding the ERROR log if teamd was restarted while debugging, and the portchannel was unable to recover.
Steps to reproduce the issue:
1. Run config reload to initialize the DUT.
2. Kill a process in the teamd container, say teammgrd.
3. teamd will be restarted, and ERROR is flooding.
Describe the results you received:
Orchagent is flooding ERROR, and portchannel is not recovered.
Describe the results you expected:
No error should be observed, and portchannel is recovered.
Additional information you deem important (e.g. issue happens only occasionally):
syslog.tar.gz
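To check from a captured syslog whether the flood is still going, a minimal sketch along these lines may help. It only assumes the message text quoted in this thread ("removeLag: Failed to remove ref count"). The function name and the idea of reading an extracted syslog file are illustrative, not part of SONiC.

```python
# Message text as quoted in this thread; the rest of each syslog line is not assumed.
MESSAGE = "removeLag: Failed to remove ref count"


def count_flood_lines(lines):
    """Return (count, last_matching_line) for the removeLag error message."""
    count = 0
    last = None
    for line in lines:
        if MESSAGE in line:
            count += 1
            last = line.rstrip("\n")
    return count, last
```

Passing an open file object for the extracted syslog works, since file objects iterate line by line. A count that stops growing between two captures, or a last match older than the swss restart, suggests the flood has ended.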