-
Notifications
You must be signed in to change notification settings - Fork 2.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CI flake: sdnotify-conmon test: getting MAINPID=1 instead of READY #8718
Comments
Didn't create issue yet for that, but it looks like it's belong here... I just upgraded from 2.2.0 to 2.2.1, I get (timeout) error on all (6) different services running with systemd and sdnotify, that in turn makes all services to restart every 90s. systemd config:
Logs are also not showing anything in particular, the services work normally for ~90s, yet all are stuck in
|
@bytniak could you please file this as a new issue? I'm 99.99% sure that what you're seeing is unrelated to the flake reported in this issue. |
- run test: minor cleanup to .containerenv test. Basically, make it do only two podman-runs (they're expensive) and tighten up the results checks - ps test: add ps -a --storage. Requires small tweak to run_podman helper, so we can have "timeout" be an expected result - sdnotify test: workaround for containers#8718 (seeing MAINPID=xxx as last output line instead of READY=1). As found by the newly-added debugging echos, what we are seeing is: MAINPID=103530 READY=1 MAINPID=103530 It's not supposed to be that way; it's supposed to be just the first two. But when faced with reality, we must bend to accommodate it, so let's accept READY=1 anywhere in the output stream, not just as the last line. Signed-off-by: Ed Santiago <santiago@redhat.com>
A friendly reminder that this issue had no activity for 30 days. |
This has not recurred since #8720 merged - but that was a sweep-it-under-the-rug workaround. The real problem may or may not persist (I don't know, because there's no way to track it); and if it does, it may or may not be serious (I don't know, because I'm not systemd-savvy enough); and if it's serious, I don't know if it's a podman problem, or conmon, or systemd itself. I suspect our only rational option here is to close but I'd like someone else to confirm. @giuseppe ? @mheon ? |
I think we can close until and unless we see something similar in the wild - chasing sdnotify bugs is difficult and does not seem to have much payoff for us in general. |
Symptom: the
sdnotify : conmon
test in260-sdnotify.bats
is flaking a lot:This is a reminder for Ed to take a look at
podman/test/system/260-sdnotify.bats
Line 104 in a226e6e
podman stats
slirp check more robust #8648The text was updated successfully, but these errors were encountered: