-
Notifications
You must be signed in to change notification settings - Fork 352
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Test reporter logging is owned by root if first work item is docker work item #9208
Test reporter logging is owned by root if first work item is docker work item #9208
Comments
https://dnceng.visualstudio.com/internal/_git/dotnet-helix-machines/pullrequest/22581?_a=files should probably get this. |
PR is merged and should roll out next week. Changing the reporter to only use console logging involves stripping out logging from the regular helix client so I left it alone. |
The fix from Matt was reverted because it was causing Linux images to get into a boot loop. I reverted the change and the build is green again but I don't know the real cause. More details in #9217 I moved this issue back into backlog |
Color me skeptical because a linux-boot-related change was happening before this, but I'll work on investigating today. |
ah it was old linux versions, drats. https://dnceng.visualstudio.com/internal/_git/dotnet-helix-machines/pullrequest/22617 should fix this and add some logging for why reboots occur. |
I just discussed this with @ilyas1974 and since Image Factory is broken (they've updated a bunch of stuff that broke us and the Image Factory team has not been able to roll back yet) and this is hitting a broad swath of machines I will prepare a hotfix for it. |
@danmoseley FYI |
I figured out where this comes from. Mostly the docker tag
Hunting down the addition of this docker tag, I see it was added by thaystg seven days ago in dotnet/runtime#61776. @thaystg, we're trying to make this not break others in the future but ideally we should prepare a "user is UID 1000, and named helixbot" version of this for Helix testing if at all possible. |
It seems in the scenario where the first work item a machine ever runs is in a docker scenario and uses the reporter, that machine will not be able to appened to the runner logs, as noted below:
bad machine: run.py (reporter log) is owned by root:
Good machine: run.py (reporter log) is owned by helixbot:
We could just change ownership of all log files every work item, but as a shorter term fix that's much safer I will explore making the packing test reporter only log to the console; this will still be logged, and will not create files across the mounted volume with incorrect user ownership.
Adding Error message to track impact
Report
Summary
The text was updated successfully, but these errors were encountered: