-
Notifications
You must be signed in to change notification settings - Fork 4.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Tests crashing in CI with no dump: exit code 137 means SIGKILL Killed #97049
Comments
Tagging subscribers to this area: @dotnet/area-infrastructure-libraries Issue DetailsBuild InformationBuild: https://dev.azure.com/dnceng-public/cbb18261-c48f-4abb-8651-8cdcb5474649/_build/results?buildId=527171 Error MessageFill the error message using step by step known issues guidance. {
"ErrorMessage": "exit code 137 means SIGKILL Killed eg by kill",
"ErrorPattern": "",
"BuildRetry": false,
"ExcludeConsoleLog": false
}
|
We need dumps to make this diagnosable. |
SIGKILL is a pretty unusual way to take down the process... do we know if there's anything in the infra which can produce a SIGKILL? |
Exit code 137 can be caused by OOM. |
It doesn't look like we have a mechanism to grab dumps if it is OOM, though: #52521 |
This seems to fail consistently on all PRs |
I was able to catch a live local repro and attach debugger to it. There is one run away thread with extremely deep stack trace. All other threads are waiting for the GC suspension to finish. The run-away thread keeps allocating memory at very fast pace. You can see that by running The repro is sensitive to timing. It stopped reproing for me if I added any kind of verbose logging.
|
@janvorli Could you please take a look? It is hit by nearly all CI jobs and it looks related to your EH refactoring. |
I will take a look. |
@jkotas do you happen to know which of the tests in the suite was failing when you were able to repro it? I am currently trying to run the |
|
There are multiple issues:
|
Build Information
Build: https://dev.azure.com/dnceng-public/cbb18261-c48f-4abb-8651-8cdcb5474649/_build/results?buildId=527171
Build error leg or test failing: System.Text.Json.Tests
Pull request: #96894
Error Message
Fill the error message using step by step known issues guidance.
Known issue validation
Build: 🔎 https://dev.azure.com/dnceng-public/public/_build/results?buildId=527171
Error message validated:
exit code 137 means SIGKILL Killed
Result validation: ✅ Known issue matched with the provided build.
Validation performed at: 1/25/2024 7:09:47 PM UTC
Report
Summary
The text was updated successfully, but these errors were encountered: