You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jun 6, 2024. It is now read-only.
Sometimes, users' job may be kill by OpenPAI due to several reasons. It's different from
the users' error. We can class these error as system error. And the container can't be reserved
if it's failed due to system error. So more detailed log should be provided. So that users could know the reason why their job container exits and can't be reserverd for job debugging.
An example of system error which will make your job failed:
- Disk pressure
Why is this needed:
If the job container can't be reserved if it failed due to system error. Users may feel confused.
Without this feature, how does the current module work:
User should investigate the job log.
Components that may involve changes:
TBD
The text was updated successfully, but these errors were encountered:
ydye
changed the title
[Job Debugging] provide detail information when the job container is exit.
[Job Debugging] Provide detail information when the job container exits.
Feb 26, 2019
- errorType: user/systempatterns:
pattenExitCode:
exitCode: 132userLog: ""runtimeLog: ""# more can be added herepattenUserLog:
userLog: "This is an error"reason: 'User program terminated by SIGILL'solution: 'Please check the log and retry again'containerExitCode: 132
What would you like to be added:
Why is this needed:
Without this feature, how does the current module work:
Components that may involve changes:
The text was updated successfully, but these errors were encountered: