Replies: 3 comments 1 reply
-
Before migrating to JustJ, my bash scripts built the new content in a new directory so that it was only during the brief window of the final
that anything could go wrong and if it went wrong the result was that the new content was just lost. The brief window ensured that it was very unlikely that a third party build could catch the 'latest' build in a transient state. Currently I get the occasional build for which either platform or EMF are in a transient state. |
Beta Was this translation helpful? Give feedback.
-
Apologies. This should be a JustJ duscussion. I can only transfer within OOMPH. Hopefully someone has greater power / skill. |
Beta Was this translation helpful? Give feedback.
-
There are several steps in the promotion process.
In your failed build we are using p2's mirror infrastructure to mirror the build to be promoted which in this case failed to mirror the update site to the local client structure. That terminated the process. So there is really no need to recover anything. No content on the server was changed. As such, there is simply a need to run the promotion again when the servers are in better shape. Only if the final rsync step failed part way mirroring content to the server would there be the potential for broken content on the server. Note that if the download server itself was in bad shape, we'd fail in the first step... I'm not sure if you saw this: https://git.eclipse.org/r/c/modisco/org.eclipse.modisco/+/207294 |
Beta Was this translation helpful? Give feedback.
-
https://ci.eclipse.org/modisco/job/justj-promoter/43/display/redirect has just failed with
Sadly this EF infrastructure failing is not rare and (on a Sunday) may take quite a few hours to be rectified.
Anyway, JustJ processing has been undermined. (The failed build has been marked as keep forever to facilitate developer investigation.) Last time this happened there was a bad repo entry and my attempts to supersede with later builds or tweak the contents failed. Fortunately the problem was a bad nightly so the brute force solution of deleting the entire nightly tree was available. However if it had occurred for a release build, the brute force approach would not be good.
Is there a recommended way to recover from the EF infrastructure breakdown? Should a JustJ bug be raised to improve resilience?
Beta Was this translation helpful? Give feedback.
All reactions