Skip to content

Commit

Permalink
staging workflow (#26)
Browse files Browse the repository at this point in the history
* staging workflow: init empty RFC

* 0026-staging-workflow: first draft

* 0026-staging-workflow: second draft

Close vcunat#2

* 0026-staging-workflow alternatives: more discussion
  • Loading branch information
vcunat authored and zimbatm committed Nov 15, 2018
1 parent 13b5084 commit 56f5acd
Showing 1 changed file with 103 additions and 0 deletions.
103 changes: 103 additions & 0 deletions rfcs/0026-staging-workflow.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,103 @@
---
feature: staging-workflow
start-date: 2018-03-05
author: Vladimír Čunát (@vcunat)
co-authors: Frederik Rietdijk (@FRidh)
related-issues:
---

# Summary
[summary]: #summary

Define a new workflow for the `staging` branch that can better accomodate the
current and future influx of changes in order to deliver mass-rebuilds faster to
master. As part of this new workflow an additional branch, `staging-next`, shall
be introduced.


# Motivation
[motivation]: #motivation

The current workflow cannot handle the high amount of mass-rebuilds that are
continuously delivered, resulting in long delays for these deliveries to reach
`master`. When a certain delivery causes failures, attemps are typically made to
fix these failures and stabilize `staging` so that the specific delivery can still
reach `master`.

Often it happens that during this period of stabilization other mass-rebuilds
are submitted, and it is not uncommon that these also introduce failures, thus
again increasing the time it takes for a delivery to reach `master`. This is
especially worrysome in case of security fixes that need to be delivered as soon
as possible.

# Detailed design
[design]: #detailed-design

There shall be the following branches:
- `master` is the main branch where all small deliveries go;
- `staging` is branched from `master` and mass-rebuilds and other large deliveries go to this branch;
- `staging-next` is branched from `staging` and only fixes to stabilize and security fixes shall be delivered to this branch. This branch shall be merged into `master` when deemed of sufficiently high quality.

Binary packages shall be build by Hydra for each of these branches. The
following table gives an overview of the branches, the check interval in hours,
amount of shares, and the jobset that they build.


| Branch | Interval | Shares | Jobset
|----------------|----------|--------|-----------
| `master` | 4 | High | release.nix
| `staging-next` | 12 | Medium | release.nix
| `staging` | 6 | Medium | release-small.nix


The check interval of `staging-next` is reduced from 24 hours (the current value
for `staging`) to 12 hours. This can be done because only stabilization fixes
shall be submitted and thus fewer rebuilds shall typically have to be performed.

The `staging` branch shall have a short interval of only 6 hours. This is because
of the relatively small jobset, and to obtain a higher resolution to detect any
troublesome deliveries.

# Drawbacks
[drawbacks]: #drawbacks

A potential drawback of this new workflow is that the additional branch may be
considered complicated and/or more difficult to work with. However, for most
contributors the workflow will remain the same, that is, choose `master` or
`staging` depending on the number of rebuilds.

# Alternatives
[alternatives]: #alternatives

## Maintain the status quo

The current situation could be kept, however, that would not solve any of the
issues mentioned in the "Motivation" section.

## Single branch

Instead of multiple branches only a single branch, say `master`, could be kept
for development. While this removes the issue of merge conflicts, it will result
in continuous mass-rebuilds on `master`, slowing down the delivery of binary
substitutes and thus development.

## Reduce Hydra jobset size

Reducing the size of the Hydra jobset would mean the iteration pace could be
higher, but has the downside of testing fewer packages, and having fewer binary
substitutes available.

The part about fewer binary substitutes could be partially mitigated by adding
another slower larger jobset that wouldn't block the channel.

# Unresolved questions
[unresolved]: #unresolved-questions

- The exact amount of shares, which is something that has the be found out.

# Future work
[future]: #future-work

- Document the new workflow;
- Create the new branch;
- Create a Hydra jobset for the new branch and adjust the existing jobs.

0 comments on commit 56f5acd

Please sign in to comment.