-
Flux v1 has an image update automation feature that is in widespread use. Flux v2 will need an equivalent to reach parity with Flux v1 -- but of course we want it to work in the spirit of the GitOps Toolkit. I have been prototyping a set of components that do the image update automation, and that set the pattern for other kinds of automation. The design, in progress, is here: fluxcd/image-reflector-controller#5. The code is in https://github.com/squaremo/image-reflector-controller and https://github.com/squaremo/image-automation-controller. The main areas of uncertainty are:
Update: I have marked the kyaml setters design as the answer, since the implementation seems to work satisfactorily in practice. There are nonetheless lots of good ideas and points elsewhere in the discussion, and I'll be referring back here. Thanks everyone!
-
There's now a run-through of the "working at all" implementation: https://github.com/squaremo/image-automation-controller#readme (The implementation is split into two controllers: image-reflector, which scans image repositories, and image-automation, which updates git by looking at the outcome of the scans.)

This is what the manifests look like:

```yaml
# image.yaml
# Declares an image repository to scan. This is where things like registry
# auth and scan interval would go, for example.
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageRepository
metadata:
  name: app-image
spec:
  image: squaremo/cuttlefacts-app
```

```yaml
# policy.yaml
# Defines a rule for the "latest image" of some image repo. There could be
# different ways of filtering and ordering (similar to Flux v1, with glob and
# regexp, but potentially also things like "order by commit history").
apiVersion: image.fluxcd.io/v1alpha1
kind: ImagePolicy
metadata:
  name: app-policy
spec:
  imageRepository:
    name: app-image
  policy:
    semver:
      range: 1.x
```

```yaml
# repo.yaml
# This is a type from source-controller, but used here to give access to
# the git repository, for making updates.
apiVersion: source.fluxcd.io/v1alpha1
kind: GitRepository
metadata:
  name: cuttlefacts-auto
spec:
  url: ssh://git@github.com/squaremo/cuttlefacts-app-automated
  interval: 1m
  secretRef:
    name: cuttlefacts-auto-deploy
```

```yaml
# update.yaml
# This defines an automated update to do; in this case, update files
# in the given repo to use the latest image as calculated by the given
# ImagePolicy. It also gives parameters for the commits that will be made;
# here you could tell it to push to a fresh branch, or even open a PR, say.
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageUpdateAutomation
metadata:
  name: update-app
spec:
  gitRepository:
    name: cuttlefacts-auto
  update:
    imagePolicy:
      name: app-policy
  commit:
    authorName: Updatebot
    authorEmail: bot@example.com
    messageTemplate: |
      Here look what I did
```
-
Things that this sets out to solve:
-
Use case: automated updates for a public container image

Git-to-cluster reconciliation

Having the following deployment in Git:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: podinfo
  namespace: default
spec:
  selector:
    matchLabels:
      app: podinfo
  template:
    metadata:
      labels:
        app: podinfo
    spec:
      initContainers:
      - name: init
        image: stefanprodan/podinfo:4.0.0
        command:
        - sh
        - -c
        - "sleep 1"
      containers:
      - name: podinfod
        image: docker.io/stefanprodan/podinfo:4.0.0
        imagePullPolicy: IfNotPresent
        ports:
        - name: http
          containerPort: 9898
          protocol: TCP
        command:
        - ./podinfo
        - --port=9898
```

Having a GitRepository registered with a read-write deploy key:

```yaml
apiVersion: source.fluxcd.io/v1alpha1
kind: GitRepository
metadata:
  name: podinfo
  namespace: gitops-system
spec:
  interval: 1m
  url: ssh://git@github.com/stefanprodan/podinfo-deploy
  secretRef:
    name: ssh-podinfo
  ref:
    branch: master
```

Having a Kustomization reconciling podinfo:

```yaml
apiVersion: kustomize.fluxcd.io/v1alpha1
kind: Kustomization
metadata:
  name: podinfo
  namespace: gitops-system
spec:
  interval: 5m
  sourceRef:
    kind: GitRepository
    name: podinfo
  path: "./default/"
  prune: true
  healthChecks:
  - kind: Deployment
    name: podinfo
    namespace: default
  timeout: 2m
```

Image update based on semver range

Register the podinfo container registry:

```yaml
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageRepository
metadata:
  name: podinfo
spec:
  image: docker.io/stefanprodan/podinfo
```

Define the image update policy using a semver range:

```yaml
apiVersion: image.fluxcd.io/v1alpha1
kind: ImagePolicy
metadata:
  name: podinfo
spec:
  imageRepository:
    name: podinfo
  policy:
    semver:
      range: ">=4.0.0 <5.0.0"
```

Define the image update automation:

```yaml
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageUpdateAutomation
metadata:
  name: podinfo
spec:
  gitRepository:
    name: podinfo
  update:
    imagePolicy:
      name: podinfo
  commit:
    authorName: fluxcdbot
    authorEmail: fluxcdbot@users.noreply.github.com
    messageTemplate: |
      Bump podinfo version
```

Questions
-
Proposal: select targets

The image update automation could target Kubernetes objects by label/namespace, or target a kustomization.yaml file by path.

Target Kubernetes objects

Select a group of Kubernetes objects such as deployments, daemonsets, statefulsets, cronjobs, jobs or pods:

```yaml
kind: ImageUpdateAutomation
spec:
  update:
    imagePolicy:
      name: app-policy
    selector:
      app: my-app
```

Select Kubernetes objects filtered by labels and namespaces:

```yaml
kind: ImageUpdateAutomation
spec:
  update:
    imagePolicy:
      name: app-policy
    selector:
      app: my-app
    namespaces:
    - dev
    - staging
```

The above proposal addresses only Kubernetes native kinds, and it doesn't allow targeting a specific container/initContainer -- only all of them. I don't think there is a valid use case for targeting individual containers.

Target kustomizations

Select a kustomization.yaml file by path:

```yaml
kind: ImageUpdateAutomation
spec:
  gitRepository:
    name: app-repo
  kustomization:
    path: "./deploy/overlays/dev" # <- kustomize edit set image app=app:semver
```
-
Chasing up a couple of things I mentioned ...

- Referring to more than one policy from a single automation object: for example, naming several policies to dereference when doing an update (see the sketch after this list). Pros: no need to have an automation object for each image policy; commits can group together changes. Cons: complication; using it in combination with a single selector might not make much sense (in general, you'd want to pick out which policies apply to which workloads).
- A policy that isn't calculated: for example, a fixed value that is updated by hand. Pros: simple, easy to use as a primitive for a higher layer. Cons: a proliferation of objects; you don't usually want to change the same image in lots of workloads, making selectors less useful.
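A minimal sketch of the first idea, assuming a repeated `imagePolicies` list -- the field name is hypothetical and nothing like this exists in the prototype yet:

```yaml
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageUpdateAutomation
metadata:
  name: update-app
spec:
  gitRepository:
    name: cuttlefacts-auto
  update:
    # Hypothetical: dereference several policies in one automation run,
    # so one commit can group together several image changes.
    imagePolicies:
    - name: app-policy
    - name: sidecar-policy
  commit:
    authorName: Updatebot
    authorEmail: bot@example.com
```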
-
How about something like the below?

```yaml
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageUpdateAutomation
metadata:
  name: podinfo
spec:
  gitRepository:
    name: podinfo
  policy:
    semver:
      range: ">=4.0.0 <5.0.0"
  images:
    registries:
    - name: docker.io
      include:
      - docker.io/*
      exclude:
      - docker.io/library/*
  commit:
    authorName: fluxcdbot
    authorEmail: fluxcdbot@users.noreply.github.com
    messageTemplate: |
      Bump podinfo version
```

```yaml
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageRegistry
metadata:
  name: docker.io
spec:
  url: docker.io
  secretRef: docker.io-auth
  interval: 10m
```
-
To paraphrase some of the things that have been mentioned in the discussion above ...

Limitations in the toytown implementation

The current implementation mentions a single image policy and a git repository, and updates every workload using the image in question, according to the policy. This is limited in several ways:

- You can't restrict it to updating only certain workloads, either by file path or by properties of the workload (e.g., its name, or labels). For some scenarios this is fine, since you'll only be automating the update of, say, a few images you build in your own CI. But it might cause problems if there are different configs in the git repo needing different policies -- say, if you have different directories for staging vs production.
- You can only update one image at a time. Some installations might have hundreds of images to be automated. That's 1. a lot of ...

Taking these one at a time:

When applying a group of policies, it makes less sense to have narrow selection of workloads -- there is a tension between coarse-grained operations and pin-pointing exactly what you want to change, because images are often close to 1-1 with workloads.

Targeting kustomizations

Kustomizations are explicitly supported by GOTK. To automate these, you have to target the kustomization.yaml file, rather than the YAMLs in the base. This would be an alternative to targeting specific files or workloads. In a design where you mark images for update (see above), ideally you would mark a field in the kustomization in the same way.

Parameterising the location of the kustomization file

@adusumillipraveen suggested that the path to the kustomization might be parameterised, similarly to how .flux.yaml commands receive various arguments in environment variables. This could end up being quite brittle, though; it's probably better to simply have automation objects for each of the variations you want (assuming the automation objects become a bit more expressive).

Targeting custom resources

It's a reasonable expectation that custom resources (a HelmRelease, say) could be updated too. This requires either a blunt instrument (replace the image string wherever you see it; not useful if, say, the tag is in its own field), or a little language for pointing at fields that contain image references. Kustomize knows how to update image fields of the usual Kubernetes types, but not how to patch the values in a HelmRelease file (see the sketch at the end of this comment).

A different factoring entirely, that focuses on which images to automate

@moshloop suggested a different factoring of the definition of an automation; instead of naming an image policy and selecting workloads, you describe which images (or registries) should be automated. This is quite close to how you might think of what you want as a user ("I want images in my own dockerhub org to be automated"). A potential downside is that it might become fiddly once you want to narrow down which workloads are affected, say. Another is that it may be hard to compose automation objects, since there's a lot of opportunity for specifications to overlap.
-
Design: Automation via kyaml setters2

In this design, the fields to be updated are marked in the YAML files themselves, using kyaml setters (https://github.com/kubernetes-sigs/kustomize/tree/master/kyaml/setters2) -- but this is mostly transparent for the user.
Target scenarios
Questions ...

How does this translate into ImageRepository and ImagePolicy objects? There are some alternatives:

A) Implicitly -- look at the current value for the setters, and perhaps some auxiliary information, and coalesce those into objects.

B) Higher-level description -- the repositories and policies are defined in the same place as the setters (or define the setters), and they are created by the controller interpreting that (again, perhaps with auxiliary information, like credentials, from elsewhere).

C) Explicitly -- there's a map from setter to image policy name, and the latter is assumed to exist.

One question that might decide it is: if the markers are in the files, where do the setter definitions live? Is there any sense in having them as a resource, or should they just be in a file in the repo? There's no particular benefit to having another object over and above the automation object, so you might expect to put this information in the automation object. If the definitions live in a resource in the cluster, it is a little harder to make sure they are in sync with the markers in the files in git. If they live in a file or files in the repo, that's one extra file that needs to be maintained -- but maybe that's not too bad, since you need to set up the markers anyway.

Providing credentials. Making available the credentials for accessing a particular image registry or image repo is a runtime concern. This implies it should be an object in the cluster (even if it's synced from git). In case C) above, image repository objects are created explicitly, and they will include (a reference to) the credentials. But in A) and B) above, the image repository objects may be created implicitly, and something will have to decide which credentials to use.

Using tags vs whole image vs other bits. Sometimes you might want to set only the tag, or set the tag and repo separately. This might need to be specified alongside the setter/policy, or perhaps synthetic names used, e.g., ... (see the sketch below).

Unanswered questions
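As an illustration of the tag-only case, a marker could in principle be attached to just the tag when the manifest splits repository and tag into separate fields. The marker format follows the `$ref` comments used elsewhere in this discussion; the field and setter names below are hypothetical:

```yaml
# Hypothetical values file where repository and tag are separate fields.
image:
  repository: example/app # {"$ref": "app-image-repo"}
  tag: 1.2.3              # {"$ref": "app-image-tag"}
```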
Hypothetical user guide for C)

You are here because you want to use Flux to automate updates in one or more git repositories. You might run the automation in the same cluster as you run the apps that are being automated; or you might run it in another cluster, or at least in a separate namespace.

!!! note
    Unlike Flux v1, automation in Flux v2 does not require the workloads to be running in the same cluster.
To start the process, you'll have to give the automation access to your git repository. Create a GitRepository with the Flux command-line:
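Whatever the exact command, the object it creates would be a GitRepository along the lines of the examples earlier in this discussion; the names and URL below are placeholders:

```yaml
apiVersion: source.fluxcd.io/v1alpha1
kind: GitRepository
metadata:
  name: app-repo          # placeholder name
  namespace: gitops-system
spec:
  interval: 1m
  url: ssh://git@github.com/example/app-deploy  # placeholder repository
  secretRef:
    name: app-deploy-key  # read-write deploy key, as described above
```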
First we'll create a policy for updating a particular image; then we'll apply that policy in the deployment file for the app.
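Such a policy would look like the ImageRepository/ImagePolicy pair shown at the top of this discussion; for example (the image name is a placeholder, and the policy name is chosen to match the `$ref` marker used in the deployment below):

```yaml
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageRepository
metadata:
  name: app-image
spec:
  image: squaremo/helloworld     # placeholder image
---
apiVersion: image.fluxcd.io/v1alpha1
kind: ImagePolicy
metadata:
  name: app-image-policy         # referenced by the $ref marker in the deployment
spec:
  imageRepository:
    name: app-image
  policy:
    semver:
      range: 1.x
```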
Now we have an image policy that yields a particular image with its tag, which we'll attach to a deployment.
The deployment file, with the marker comment added, looks like this:

```yaml
apiVersion: apps/v1
kind: Deployment
metadata:
  name: helloworld
spec:
  template:
    spec:
      containers:
      - name: hello
        image: squaremo/helloworld:v1.0.0 # {"$ref": "app-image-policy"}
      # ...
```

Commit the change so it'll be seen by automation:
Lastly, create an automation object to tell the controller to run automation using this file:
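A sketch of such an automation object, following the shape of the update.yaml example at the top of this discussion; the `path` field is an assumption about how design C) might point at the files carrying markers, not settled API:

```yaml
apiVersion: image.fluxcd.io/v1alpha1
kind: ImageUpdateAutomation
metadata:
  name: update-app
spec:
  gitRepository:
    name: app-repo        # the GitRepository created above
  update:
    # Assumption: in design C) the automation points at the files carrying
    # setter markers; the exact field for this is not settled.
    path: ./app
  commit:
    authorName: Updatebot
    authorEmail: bot@example.com
    messageTemplate: |
      Automated image update
```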
This will update any files under app/, so if you had other images to be updated, you could go through the above steps to create a policy and mark fields to be updated. You can monitor the automation object to see when the automation has run:
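What you would see there is implementation-dependent; a hypothetical status, following the GitOps Toolkit convention of a Ready condition (the field values below are illustrative, not the actual API):

```yaml
status:
  conditions:
  - type: Ready
    status: "True"
    reason: ReconciliationSucceeded
    message: "committed and pushed update"  # illustrative message
```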
-
Does this proposal include chart version update automation? I know the helm-controller has support for specifying a semver range already, but if you want more control/history/observability of your chart version updates than that, it would be useful to drive those through git commits (or even pull requests, if you want to gate with human code review) instead.
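For reference, the existing helm-controller behaviour the question alludes to is a semver range on the chart template, roughly like this; the API version and names are assumptions for illustration, so check the helm-controller docs for the current fields:

```yaml
apiVersion: helm.toolkit.fluxcd.io/v2beta1  # assumed version, for illustration
kind: HelmRelease
metadata:
  name: app
spec:
  interval: 5m
  chart:
    spec:
      chart: app
      version: ">=4.0.0 <5.0.0"  # helm-controller resolves the latest chart in this range
      sourceRef:
        kind: HelmRepository
        name: app-charts
```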
-
Any thoughts about supporting code review gating of the git updates? So rather than direct git commits, it would create pull requests / patchsets (Gerrit terminology). If there is an open pull request when an update comes in, it would update the existing pull request rather than creating a new one, so that outdated pull requests aren't left around. Note that this is similar to Dependabot. There have been requests there for container image, HelmRelease and Helm chart dependency versions; however, those are not implemented yet, and Dependabot is specific to GitHub and doesn't support Gerrit, for example. So it would be great to have that functionality in gotk.
-
I was wondering if there was any thought around supporting other templating solutions here (i.e. not just Helm or Kustomize). In Flux v1 the solution seems fairly generic (via ...). This was useful because it basically overwrites whatever is in your rendered manifests (that seemed to meet our use case). It seems like my question is similar to those around HelmRelease CRDs or charts (although in my case, it's just Jsonnet).
-
Is the ImagePolicy semver only? Or can I configure it to get the latest tag, so that the latest sha256 id of the image gets replaced by the setter? This would be my dev-environment use case: always deploy the latest version of my app.
-
Hello! Is there a roadmap for other policies in addition to semver? I'm thinking along the lines of Flux v1, which has regexp and glob, with either full regexp or some simple patterns. Also, does semver work with a prefix like vX.X.X, or with suffixes like -mybuild?