Updates for usability of GKE self managed setup script #759

KatrinaHoffert · 2019-05-24T19:30:02Z

For issue #758.

This does several things to make this setup much easier. But in short, it allows a single command to be used to setup for the vast majority of cases (with just ./gke-self-managed.sh -n CLUSTER_NAME -z ZONE).

The CLI flags are updated to actually work (ie, represent those used by the GLBC now).
The gce.conf file is automatically populated from querying the cluster. All options can be instead provided as CLI flags if the querying gets them wrong or to speed things up slightly (the time to query is pretty negligible compared to the time to restart the cluster to turn off the default GLBC, though).
The glbc.yaml file is automatically setup with the image instead of letting users create an invalid deployment that won't be caught till runtime.
The script defaults to building and pushing the image, making the script usable for quickly testing changes. The image to use can be specified as a flag instead. The registry to use is customizable the same way make push does.
We handle progress better. No more winging it by hoping the API server is up and running.

Opted not to make a test for it this ticket. We really should have some kinda test (since the script got terribly broken in 2 ways), but it's quite difficult to test given that it requires cluster ownership, needs push access to some container registry, requires an existing cluster, and is very slow no matter how you cut it (it needs to restart a cluster and realistically, a test would also need to create an ingress, which takes a few minutes). I'll make a new issue for testing it, but it's likely to be lower priority as the ROI is complicated.

This is a non-breaking change, since the existing gce.conf file is used as a template and if it were already filled in, those fields would just be ignored. The files that need modification are now copied to and .gitignored, too, so no risks of accidental commits or annoyance of having them show up in the git status.

k8s-ci-robot · 2019-05-24T19:30:10Z

Hi @KatrinaHoffert. Thanks for your PR.

I'm waiting for a kubernetes member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

KatrinaHoffert · 2019-05-24T19:40:37Z

And yes, my bash skills are not that great. There's probably tons of room for micro-optimizations and all with the tools I use for filtering text and all. I tried to use gcloud's formatting as much as possible. I don't think it's a big area to nitpick, though, since the amount of time to update the cluster massively dominates the runtime.

There's some areas for future improvement that I did not do due to time constraints. Future improvements could be:

Check the cluster actually exists, as some commands assume it does.
Don't require the CWD to be the script's location.
Better handling of resources already existing (can often be skipped to allow resuming).
Stream the make output with tee.
There's probably some gcloud --format magic that could be used to avoid that ugly cut for getting instance group names from the cluster.
We use the output of gcloud container clusters describe up to three times and it could be cached.
We should check the gcloud config stuff and permissions before we do anything.
We are dependent on the default GLBC HTTP backend already existing. We probably don't need to (just allow a random node port in that case?).
The heck is manual_glbc_provision?
Shebang is not on first line.
Most new commands I added don't handle errors.

MrHohn · 2019-05-24T19:53:48Z

/ok-to-test

.gitignore

docs/deploy/resources/glbc.yaml

docs/deploy/gke/gke-self-managed.sh

rramkumar1

In general, looks good to me. Just a couple more nits before I will lgtm officially.

docs/deploy/resources/gce.conf

docs/deploy/gke/README.md

docs/deploy/gke/gke-self-managed.sh

- Removed arguments that no longer exist and those that are defaults. - Updated command to not depend on sh (not available by default). Use same pattern as most GKE containers do. - Image URL now ensures that the YAML would be invalid (more obvious if you miss it).

All values are overridable in case the generation messes up.

We build and push automatically if the user doesn't provide --image-url.

Better UX this way, too. Also more output about progress and consistency in file names.

Also fixed details on what the script does. It was confusing before and had some order wrong.

1. Restored confirmation for cluster being ready after disabling the default GLBC. Also made this affected by `--no-confirm`, since the intent of that was to allow non-interactive usage (cause forced interactive scripts are gross). Made that set `--quiet` for gcloud commands while I was at it, since some of those can confirm (and made it a versatile "provide arbitrary extra gcloud flags" thing). Only applied to mutating gcloud commands. 2. Build and push no longer implicit. As a result, either the `--image-url` or `--build-and-push` must be set. This is now validated. 3. Added validation for if CONTAINER_PREFIX is set while I was at it, since it can break build-and-push. I doubt anyone sets it, but better safe than sorry. This just avoids an inevitible error. 4. Placeholder used in GLBC YAML changed. 5. Using `.gen` as extension for generated files. 6. Make command is now logged so user sees what we're doing. 7. Clarified help text.

This is very useful for testing. It uses kubectl --dry-run where possible, so we're actually testing the structure of the YAML files (the creation of the key does not use this since it depends on running a real command). This command can also be used to quickly and cleanly get the commands this script would run for manual usage. Fixed bug with --cleanup where it would ignore any args after the --cleanup tag (which could result in running the wrong things!).

Also fixed dumb quote bug that wouldn't play nice with eval.

KatrinaHoffert · 2019-06-04T17:46:21Z

All done, @rramkumar1

rramkumar1 · 2019-06-04T17:52:27Z

/lgtm

Thanks for all these changes!

k8s-ci-robot · 2019-06-04T17:52:38Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: KatrinaHoffert, rramkumar1

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [rramkumar1]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. size/L Denotes a PR that changes 100-499 lines, ignoring generated files. labels May 24, 2019

k8s-ci-robot requested review from bowei and freehan May 24, 2019 19:30

KatrinaHoffert mentioned this pull request May 24, 2019

GKE self managed setup script should have an e2e test #760

Closed

k8s-ci-robot added ok-to-test Indicates a non-member PR verified by an org member that is safe to test. and removed needs-ok-to-test Indicates a PR that requires an org member to verify it is safe to test. labels May 24, 2019

rramkumar1 reviewed May 30, 2019

View reviewed changes

.gitignore Outdated Show resolved Hide resolved

docs/deploy/resources/glbc.yaml Outdated Show resolved Hide resolved

docs/deploy/gke/gke-self-managed.sh Show resolved Hide resolved

docs/deploy/gke/gke-self-managed.sh Outdated Show resolved Hide resolved

KatrinaHoffert force-pushed the issue-758 branch from 7b39487 to 790fb14 Compare May 31, 2019 18:07

rramkumar1 reviewed Jun 3, 2019

View reviewed changes

docs/deploy/resources/gce.conf Outdated Show resolved Hide resolved

docs/deploy/gke/README.md Outdated Show resolved Hide resolved

docs/deploy/gke/gke-self-managed.sh Outdated Show resolved Hide resolved

docs/deploy/gke/gke-self-managed.sh Show resolved Hide resolved

KatrinaHoffert added 7 commits June 4, 2019 13:22

Made gce.conf generated

b7b6b22

All values are overridable in case the generation messes up.

GKE self managed script now automated GLBC yaml

b7576f2

We build and push automatically if the user doesn't provide --image-url.

No more winging it with time estimates

c64aaf9

Better UX this way, too. Also more output about progress and consistency in file names.

Updated readme for new steps

672cd4f

Also fixed details on what the script does. It was confusing before and had some order wrong.

KatrinaHoffert force-pushed the issue-758 branch from 790fb14 to 09d3fa5 Compare June 4, 2019 17:40

Cleaner gce.conf + review changes

147a374

Also fixed dumb quote bug that wouldn't play nice with eval.

KatrinaHoffert force-pushed the issue-758 branch from 09d3fa5 to 147a374 Compare June 4, 2019 17:45

k8s-ci-robot assigned rramkumar1 Jun 4, 2019

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Jun 4, 2019

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jun 4, 2019

k8s-ci-robot merged commit 688894c into kubernetes:master Jun 4, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updates for usability of GKE self managed setup script #759

Updates for usability of GKE self managed setup script #759

KatrinaHoffert commented May 24, 2019

k8s-ci-robot commented May 24, 2019

KatrinaHoffert commented May 24, 2019 •

edited

Loading

MrHohn commented May 24, 2019

rramkumar1 left a comment

KatrinaHoffert commented Jun 4, 2019

rramkumar1 commented Jun 4, 2019

k8s-ci-robot commented Jun 4, 2019

Updates for usability of GKE self managed setup script #759

Updates for usability of GKE self managed setup script #759

Conversation

KatrinaHoffert commented May 24, 2019

k8s-ci-robot commented May 24, 2019

KatrinaHoffert commented May 24, 2019 • edited Loading

MrHohn commented May 24, 2019

rramkumar1 left a comment

Choose a reason for hiding this comment

KatrinaHoffert commented Jun 4, 2019

rramkumar1 commented Jun 4, 2019

k8s-ci-robot commented Jun 4, 2019

KatrinaHoffert commented May 24, 2019 •

edited

Loading