Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate merge-blocking jobs to dedicated cluster: pull-kubernetes-e2e-gce-network-proxy-grpc #18853

Closed
spiffxp opened this issue Aug 14, 2020 · 12 comments
Assignees
Labels
area/jobs kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. sig/testing Categorizes an issue or PR as relevant to SIG Testing.

Comments

@spiffxp
Copy link
Member

spiffxp commented Aug 14, 2020

What should be cleaned up or changed:

This is part of #18550

To properly monitor the outcome of this, you should be a member of k8s-infra-prow-viewers@kubernetes.io. PR yourself into https://github.com/kubernetes/k8s.io/blob/master/groups/groups.yaml#L603-L628 if you're not a member.

Migrate pull-kubernetes-e2e-gce-network-proxy-grpc to k8s-infra-prow-build by adding a cluster: k8s-infra-prow-build field to the job:

NOTE: migrating this job is not as straightforward as some of the other #18550 issues, because:

  • it doesn't run for every single PR
  • so in terms of appropriate amount of PR traffic, either manually trigger this job, or look for open PRs that have triggered this job (e.g. using the job-history link below), and explicitly /test <this-job-name> on them

Once the PR has merged, note the date/time it merged. This will allow you to compare before/after behavior.

Things to watch for the job

Things to watch for the build cluster

Keep this open for at least 24h of weekday PR traffic. If everything continues to look good, then this can be closed.

/wg k8s-infra
/sig testing
/area jobs
/help

@spiffxp spiffxp added the kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. label Aug 14, 2020
@k8s-ci-robot k8s-ci-robot added wg/k8s-infra sig/testing Categorizes an issue or PR as relevant to SIG Testing. help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. area/jobs labels Aug 14, 2020
@bartsmykla
Copy link

/assign
/remove-help

@k8s-ci-robot k8s-ci-robot removed the help wanted Denotes an issue that needs help from a contributor. Must meet "help wanted" guidelines. label Aug 17, 2020
bartsmykla pushed a commit to bartsmykla/test-infra that referenced this issue Aug 18, 2020
Migration of the job to the new cluster: `k8s-infra-prow-build` as a
result of kubernetes#18853

Signed-off-by: Bart Smykla <bsmykla@vmware.com>
@bartsmykla
Copy link

@spiffxp I opened the PR to test if migration succeeded and the job actually failed with the error:

W0820 08:13:15.836] ERROR: (gcloud.compute.project-info.describe) Could not fetch resource:
W0820 08:13:15.836]  - Required 'compute.projects.get' permission for 'projects/k8s-network-proxy-e2e' 

@spiffxp
Copy link
Member Author

spiffxp commented Aug 21, 2020

So it turns out I missed this bit in the job definition: - --gcp-project=k8s-network-proxy-e2e

You should delete that and see if the job still functions (this is equivalent to setting --gcp-project-type=gce-project)

If it fails, then we'll need to create a custom project type and understand what's unique about k8s-network-proxy-e2e. I would reach out to whomever added/authored the job to start.

@spiffxp
Copy link
Member Author

spiffxp commented Sep 1, 2020

I've opened #19079 to try out the fix I suggested in #18853 (comment)

@spiffxp
Copy link
Member Author

spiffxp commented Sep 1, 2020

Per kubernetes/kubernetes#94127 (comment) I think that worked

@RobertKielty
Copy link
Member

Hi @bartsmykla, have you had a chance to submit the changes?
cc @spiffxp

@spiffxp
Copy link
Member Author

spiffxp commented Sep 9, 2020

This job was made non-blocking via #19033 I would like to see it restored to blocking if we're confident it's been fixed

@spiffxp
Copy link
Member Author

spiffxp commented Sep 9, 2020

I'm keeping an eye on https://prow.k8s.io/pr-history/?org=kubernetes&repo=kubernetes&pr=94127 and think we should trigger a few more runs

@spiffxp
Copy link
Member Author

spiffxp commented Sep 11, 2020

https://prow.k8s.io/job-history/gs/kubernetes-jenkins/pr-logs/directory/pull-kubernetes-e2e-gce-network-proxy-grpc

It appears to be intermittently timing out

If I look back to before we started messing with this job (e.g. https://prow.k8s.io/job-history/gs/kubernetes-jenkins/pr-logs/directory/pull-kubernetes-e2e-gce-network-proxy-grpc?buildId=1299167300326264832), I see it also intermittently times out

Since it's pre-existing behavior, I'm willing to call this good enough.

@bartsmykla
Copy link

I agree and think we could call this done

@spiffxp
Copy link
Member Author

spiffxp commented Nov 5, 2020

/close

@k8s-ci-robot
Copy link
Contributor

@spiffxp: Closing this issue.

In response to this:

/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area/jobs kind/cleanup Categorizes issue or PR as related to cleaning up code, process, or technical debt. sig/testing Categorizes an issue or PR as relevant to SIG Testing.
Projects
None yet
Development

No branches or pull requests

4 participants