
Respect volume name when reusing PVCs #1122

Merged: 18 commits into elastic:master on Jul 1, 2019

Conversation

@pebrc (Collaborator) commented Jun 20, 2019

Part of #877

  • puts the volume name into the PVC's labels so it is included in the comparison (a sketch of the idea follows below)
  • enables the previously commented-out e2e test that covers the default reuse case
  • adds another e2e test to make sure PVCs are reused while respecting their name
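A minimal Go sketch of the labeling idea referenced above (the label key and helper names are illustrative, not the operator's exact code):

```go
package pvc

import corev1 "k8s.io/api/core/v1"

// volumeNameLabel is a hypothetical label key recording which volume
// (e.g. "data") a PVC was originally created for.
const volumeNameLabel = "elasticsearch.k8s.elastic.co/volume-name"

// labelWithVolumeName records the volume name on the PVC so it can be taken
// into account when comparing existing PVCs against the desired ones.
func labelWithVolumeName(pvc *corev1.PersistentVolumeClaim, volumeName string) {
	if pvc.Labels == nil {
		pvc.Labels = map[string]string{}
	}
	pvc.Labels[volumeNameLabel] = volumeName
}

// matchesVolumeName reports whether the PVC was created for the given volume
// name and can therefore be reused under that name.
func matchesVolumeName(pvc corev1.PersistentVolumeClaim, volumeName string) bool {
	return pvc.Labels[volumeNameLabel] == volumeName
}
```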

I ran into an issue with scheduling pods that have more than one PVC: because we use volumeBindingMode: Immediate, more often than not the volumes would be allocated in two different zones, which made the pod unschedulable as it cannot be in two zones at once ...

The solution I found was to switch to volumeBindingMode: WaitForFirstConsumer, which creates the volumes only once the pod has been scheduled to a node. The only problem is that this does not work on k8s 1.11, where volumes need to be pre-provisioned. In 1.12 DynamicProvisioningScheduling is no longer feature gated and volume provisioning with late binding works as expected. So this PR also

  • adds a storage class template with WaitForFirstConsumer (sketched below)
  • bumps the default k8s version to 1.12
  • skips the new e2e test on 1.11
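For illustration, here is what a late-binding storage class amounts to, expressed with the Kubernetes Go API types; the class name and the GCE provisioner are assumptions for a GKE-like setup, not necessarily what this PR's template uses:

```go
package e2e

import (
	corev1 "k8s.io/api/core/v1"
	storagev1 "k8s.io/api/storage/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// lateBindingStorageClass builds a storage class that delays volume
// provisioning until a consuming pod has been scheduled, so all PVCs of a
// pod end up in the zone the pod was scheduled to.
func lateBindingStorageClass() *storagev1.StorageClass {
	bindingMode := storagev1.VolumeBindingWaitForFirstConsumer
	reclaimPolicy := corev1.PersistentVolumeReclaimDelete
	return &storagev1.StorageClass{
		ObjectMeta: metav1.ObjectMeta{
			Name: "e2e-default", // assumed name
		},
		Provisioner:       "kubernetes.io/gce-pd", // assumed GKE provisioner
		ReclaimPolicy:     &reclaimPolicy,
		VolumeBindingMode: &bindingMode,
	}
}
```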

pebrc requested review from sebgl and barkbay on June 20, 2019 at 17:10
@sebgl (Contributor) left a comment


I'm OK with the label fix, maybe not that much with the e2e test approach.

At first I thought the storageClass patch was not the right way to solve the problem: patching our e2e tests to change the default GKE storageClass does not help ECK users avoid this problem themselves.
But looking more at the docs around PVCs, it seems that using PVs without volumeBindingMode: waitForFirstConsumer is kind of broken by design; there is no other way to deal with zonal volumes and affinity rules.

I think maybe we should stick to using GKE 1.11 by default (it's still the GCP default?), skip the multi-PV test on k8s < 1.12, and create a new storageClass with a custom name and volumeBindingMode: waitForFirstConsumer only in the test that requires it, and use it only there.
This way the other tests still rely on GKE defaults and help us notice anything wrong with a "default" usage?

Review threads: operators/config/dev/default-storage.yaml, operators/test/e2e/failure_test.go, operators/Makefile
Inline comment on the following hunk:

}

func TestKillCorrectPVReuse(t *testing.T) {
	s := stack.NewStackBuilder("test-failure-pvc").

A reviewer (Contributor) commented:

I'm wondering if we should maybe patch the storageClass for this test only (like: in the test).
Also skip this test if the k8s version we are testing against is <1.12.
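A sketch of the version-skip idea, assuming the test can obtain a rest.Config for the target cluster (the helper name and wiring are illustrative, not the actual e2e framework code):

```go
package e2e

import (
	"testing"

	"k8s.io/apimachinery/pkg/util/version"
	"k8s.io/client-go/discovery"
	"k8s.io/client-go/rest"
)

// skipIfPre112 skips tests relying on WaitForFirstConsumer volume binding,
// which only works with dynamic provisioning from Kubernetes 1.12 onwards.
func skipIfPre112(t *testing.T, cfg *rest.Config) {
	dc, err := discovery.NewDiscoveryClientForConfig(cfg)
	if err != nil {
		t.Fatal(err)
	}
	serverVersion, err := dc.ServerVersion()
	if err != nil {
		t.Fatal(err)
	}
	v, err := version.ParseGeneric(serverVersion.GitVersion)
	if err != nil {
		t.Fatal(err)
	}
	if v.LessThan(version.MustParseGeneric("1.12.0")) {
		t.Skipf("requires Kubernetes >= 1.12, got %s", serverVersion.GitVersion)
	}
}
```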

@sebgl (Contributor) commented Jun 21, 2019

I think we should take into consideration the pod name label when retrieving several PVCs for a single pod.
Use case: I have 2 PVCs for my pod, one for data, another one for logs. When these PVCs get reused for another pod, I want to keep logs and data tied together. It would be confusing to have the logs volume of one pod mounted along with the data volume of another pod.
We don't care about the actual pod name, but we do care that all volumes we mount for a pod carry the same pod name?
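A sketch of that idea (hypothetical label key, not the operator's exact code): group reusable PVCs by the pod they originally belonged to, so a replacement pod picks up a consistent set.

```go
package pvc

import corev1 "k8s.io/api/core/v1"

// podNameLabel is a hypothetical label key recording which pod a PVC was
// originally created for.
const podNameLabel = "elasticsearch.k8s.elastic.co/pod-name"

// groupByOriginalPod groups candidate PVCs by their original pod, so that
// e.g. the data and logs volumes of one former pod are reused together
// instead of being mixed with volumes from another pod.
func groupByOriginalPod(pvcs []corev1.PersistentVolumeClaim) map[string][]corev1.PersistentVolumeClaim {
	byPod := map[string][]corev1.PersistentVolumeClaim{}
	for _, pvc := range pvcs {
		podName := pvc.Labels[podNameLabel]
		byPod[podName] = append(byPod[podName], pvc)
	}
	return byPod
}
```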

@pebrc (Collaborator, Author) commented Jun 25, 2019

> I think we should take into consideration the pod name label when retrieving several PVCs for a single pod.

Do you think we should try to address this in this PR?

@sebgl (Contributor) commented Jun 25, 2019

> I think we should take into consideration the pod name label when retrieving several PVCs for a single pod.
>
> Do you think we should try to address this in this PR?

I'm fine with doing it in a follow-up PR, but I think #877 cannot be considered fixed until we do it (or another issue is needed).

@pebrc (Collaborator, Author) commented Jun 25, 2019

@sebgl I think I have a slight preference for doing another PR. I changed this PR's description so that it does not auto-close the issue.

@pebrc (Collaborator, Author) commented Jun 25, 2019

@sebgl can you take another look at this PR? I tried to address your feedback.

@pebrc (Collaborator, Author) commented Jun 25, 2019

@sebgl I removed the external definition of a provider-specific storage class and the corresponding flag. Instead I am using the existing default storage class as a template to create a derivative with late volume binding, as discussed. 👍 for the idea.
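Roughly, the approach looks like this; a sketch using a current client-go signature, where the default-class annotation lookup and the function name are assumptions rather than the PR's exact code:

```go
package e2e

import (
	"context"

	storagev1 "k8s.io/api/storage/v1"
	apierrors "k8s.io/apimachinery/pkg/api/errors"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// createLateBindingCopyOfDefault copies the cluster's default storage class
// into a new class that binds volumes only once the consuming pod has been
// scheduled, keeping the e2e tests independent of any cloud provider.
func createLateBindingCopyOfDefault(ctx context.Context, c kubernetes.Interface, name string) error {
	classes, err := c.StorageV1().StorageClasses().List(ctx, metav1.ListOptions{})
	if err != nil {
		return err
	}
	for _, sc := range classes.Items {
		if sc.Annotations["storageclass.kubernetes.io/is-default-class"] != "true" {
			continue
		}
		derived := sc.DeepCopy()
		derived.ObjectMeta = metav1.ObjectMeta{Name: name} // drop UID, resourceVersion, etc.
		mode := storagev1.VolumeBindingWaitForFirstConsumer
		derived.VolumeBindingMode = &mode
		_, err = c.StorageV1().StorageClasses().Create(ctx, derived, metav1.CreateOptions{})
		if apierrors.IsAlreadyExists(err) {
			return nil
		}
		return err
	}
	return nil // no default storage class found; nothing to derive
}
```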

@sebgl (Contributor) left a comment

Thanks for making the changes 👍 it's nice to keep these tests independent of any cloud provider.
I left 2 minor comments, and one I think is very important (hence "changes requested"). Otherwise LGTM.

Review threads: operators/test/e2e/failure_test.go, operators/test/e2e/params/params.go
Inline comment on the following hunk:

}
for _, pod := range pods {
	if stringsutil.StringInSlice(pod.Name, survivingPodNames) {
		continue

A reviewer (Contributor) commented:

At this point, chances are the deleted pod is not back in the cluster yet, so we only iterate over pods we don't care about, continue on each one, and then return nil? Which skips the test entirely?
Should we run WithSteps(stack.CheckStackSteps(s, k)...) first, and then this test? It is also probably simpler to get the expected pod through its name directly instead of filtering out the pods we don't care about.
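A sketch of the second suggestion (illustrative helper, not the actual e2e code): fetch the recreated pod directly by name once the stack is healthy again, and return the PVC names it mounts so the test can assert they match the claims left behind by the deleted pod.

```go
package e2e

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// getRecreatedPodPVCs fetches the expected pod by name, failing (instead of
// silently succeeding) if it is not back in the cluster yet, and returns the
// names of the PVCs it mounts.
func getRecreatedPodPVCs(ctx context.Context, c kubernetes.Interface, namespace, podName string) ([]string, error) {
	pod, err := c.CoreV1().Pods(namespace).Get(ctx, podName, metav1.GetOptions{})
	if err != nil {
		return nil, err
	}
	var claims []string
	for _, vol := range pod.Spec.Volumes {
		if vol.PersistentVolumeClaim != nil {
			claims = append(claims, vol.PersistentVolumeClaim.ClaimName)
		}
	}
	return claims, nil
}
```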

pebrc (Collaborator, Author) replied:

This comment was pure gold. You should get a 🥇 for that. It surfaced that the actual fix was no longer working since I merged master into this branch 😞, which was hidden by this flaw in the test ...

@sebgl (Contributor) left a comment

LGTM

pebrc merged commit 83a0c81 into elastic:master on Jul 1, 2019
Labels: >bug (Something isn't working), v0.9.0
2 participants