Che fails on version 1.4.1 #1269

gorkem · 2017-08-16T03:16:57Z

Che has a service named che-host that has a route configured. When the service is trying to reach itself through the internal names it fails, public route works fine. When I ssh into the pod and try to curl to http://che-host it fails after a long wait where as http://localhost and with internal ip address works fine.

Also when I enable the logs I see lines like follow for every pod.
1610 docker_sandbox.go:263] Couldn't find network status for eclipse-che/che-3-deploy through plugin: invalid network status for

We also did a minishift ssh and compared /etc/resolv.conf with 1.3.1 and 1.4.1 only has a single nameserver entry where as 1.3.1 also includes and entry for search.

@amisevsk Anything else I am missing?

The text was updated successfully, but these errors were encountered:

gbraad · 2017-08-16T03:19:32Z

Interesting, I'll have a look. BTW, did you use the B2D or CentOS image?

gorkem · 2017-08-16T03:27:21Z

I seem to have b2d

gbraad · 2017-08-16T03:40:32Z

I did a quick run of the versions you mentioned, using B2D, but I never get an entry for 'search' in /etc/resolv.conf

Minishift v.1.4.1
- OpenShift v3.6 - nameserver 192.168.122.1
- OpenShift v1.5.1 - nameserver 192.168.122.1
Minishift v1.3.1
- OpenShift v1.5.1 - nameserver 192.168.122.1

With CentOS (v1.3.1 - OS v1.5.1)

# Generated by NetworkManager
nameserver 192.168.122.1
nameserver 192.168.42.1

With CentOS (v1.4.1 - OS v3.6.0)

# Generated by NetworkManager
nameserver 192.168.122.1
nameserver 192.168.42.1

Which is as expected...

gorkem · 2017-08-16T03:44:12Z

Hmmm. I do not have those entries anymore on my 1.3.1 either

amisevsk · 2017-08-16T05:34:04Z

I've managed to narrow this down somewhat.

Running minishift 1.4.1 and Openshift version

OpenShift Master: v3.6.0+c4dd4cf
Kubernetes Master: v1.6.1+5115d708d7

I am able to reproduce the issue. Running minishift 1.4.1 and OpenShift v1.5.1 it does not occur.

However, the issue is actually that pods cannot resolve their own service or service's clusterIP. Starting a second pod, I can curl che-host without issue, but from within che host I cannot. There are no real networking issues except that pods cannot access their own services -- localhost and external work fine.

After a bit of digging, I came across this section of kubernetes documentation that seems related.

@gbraad Is there a setting that has changed between 1.5.1 and 3.6? I don't see this issue on OpenShift Online, running

OpenShift Master: v3.6.173.0.5 (online version 3.5.0.20)
Kubernetes Master: v1.6.1+5115d708d7

gbraad · 2017-08-16T05:54:58Z

Is there a setting that has changed between 1.5.1 and 3.6?

I do not have enough visibility on this, but hopefully @csrwng or @bparees knows more about this. This could very well be related to how oc cluster up sets up the configuration. You could try this with a new VM and just running oc cluster up. This would exclude any of our configuration that happens.

csrwng · 2017-08-16T14:04:27Z

We've seen this issue in cluster up before -- see openshift/origin#12111
There were 2 different problems we found with code,

/sys/devices/virtual/net was not getting mounted into the origin container as read/write and the kubelet was not able to set the hairpin mode on the docker bridge
/var/lib/docker was hardcoded as the host's docker directory and in some machines that wasn't the real docker directory

If either of these is still the cause, one thing you can try to solve the issue is running:

ifconfig docker0 promisc

on the minishift vm

amisevsk · 2017-08-17T02:17:00Z

@csrwng It looks like your command solved the issue for me.

Regarding the problems you listed,

/sys/devices/virtual/net seems to be mounted RW in the origin container:

            {
                "Source": "/sys/devices/virtual/net",
                "Destination": "/sys/devices/virtual/net",
                "Mode": "rw",
                "RW": true,
                "Propagation": "rprivate"
            },

I'm not sure how to check, but /var/lib/docker is a symlink to /mnt/sda1/var/lib/docker, which seems to be correct.

We've figured out a workaround for our issue (using localhost instead of the service), but I would still like to know what's going wrong.

gbraad · 2017-08-17T05:58:42Z

I would still like to know what's going wrong.

So do I, as we might have to provide this as a known issue. Especially since the PR openshift/origin#12744 seems to have been available since Feb 1st. And therefore this should have been observed since v1.5.1?

csrwng · 2017-08-17T14:10:03Z

@gbraad if you tell minishift v1.4.1 to run origin version v3.6, does it obtain the v3.6 oc client to run 'oc cluster up'? or does it use the one bundled in the image? If the latter, what version of the 'oc' binary is included in the minishift v1.4.1 image?

praveenkumar · 2017-08-17T14:43:22Z

run origin version v3.6, does it obtain the v3.6 oc client to run 'oc cluster up'?

@csrwng Yes it obtain 3.6 oc client to deploy 3.6 cluster.

LalatenduMohanty · 2017-08-21T06:23:10Z

@csrwng I am confused. I guess Origin 3.6 oc binary has the fix i.e. openshift/origin#12744 right?

csrwng · 2017-08-21T12:37:56Z

Yes it should have the fix

praveenkumar · 2017-09-08T11:01:00Z

@gorkem can you try out it with minishift 1.5.0 without any workaround and let us know if it still fail because here we are using 3.6.0 as default openshift version.

coolbrg · 2017-09-13T09:20:28Z

Any update @gorkem ? Have you got chance to try minishift 1.5.0 ?

gorkem · 2017-09-13T14:19:20Z

I was able to use it with minishift 1.5.1 without any hacks.

coolbrg · 2017-09-13T14:48:24Z

Thanks, @gorkem for confirming. We are closing this issue now. If you facing anything please feel free to open the new issue.

gbraad added kind/bug priority/major status/needs-info labels Aug 16, 2017

amisevsk mentioned this issue Aug 17, 2017

Use localhost instead of che-host service name for checking KC redhat-developer/rh-che#270

Merged

gbraad mentioned this issue Aug 17, 2017

Issue #1195 - Add integration test for flags #1272

Closed

LalatenduMohanty added this to the v1.6.0 milestone Aug 30, 2017

coolbrg self-assigned this Sep 13, 2017

coolbrg closed this as completed Sep 13, 2017

coolbrg added resolution/out-of-date priority/major and removed priority/major status/needs-info labels Sep 13, 2017

garagatyi mentioned this issue Jan 12, 2018

Processes unavailable by service: port openshift/origin#17981

Closed

bbrowning mentioned this issue Jul 6, 2018

Please explain the issue with Minishift's networking projectodd/openwhisk-openshift#27

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Che fails on version 1.4.1 #1269

Che fails on version 1.4.1 #1269

gorkem commented Aug 16, 2017

gbraad commented Aug 16, 2017 via email

gorkem commented Aug 16, 2017

gbraad commented Aug 16, 2017 •

edited

Loading

gorkem commented Aug 16, 2017

amisevsk commented Aug 16, 2017

gbraad commented Aug 16, 2017

csrwng commented Aug 16, 2017

amisevsk commented Aug 17, 2017

gbraad commented Aug 17, 2017

csrwng commented Aug 17, 2017

praveenkumar commented Aug 17, 2017 •

edited by gbraad

Loading

LalatenduMohanty commented Aug 21, 2017 •

edited

Loading

csrwng commented Aug 21, 2017

praveenkumar commented Sep 8, 2017

coolbrg commented Sep 13, 2017 •

edited

Loading

gorkem commented Sep 13, 2017

coolbrg commented Sep 13, 2017

Che fails on version 1.4.1 #1269

Che fails on version 1.4.1 #1269

Comments

gorkem commented Aug 16, 2017

gbraad commented Aug 16, 2017 via email

gorkem commented Aug 16, 2017

gbraad commented Aug 16, 2017 • edited Loading

gorkem commented Aug 16, 2017

amisevsk commented Aug 16, 2017

gbraad commented Aug 16, 2017

csrwng commented Aug 16, 2017

amisevsk commented Aug 17, 2017

gbraad commented Aug 17, 2017

csrwng commented Aug 17, 2017

praveenkumar commented Aug 17, 2017 • edited by gbraad Loading

LalatenduMohanty commented Aug 21, 2017 • edited Loading

csrwng commented Aug 21, 2017

praveenkumar commented Sep 8, 2017

coolbrg commented Sep 13, 2017 • edited Loading

gorkem commented Sep 13, 2017

coolbrg commented Sep 13, 2017

gbraad commented Aug 16, 2017 •

edited

Loading

praveenkumar commented Aug 17, 2017 •

edited by gbraad

Loading

LalatenduMohanty commented Aug 21, 2017 •

edited

Loading

coolbrg commented Sep 13, 2017 •

edited

Loading