Preserve OVS data path flows when Antrea is restarted gracefully #355

antoninbas · 2020-02-03T21:14:32Z

Describe what you are trying to solve
At the moment, data path flows are deleted by ovs-vswitchd when the daemon gracefully exits. This means that for planned rolling upgrades of Antrea, we are currently bringing the network down. A simple experiment reveals that it can take more than 10s for the network to come back up:

64 bytes from 10.10.1.250: seq=59 ttl=64 time=0.178 ms
64 bytes from 10.10.1.250: seq=60 ttl=64 time=0.079 ms

64 bytes from 10.10.1.250: seq=72 ttl=64 time=0.294 ms
64 bytes from 10.10.1.250: seq=73 ttl=64 time=0.465 ms

That time may go up as the number of flows increase, and it may take even longer for network policies to be enforced again.

Describe the solution you have in mind
A patch was recently merged into OVS master to change this behavior: openvswitch/ovs@79eadaf. If we include that patch, we should be able to support hitless upgrades for Antrea.

Unfortunately Antrea currently uses OVS 2.11.0, and the patch is not even included in the latest OVS release (2.13.0). Until we can upgrade to a release > 2.13, we should consider applying the patch manually when we build the antrea/openvswitch Docker image.

Describe the main design/architecture of your solution

apply the patch when building antrea/openvswitch
nothing else is required. Flows will be preserved when ovs-vswitchd exits. If this is part of an upgrade and the flows installed by Antrea have changed, flows which have become stale will be deleted on restart (this is already implemented, by leveraging the round ID in the flow cookies).
as part of cleanup (Deleting Antrea should do a complete cleanup #181), we will take care of deleting the bridge, and therefore all the data path flows.

Alternative solutions that you considered
Have 2 different DaemonSets, one for antrea-agent and one for antrea-ovs, so that antrea-agent can be upgraded without disrupting the OVS daemons. It seems undesirable to have 2 DaemonSets for Antrea though, and we may also want to upgrade the Docker image for antrea-ovs from time to time (security vulnerabilities).

Test plan
I will add an e2e test which will send UDP packets using iperf between 2 Pods (or send ICMP packets with a small interval?). The test will verify that no packets have been lost after restarting Antrea.

The text was updated successfully, but these errors were encountered:

Without this patch, when the antrea-ovs container exits gracefully (e.g. as part of a RollingUpdate for the DaemonSet), flows are deleted, which breaks connectivity (potentially for > 10s). A recent OVS patch changed the default behavior of ovs-vswitchd so that data path flows are no longer deleted on exit. Because this patch is not yet available in any released OVS version (and won't be for > 6 months), we apply it manually when we build our base OVS Docker image. With this support, it seems that the Antrea agent logic which is in charge of replaying the flows after each reconnection from the OpenFlow client to OVS, is no longer needed. While it doesn't hurt to keep it, I cannot think of a scenario where it is useful. I suggest we open a separate issue to keep track of removing that code. Fixes antrea-io#355

Without this patch, when the antrea-ovs container exits gracefully (e.g. as part of a RollingUpdate for the DaemonSet), datapath flows are deleted, which breaks existing connections. It takes 10 seconds for exiting connections to start working again. A recent OVS patch changed the default behavior of ovs-vswitchd so that datapath flows are no longer deleted on exit. Because this patch is not yet available in any released OVS version (and won't be for > 6 months), we apply it manually when we build our base OVS Docker image. Fixes antrea-io#355

antoninbas · 2020-02-05T20:54:01Z

To clarify: the changes in #358 only apply to kernel datapath flows. New connections that don't match any flow in the datapath will not be established until ovs-vswitchd has restarted and Antrea has replayed the flows. If kube-proxy selects an endpoint on a Node that's affected for a ClusterIP, the connection won't be established until then (or until there is a TCP SYN timeout and the app tries opening a new connection).

Without this patch, when the antrea-ovs container exits gracefully (e.g. as part of a RollingUpdate for the DaemonSet), datapath flows are deleted, which breaks existing connections. It takes 10 seconds for exiting connections to start working again. A recent OVS patch changed the default behavior of ovs-vswitchd so that datapath flows are no longer deleted on exit. Because this patch is not yet available in any released OVS version (and won't be for > 6 months), we apply it manually when we build our base OVS Docker image. Fixes antrea-io#355 Disable new e2e test for Kind When using the netdev datapath, packet forwarding is done by the ovs-vswitchd daemon itself, so even existing connections are broken.

Without this patch, when the antrea-ovs container exits gracefully (e.g. as part of a RollingUpdate for the DaemonSet), datapath flows are deleted, which breaks existing connections. It takes 10 seconds for exiting connections to start working again. A recent OVS patch changed the default behavior of ovs-vswitchd so that datapath flows are no longer deleted on exit. Because this patch is not yet available in any released OVS version (and won't be for > 6 months), we apply it manually when we build our base OVS Docker image. This is a step towards non-disruptive upgrade. However, there are other factors in play: 1) when the OVS daemons are stopped, the VXLAN device (vxlan_sys_4789) goes away, which impacts connectivity between Nodes, and 2) when ovs-vswitchd restarts, datapath flows are flushed and we have to wait until the Antrea Agent has replayed the flows. * Disable new e2e test for Kind When using the netdev datapath, packet forwarding is done by the ovs-vswitchd daemon itself, so even existing connections are broken. * Use arping instead of ping for test When using ping the test can be flaky: if one of the hosts need to send an ARP request to refresh its ARP entry while the test is ongoing, odds are that there is no cached flow in the datapath. If this happens, the host will mark the ARP entry has "incomplete" and the ping will start failing. Fixes antrea-io#355

Without this patch, when the antrea-ovs container exits gracefully (e.g. as part of a RollingUpdate for the DaemonSet), datapath flows are deleted, which breaks existing connections. It takes 10 seconds for exiting connections to start working again. A recent OVS patch changed the default behavior of ovs-vswitchd so that datapath flows are no longer deleted on exit. Because this patch is not yet available in any released OVS version (and won't be for > 6 months), we apply it manually when we build our base OVS Docker image. This is a step towards non-disruptive upgrade. However, there are other factors in play: 1) when the OVS daemons are stopped, the VXLAN device (vxlan_sys_4789) goes away, which impacts connectivity between Nodes, and 2) when ovs-vswitchd restarts, datapath flows are flushed and we have to wait until the Antrea Agent has replayed the flows. * Disable new e2e test for Kind When using the netdev datapath, packet forwarding is done by the ovs-vswitchd daemon itself, so even existing connections are broken. * Use arping instead of ping for test When using ping the test can be flaky: if one of the hosts need to send an ARP request to refresh its ARP entry while the test is ongoing, odds are that there is no cached flow in the datapath. If this happens, the host will mark the ARP entry has "incomplete" and the ping will start failing. Fixes #355

antoninbas added the proposal A concrete proposal for adding a feature label Feb 3, 2020

antoninbas self-assigned this Feb 3, 2020

antoninbas added this to the Antrea v0.4.0 release milestone Feb 3, 2020

antoninbas mentioned this issue Feb 4, 2020

Preserve OVS datapath flows after an agent restart #358

Merged

antoninbas closed this as completed in #358 Feb 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preserve OVS data path flows when Antrea is restarted gracefully #355

Preserve OVS data path flows when Antrea is restarted gracefully #355

antoninbas commented Feb 3, 2020

antoninbas commented Feb 5, 2020

Preserve OVS data path flows when Antrea is restarted gracefully #355

Preserve OVS data path flows when Antrea is restarted gracefully #355

Comments

antoninbas commented Feb 3, 2020

antoninbas commented Feb 5, 2020