← Back to team overview

yahoo-eng-team team mailing list archive

[Bug 1794809] [NEW] Gateway ports are down after reboot of control plane nodes

 

Public bug reported:

Sometimes when control plane nodes are going down and then up it may happen that for L3 HA routers, failover of active router will happen and in such case if L3 agent will be running before openvswitch agent on host, gateway port may be in "binding failed" state on new MASTER agent.
That will cause no connectivity to floating IPs on this router.

I tested this on Queens but it seems that there wasn't any changes in
this since Queens.

One possible solution might be to trigger another bind attempt for all
ports which are binding_failed on host when L2 agent from this host is
revived. I will investigate if that would work.

** Affects: neutron
     Importance: Medium
     Assignee: Slawek Kaplonski (slaweq)
         Status: Confirmed


** Tags: l3-ha

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1794809

Title:
  Gateway ports are down after reboot of control plane nodes

Status in neutron:
  Confirmed

Bug description:
  Sometimes when control plane nodes are going down and then up it may happen that for L3 HA routers, failover of active router will happen and in such case if L3 agent will be running before openvswitch agent on host, gateway port may be in "binding failed" state on new MASTER agent.
  That will cause no connectivity to floating IPs on this router.

  I tested this on Queens but it seems that there wasn't any changes in
  this since Queens.

  One possible solution might be to trigger another bind attempt for all
  ports which are binding_failed on host when L2 agent from this host is
  revived. I will investigate if that would work.

To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1794809/+subscriptions


Follow ups