← Back to team overview

yahoo-eng-team team mailing list archive

[Bug 1851609] [NEW] Add an option for graceful l3 agent shutdown

 

Public bug reported:

If KillMode in systemd config of a neutron l3 agent service is set to
'process' - it will not kill child processes on main service stop - this
is useful when we don't want data-plane downtime on agent stop/restart
due to keepalived exit.

However in some cases graceful cleanup on l3 agent shutdown is needed -
like with containerised control plane, when kubernetes kills l3-agent
pod, it automatically kills its children (keepalived processes) in non-
graceful way, so that keepalived does not clear VIPs. This leads to a
situation when same VIP is present on different nodes and hence to long
downtime.

The proposal is to add a new l3 agent config so that it handles stop
(SIGTERM) by deleting all routers. For HA routers it results in graceful
keepalived shutdown.

** Affects: neutron
     Importance: Medium
     Assignee: Oleg Bondarev (obondarev)
         Status: New


** Tags: l3-ha l3-ipam-dhcp

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1851609

Title:
  Add an option for graceful l3 agent shutdown

Status in neutron:
  New

Bug description:
  If KillMode in systemd config of a neutron l3 agent service is set to
  'process' - it will not kill child processes on main service stop -
  this is useful when we don't want data-plane downtime on agent
  stop/restart due to keepalived exit.

  However in some cases graceful cleanup on l3 agent shutdown is needed
  - like with containerised control plane, when kubernetes kills
  l3-agent pod, it automatically kills its children (keepalived
  processes) in non-graceful way, so that keepalived does not clear
  VIPs. This leads to a situation when same VIP is present on different
  nodes and hence to long downtime.

  The proposal is to add a new l3 agent config so that it handles stop
  (SIGTERM) by deleting all routers. For HA routers it results in
  graceful keepalived shutdown.

To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1851609/+subscriptions


Follow ups