yahoo-eng-team team mailing list archive
Message #69099
[Bug 1731595] [NEW] L3 HA: multiple agents are active at the same time
Public bug reported:
OS: Xenial, Ocata from Ubuntu Cloud Archive
We have three neutron-gateway hosts, with L3 HA enabled and a minimum of 2 and a maximum of 3 agents per router. There are approximately 400 routers defined.
At some point (we weren't monitoring closely enough to say exactly when)
a number of the routers changed from having one active agent and one or
more standby agents to having more than one active agent. Each of the
'active' namespaces had the same IP addresses allocated, which caused
traffic problems reaching instances.
Removing the routers from all but one agent, and re-adding, resolved the
issue. Restarting one l3 agent also appeared to resolve the issue, but
very slowly, to the point where we needed the system alive again faster
and reverted to removing/re-adding.
At the same time, a number of routers were listed without any active
agents at all. This situation appears to have been resolved by adding
the routers to agents, after several minutes of downtime.
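
For anyone trying to spot the two failure modes described above (more
than one active agent, or no active agent at all), a minimal sketch
follows. It assumes the per-agent HA state of each router has already
been collected into a dict, for example from the ha_state column of the
agent listing for each router. The function name, the dict layout and
the sample data are hypothetical and are not taken from the affected
deployment.

```python
def classify_ha_routers(states_by_router):
    """Group routers by how many hosting agents report the 'active' state.

    states_by_router maps router id -> {agent host: ha_state string}.
    This layout is an assumption made for illustration.
    """
    result = {"ok": [], "multiple_active": [], "no_active": []}
    for router_id, agent_states in sorted(states_by_router.items()):
        active = sum(1 for state in agent_states.values() if state == "active")
        if active == 1:
            result["ok"].append(router_id)
        elif active > 1:
            result["multiple_active"].append(router_id)
        else:
            result["no_active"].append(router_id)
    return result


# Hypothetical sample data, not real output from the reported system.
sample = {
    "router-a": {"gw1": "active", "gw2": "standby", "gw3": "standby"},
    "router-b": {"gw1": "active", "gw2": "active", "gw3": "standby"},
    "router-c": {"gw1": "standby", "gw2": "standby"},
}
report = classify_ha_routers(sample)
print(report)
```

Running a check like this periodically would at least record when
routers move from one category to another, which we were not able to
determine at the time.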
I'm finding it very difficult to find relevant keepalived messages to
indicate what's going on, but what I do notice is that all the agents
have equal priority and are configured as 'backup'.
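
To collect the state and priority settings mentioned above across many
routers, something like the following sketch could be used. It reads
the text of a keepalived configuration file and pulls the state and
priority out of each vrrp_instance block. The function name and the
sample configuration are illustrative assumptions; the sample is not
copied from the affected hosts, and the parser only handles the simple
one-keyword-per-line layout.

```python
import re


def vrrp_settings(config_text):
    """Return {instance name: {"state": str or None, "priority": int or None}}."""
    settings = {}
    current = None
    for raw in config_text.splitlines():
        line = raw.strip()
        match = re.match(r"vrrp_instance\s+(\S+)", line)
        if match:
            # Start of a new vrrp_instance block.
            current = match.group(1)
            settings[current] = {"state": None, "priority": None}
            continue
        if current is None:
            continue
        match = re.match(r"state\s+(\S+)", line)
        if match:
            settings[current]["state"] = match.group(1)
            continue
        match = re.match(r"priority\s+(\d+)", line)
        if match:
            settings[current]["priority"] = int(match.group(1))
    return settings


# Illustrative configuration only; values are assumptions.
sample_conf = """
vrrp_instance VR_1 {
    state BACKUP
    interface ha-example
    virtual_router_id 1
    priority 50
    nopreempt
}
"""
parsed = vrrp_settings(sample_conf)
print(parsed)
```

Comparing the parsed values between gateway hosts for the same router
would confirm whether every instance really is configured identically.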
I am trying to find a way to reproduce this; it might be that we need
to have a large number of routers configured on a small number of
gateways.
** Affects: neutron
Importance: Undecided
Status: New
** Tags: canonical-bootstack
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1731595
To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1731595/+subscriptions
Follow ups
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2018-07-03
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2018-07-03
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2018-07-03
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2018-02-05
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Launchpad Bug Tracker, 2018-02-05
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2018-02-01
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2018-02-01
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2018-02-01
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2017-12-14
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Launchpad Bug Tracker, 2017-12-14
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2017-12-12
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2017-12-11
-
[Bug 1731595] Please test proposed package
From: James Page, 2017-12-05
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2017-12-01
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: OpenStack Infra, 2017-12-01
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: Corey Bryant, 2017-11-30
-
[Bug 1731595] Re: L3 HA: multiple agents are active at the same time
From: James Page, 2017-11-27