yahoo-eng-team team mailing list archive
-
yahoo-eng-team team
-
Mailing list archive
-
Message #26465
[Bug 1385234] Re: OVS tunneling between multiple neutron nodes misconfigured if amqp is restarted
** Changed in: tripleo
Status: Fix Committed => Fix Released
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1385234
Title:
OVS tunneling between multiple neutron nodes misconfigured if amqp is
restarted
Status in OpenStack Neutron (virtual network service):
Confirmed
Status in Messaging API for OpenStack:
Incomplete
Status in tripleo - openstack on openstack:
Fix Released
Bug description:
At completion of a deployment with multiple controllers, by observing
the gre tunnels created in OVS by the neutron ovs-agent, one will find
that some neutron nodes may miss the tunnels in between them or to the
computes.
This is due to ovs-agents getting disconnected from the rabbit cluster
without them noticing and as a result, being unable to receive updates
from other nodes or publish updates.
The disconnection may happen following a reconfig of a rabbit node,
the VIP moving over a different node when rabbit is load balanced, or
even _during_ tripleo overcloud deployment due to rabbit cluster
configuration changes.
This was observed using Kombu 3.0.33 as well as 2.5.
Use of some aggressive (low) kernel keepalive probes interval seems to
improve the reliability but a more appropriate fix seems to be support
for heartbeat in oslo.messaging
To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1385234/+subscriptions
References