yahoo-eng-team team mailing list archive
Message #54958
[Bug 1611612] Re: linuxbridge and dhcp agents race removing tap
Reviewed: https://review.openstack.org/353264
Committed: https://git.openstack.org/cgit/openstack/neutron/commit/?id=72720f9aa30169809e41e6dfbafc4e3561716ea5
Submitter: Jenkins
Branch: master
commit 72720f9aa30169809e41e6dfbafc4e3561716ea5
Author: Darragh O'Reilly <darragh.oreilly@xxxxxxx>
Date: Wed Aug 10 05:58:50 2016 +0000
lb-agent: handle exception when bridge slave already removed
An exception can happen when a network is deleted because the
lb-agent tries to remove the dhcp tap from the bridge at about
the same time as the dhcp-agent is deleting the tap. The unhandled
exception means the bridge does not get deleted and an error is logged.
Closes-Bug: #1611612
Change-Id: Ia9a6b5fc49e239769e850e9486454e81e3a4b96f
** Changed in: neutron
Status: In Progress => Fix Released
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1611612
Title:
linuxbridge and dhcp agents race removing tap
Status in neutron:
Fix Released
Bug description:
When a network is deleted, an exception can happen because the
lb-agent tries to remove the dhcp tap from the bridge at about the same
time as the dhcp-agent is deleting the tap. The unhandled exception
results in the bridge not getting cleaned up and an error and
stack trace in the logs.
http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%20%5C%22self.remove_interface%5C%22
Traceback (most recent call last):
  File "/usr/local/lib/python2.7/dist-packages/oslo_messaging/rpc/server.py", line 133, in _process_incoming
    res = self.dispatcher.dispatch(message)
  File "/usr/local/lib/python2.7/dist-packages/oslo_messaging/rpc/dispatcher.py", line 150, in dispatch
    return self._do_dispatch(endpoint, method, ctxt, args)
  File "/usr/local/lib/python2.7/dist-packages/oslo_messaging/rpc/dispatcher.py", line 121, in _do_dispatch
    result = func(ctxt, **new_args)
  File "/opt/stack/new/neutron/neutron/plugins/ml2/drivers/linuxbridge/agent/linuxbridge_neutron_agent.py", line 803, in network_delete
    self.agent.mgr.delete_bridge(bridge_name)
  File "/opt/stack/new/neutron/neutron/plugins/ml2/drivers/linuxbridge/agent/linuxbridge_neutron_agent.py", line 521, in delete_bridge
    self.remove_interface(bridge_name, interface)
  File "/opt/stack/new/neutron/neutron/plugins/ml2/drivers/linuxbridge/agent/linuxbridge_neutron_agent.py", line 568, in remove_interface
    if bridge_device.delif(interface_name):
  File "/opt/stack/new/neutron/neutron/agent/linux/bridge_lib.py", line 80, in delif
    return self._brctl(['delif', self.name, interface])
  File "/opt/stack/new/neutron/neutron/agent/linux/bridge_lib.py", line 55, in _brctl
    return ip_wrapper.netns.execute(cmd, run_as_root=True)
  File "/opt/stack/new/neutron/neutron/agent/linux/ip_lib.py", line 876, in execute
    log_fail_as_error=log_fail_as_error, **kwargs)
  File "/opt/stack/new/neutron/neutron/agent/linux/utils.py", line 138, in execute
    raise RuntimeError(msg)
RuntimeError: Exit code: 1; Stdin: ; Stdout: ; Stderr: device tap1aa0d45a-39 is not a slave of brq6d449049-5c
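
The traceback above shows delif failing because the tap had already left the bridge. As a rough illustration of one way to tolerate that race, the sketch below catches the RuntimeError and re-raises only if the interface is still attached. This is not the actual neutron patch: FakeBridge, its vanishing argument, and these standalone remove_interface and delete_bridge functions are hypothetical stand-ins written for this example, and no real brctl call is made.

```python
class FakeBridge:
    """Hypothetical in-memory stand-in for a Linux bridge (not neutron code)."""

    def __init__(self, name, interfaces, vanishing=()):
        self.name = name
        self.interfaces = set(interfaces)
        # Interfaces that "another agent" deletes just before our delif runs.
        self.vanishing = set(vanishing)

    def owns_interface(self, interface):
        return interface in self.interfaces

    def delif(self, interface):
        # Simulate the race: the other agent removes the tap first.
        if interface in self.vanishing:
            self.interfaces.discard(interface)
        if interface not in self.interfaces:
            # Mimics the error text seen in the traceback.
            raise RuntimeError(
                "device %s is not a slave of %s" % (interface, self.name))
        self.interfaces.remove(interface)
        return True


def remove_interface(bridge, interface):
    """Detach an interface, treating 'already detached' as success."""
    try:
        bridge.delif(interface)
    except RuntimeError:
        if bridge.owns_interface(interface):
            # Still attached, so this is a genuine failure.
            raise
        # Otherwise someone else removed it first; nothing left to do.
    return True


def delete_bridge(bridge):
    """Detach every interface; return True once the bridge is empty."""
    for interface in sorted(bridge.interfaces):  # sorted() copies the set
        remove_interface(bridge, interface)
    return not bridge.interfaces


bridge = FakeBridge("brq6d449049-5c",
                    ["tap1aa0d45a-39", "eth0"],
                    vanishing=["tap1aa0d45a-39"])
bridge_is_empty = delete_bridge(bridge)
```

With this handling, the loop in delete_bridge continues past the vanished tap instead of aborting, so the remaining interfaces are still detached and the bridge cleanup can finish.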
To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1611612/+subscriptions