yahoo-eng-team team mailing list archive
-
yahoo-eng-team team
-
Mailing list archive
-
Message #25008
[Bug 1334447] Re: Potential race between neutron port update and node pxe boot
** Also affects: neutron
Importance: Undecided
Status: New
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1334447
Title:
Potential race between neutron port update and node pxe boot
Status in OpenStack Bare Metal Provisioning Service (Ironic):
Triaged
Status in OpenStack Neutron (virtual network service):
New
Bug description:
Splitting off a new bug from Bug #1300589 to track progress on one
specific issue.
There is a potential race condition in the pxe driver where it will
update neutron ports with DHCP data for a soon-to-be-booted ironic
node. This data sets, among other things, the tftp next server
address. The update is asynchronous on the Neutron side. After the
update request is sent, the node is immediately powered on. If using
the ssh power driver with fast booting virtual machines, there is a
potential race where the node attempts pxe boot before the neutron
agents have processed the updates and reconfigured DHCP servers
appropriately. Copying from the original bug, Robert describes the
specific issues in the driver:
This is the problem:
_create_token_file(task, node)
_update_neutron(task, node)
manager_utils.node_set_boot_device(task, node, 'pxe', persistent=True)
manager_utils.node_power_action(task, node, states.REBOOT)
There's no synchronisation with neutron (neither poll nor call-back)
to know that (all) the dnsmasq processes serving that port have been
updated - and a VM that boots quickly may DHCP before the dnsmasq is
hupped.
To manage notifications about this bug go to:
https://bugs.launchpad.net/ironic/+bug/1334447/+subscriptions