yahoo-eng-team team mailing list archive
-
yahoo-eng-team team
-
Mailing list archive
-
Message #60059
[Bug 1430999] Re: get_devices_details_list times out in a scaled up environment
[Expired for neutron because there has been no activity for 60 days.]
** Changed in: neutron
Status: Incomplete => Expired
--
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1430999
Title:
get_devices_details_list times out in a scaled up environment
Status in neutron:
Expired
Bug description:
The rpc call get_devices_details_list made from within the
neutron_openvswitch_agent can timeout as an environment scales up and
the port list grows. This prevents the neutron_openvswitch_agent from
starting successfully. Increasing the rpc_response_timeout is one
solution but doesn't seem to be ideal. Another solution is to
repeatedly call get_devices_details_list with a fixed subset size of
devices. The subset size could be set so that the rpc call always
completes within the default rpc_response_timeout and a timeout
failure would be independent of scale.
When update_device_list(devices_up=[600 ports]) called by agent, it is getting timeout. I see response comping for this RPC after 2 minutes(Default RPC timeout is 1 minute).
As there is timeout, agent is again trying to process these 600 ports.
As agent is busy processing these ports all time, it is ignoring wiring up of new port added by nova and hence nova is not receiving network-vif-plugged notification, and failing building of instance with "Timeout waiting for vif plugging callback for instance".
To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1430999/+subscriptions
References