← Back to team overview

yahoo-eng-team team mailing list archive

[Bug 1955503] [NEW] [ovn] OVN agents showing as dead until neutron services restarted

 

Public bug reported:

My apologies if this is already a resolved issue; I couldn't readily
find an existing bug but I recognize my software versions are somewhat
behind here.

High level description: Had an issue today where "openstack network
agent list" was frequently showing all OVN agents as offline.  I root-
caused this to 2 of the neutron-servers consistently returning
alive=false for all OVN network agents while 1 of the neutron servers
consistently returned alive=true.  Upon restarting neutron (pause/resume
via neutron-api charm action), the affected neutron servers started
returning alive=true.

Workaround: Restarting neutron services appears to resolve the issue;
"openstack network agent list" now consistently shows all OVN agents as
alive.

Relevant software versions in use:
* OpenStack series: Ussuri
* Neutron version: 16.4.0 (e.g. neutron-common package at 2:16.4.0-0ubuntu3~cloud0)
* Charm versions:
  * neutron-api: cs:neutron-api-288
  * neutron-api-plugin-ovn: cs:neutron-api-plugin-ovn-1

Perceived severity: Not a blocker since there's a workaround, but when
it occurs, it causes very scary looking alerts in Nagios due to all of
OVN appearing offline.

My apologies for this being perhaps somewhat scarce on details; I need
to jump to debug another issue, but wanted to ensure at least something
is filed here.  Thank you.

** Affects: neutron
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1955503

Title:
  [ovn] OVN agents showing as dead until neutron services restarted

Status in neutron:
  New

Bug description:
  My apologies if this is already a resolved issue; I couldn't readily
  find an existing bug but I recognize my software versions are somewhat
  behind here.

  High level description: Had an issue today where "openstack network
  agent list" was frequently showing all OVN agents as offline.  I root-
  caused this to 2 of the neutron-servers consistently returning
  alive=false for all OVN network agents while 1 of the neutron servers
  consistently returned alive=true.  Upon restarting neutron
  (pause/resume via neutron-api charm action), the affected neutron
  servers started returning alive=true.

  Workaround: Restarting neutron services appears to resolve the issue;
  "openstack network agent list" now consistently shows all OVN agents
  as alive.

  Relevant software versions in use:
  * OpenStack series: Ussuri
  * Neutron version: 16.4.0 (e.g. neutron-common package at 2:16.4.0-0ubuntu3~cloud0)
  * Charm versions:
    * neutron-api: cs:neutron-api-288
    * neutron-api-plugin-ovn: cs:neutron-api-plugin-ovn-1

  Perceived severity: Not a blocker since there's a workaround, but when
  it occurs, it causes very scary looking alerts in Nagios due to all of
  OVN appearing offline.

  My apologies for this being perhaps somewhat scarce on details; I need
  to jump to debug another issue, but wanted to ensure at least
  something is filed here.  Thank you.

To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1955503/+subscriptions



Follow ups