
yahoo-eng-team team mailing list archive

[Bug 1918405] Re: new ovn-metadata-agent created when existing one dies

 

This isn't really a neutron issue. I opened a devstack bug for it:
https://bugs.launchpad.net/devstack/+bug/1918656

** Changed in: neutron
       Status: New => Incomplete

** Changed in: neutron
       Status: Incomplete => Invalid

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1918405

Title:
  new ovn-metadata-agent created when existing one dies

Status in neutron:
  Invalid

Bug description:
  This bug reproduces during the devstack-tobiko-gate-ovn job, but not
  on every run.


  1) Create resources
  https://1002eb5dcbc9d30cd274-231091edc31f40b3d022f1a19bae7260.ssl.cf1.rackcdn.com/776837/2/check/devstack-tobiko-gate-ovn/27904c1/tobiko_results_03_create_resources_scenario.html
  tobiko/tests/scenario/neutron/test_agents.py::NeutronAgentTest::test_agents_are_alive - PASS - All 2 Neutron agents are alive

  2) Faults
  https://1002eb5dcbc9d30cd274-231091edc31f40b3d022f1a19bae7260.ssl.cf1.rackcdn.com/776837/2/check/devstack-tobiko-gate-ovn/27904c1/tobiko_results_04_faults_faults.html
  tobiko/tests/faults/neutron/test_agents.py::OvnControllerTest::test_restart_ovn_controller - FAIL
  a) Stop ovn-controller: Service 'ovn-controller' stopped on host 'ubuntu-focal-vexxhost-ca-ymq-1-0023280630'
  b) Start ovn-controller: Starting service 'ovn-controller' on host 'ubuntu-focal-vexxhost-ca-ymq-1-0023280630'.
  c) At this point, ovn-metadata-agent is not alive (I don't understand why ovn-metadata-agent dies when ovn-controller is restarted):
          "alive": true,
          "availability_zone": "",
          "binary": "ovn-controller",
  ...
          "alive": false,
          "availability_zone": "",
          "binary": "neutron-ovn-metadata-agent",
  d) At some point, a new ovn-metadata-agent is created, so one metadata agent is alive and the other one is not (this clearly looks like a bug to me):
          "alive": true,
          "availability_zone": "",
          "binary": "ovn-controller",
  ...
          "alive": false,
          "availability_zone": "",
          "binary": "neutron-ovn-metadata-agent",
  ...
          "alive": true,
          "availability_zone": "",
          "binary": "neutron-ovn-metadata-agent",
  The test fails because of the dead metadata agent.
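
The state in step d) can be detected mechanically. The following is only a minimal sketch, not code from tobiko or neutron: the function name `find_stale_duplicates` is hypothetical, the agent dictionaries contain just the three fields quoted above, and the optional `host` key is an assumption about what a full agent listing would carry.

```python
from collections import defaultdict


def find_stale_duplicates(agents):
    """Return dead agents that have an alive sibling with the same binary.

    Hypothetical helper: agents are grouped by "binary" and, when present,
    by "host" (an assumed key, not shown in the report above).
    """
    groups = defaultdict(list)
    for agent in agents:
        key = (agent["binary"], agent.get("host"))
        groups[key].append(agent)

    stale = []
    for members in groups.values():
        # A dead agent counts as a stale duplicate only if another agent
        # of the same kind is alive.
        if any(a["alive"] for a in members):
            stale.extend(a for a in members if not a["alive"])
    return stale


# Agent list as reported in step d), reduced to the quoted fields.
agents = [
    {"alive": True, "availability_zone": "", "binary": "ovn-controller"},
    {"alive": False, "availability_zone": "", "binary": "neutron-ovn-metadata-agent"},
    {"alive": True, "availability_zone": "", "binary": "neutron-ovn-metadata-agent"},
]

stale = find_stale_duplicates(agents)
for agent in stale:
    print("stale duplicate:", agent["binary"])
```

With the step c) list (one dead metadata agent and no alive one) the helper returns nothing, which separates "agent is down" from "agent was re-registered and the old record was left behind".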

  3) Verify resources
  https://1002eb5dcbc9d30cd274-231091edc31f40b3d022f1a19bae7260.ssl.cf1.rackcdn.com/776837/2/check/devstack-tobiko-gate-ovn/27904c1/tobiko_results_05_verify_resources_scenario.html
  tobiko/tests/scenario/neutron/test_agents.py::NeutronAgentTest::test_agents_are_alive - FAIL - for the same reason: there are two ovn-metadata-agents and one of them is dead.

To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1918405/+subscriptions

