← Back to team overview

yahoo-eng-team team mailing list archive

[Bug 1603268] Re: Unstable grenade multinode job

 

Seems like grafana dashboard shows that the issue went away, and the job
is now on par with other multinode jobs in terms of rate of failures in
check queue. Closing the bug.

** Changed in: neutron
       Status: In Progress => Fix Released

-- 
You received this bug notification because you are a member of Yahoo!
Engineering Team, which is subscribed to neutron.
https://bugs.launchpad.net/bugs/1603268

Title:
  Unstable grenade multinode job

Status in neutron:
  Fix Released

Bug description:
  Grafana is showing the gate-grenade-dsvm-neutron-multinode being
  unstable since July 12th [1,2]. The issue seems related to Grenade
  verify phase getting stalled [3]. Probably MTU related as the SSH
  times out.

  2016-07-09 10:16:00.716 | ++ /opt/stack/new/grenade/projects/70_cinder/resources.sh:verify_noapi:177 :   resource_get cinder cinder_server_ip
  2016-07-09 10:16:00.717 | ++ /opt/stack/new/grenade/inc/plugin:resource_get:137 :   local project=cinder
  2016-07-09 10:16:00.718 | ++ /opt/stack/new/grenade/inc/plugin:resource_get:138 :   local key=cinder_server_ip
  2016-07-09 10:16:00.720 | +++ /opt/stack/new/grenade/inc/plugin:resource_get:139 :   iniget /opt/stack/save/grenade_db.ini cinder cinder_server_ip
  2016-07-09 10:16:00.733 | ++ /opt/stack/new/grenade/inc/plugin:resource_get:139 :   local value=172.24.5.51
  2016-07-09 10:16:00.734 | ++ /opt/stack/new/grenade/inc/plugin:resource_get:140 :   echo 172.24.5.51
  2016-07-09 10:16:00.737 | + /opt/stack/new/grenade/projects/70_cinder/resources.sh:verify_noapi:177 :   local server_ip=172.24.5.51
  2016-07-09 10:16:00.739 | + /opt/stack/new/grenade/projects/70_cinder/resources.sh:verify_noapi:178 :   ping_check_public 172.24.5.51 30
  2016-07-09 10:16:00.740 | + /opt/stack/new/grenade/functions:ping_check_public:56 :   local ip=172.24.5.51
  2016-07-09 10:16:00.741 | + /opt/stack/new/grenade/functions:ping_check_public:57 :   local boot_timeout=30
  2016-07-09 10:16:00.743 | + /opt/stack/new/grenade/functions:ping_check_public:58 :   local expected=True
  2016-07-09 10:16:00.745 | + /opt/stack/new/grenade/functions:ping_check_public:59 :   local check_command=
  2016-07-09 10:16:00.746 | + /opt/stack/new/grenade/functions:ping_check_public:60 :   [[ True = \T\r\u\e ]]
  2016-07-09 10:16:00.747 | + /opt/stack/new/grenade/functions:ping_check_public:61 :   check_command='while ! ping -c1 -w1 172.24.5.51; do sleep 1; done'
  2016-07-09 10:16:00.749 | + /opt/stack/new/grenade/functions:ping_check_public:65 :   timeout 30 sh -c 'while ! ping -c1 -w1 172.24.5.51; do sleep 1; done'
  2016-07-09 10:16:00.755 | PING 172.24.5.51 (172.24.5.51) 56(84) bytes of data.
  2016-07-09 10:16:00.755 | 64 bytes from 172.24.5.51: icmp_seq=1 ttl=63 time=2.81 ms
  2016-07-09 10:16:00.755 |
  2016-07-09 10:16:00.755 | --- 172.24.5.51 ping statistics ---
  2016-07-09 10:16:00.755 | 1 packets transmitted, 1 received, 0% packet loss, time 0ms
  2016-07-09 10:16:00.755 | rtt min/avg/max/mdev = 2.811/2.811/2.811/0.000 ms
  2016-07-09 10:16:00.757 | + /opt/stack/new/grenade/projects/70_cinder/resources.sh:verify_noapi:181 :   timeout 30 ssh -v -o ConnectTimeout=10 -o UserKnownHostsFile=/dev/null -o StrictHostKeyChecking=no -i /opt/stack/save/cinder_key.pem cirros@172.24.5.51 sync
  2016-07-09 10:16:00.761 | OpenSSH_6.6.1, OpenSSL 1.0.1f 6 Jan 2014
  2016-07-09 10:16:00.762 | debug1: Reading configuration data /etc/ssh/ssh_config
  2016-07-09 10:16:00.762 | debug1: /etc/ssh/ssh_config line 19: Applying options for *
  2016-07-09 10:16:00.762 | debug1: Connecting to 172.24.5.51 [172.24.5.51] port 22.
  2016-07-09 10:16:00.763 | debug1: fd 3 clearing O_NONBLOCK
  2016-07-09 10:16:00.763 | debug1: Connection established.
  2016-07-09 10:16:00.764 | debug1: identity file /opt/stack/save/cinder_key.pem type -1
  2016-07-09 10:16:00.764 | debug1: identity file /opt/stack/save/cinder_key.pem-cert type -1
  2016-07-09 10:16:00.764 | debug1: Enabling compatibility mode for protocol 2.0
  2016-07-09 10:16:00.764 | debug1: Local version string SSH-2.0-OpenSSH_6.6.1p1 Ubuntu-2ubuntu2.7
  2016-07-09 10:16:00.770 | debug1: Remote protocol version 2.0, remote software version dropbear_2012.55
  2016-07-09 10:16:00.770 | debug1: no match: dropbear_2012.55
  2016-07-09 10:16:00.770 | debug1: SSH2_MSG_KEXINIT sent
  2016-07-09 10:16:00.773 | debug1: SSH2_MSG_KEXINIT received
  2016-07-09 10:16:00.773 | debug1: kex: server->client aes128-ctr hmac-md5 none
  2016-07-09 10:16:00.773 | debug1: kex: client->server aes128-ctr hmac-md5 none
  2016-07-09 10:16:00.774 | debug1: sending SSH2_MSG_KEXDH_INIT

  ==============================
  2016-07-09 10:16:00.774 | debug1: expecting SSH2_MSG_KEXDH_REPLY
  2016-07-09 10:16:30.760 | + 
  /opt/stack/new/grenade/projects/70_cinder/resources.sh:verify_noapi:1 :   exit_trap
  ==============================

  
  2016-07-09 10:16:30.762 | + /opt/stack/new/grenade/functions:exit_trap:103 :   local r=124
  2016-07-09 10:16:30.763 | 1 verify_noapi /opt/stack/new/grenade/projects/70_cinder/resources.sh
  2016-07-09 10:16:30.763 | 173 verify /opt/stack/new/grenade/projects/70_cinder/resources.sh
  2016-07-09 10:16:30.763 | 224 main /opt/stack/new/grenade/projects/70_cinder/resources.sh
  2016-07-09 10:16:30.763 | Exit code: 124

  [1] http://grafana.openstack.org/dashboard/db/neutron-failure-rate?panelId=5&fullscreen
  [2] http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%2270_cinder%2Fresources.sh%3Averify_noapi%3A1%20%3A%20%20%20exit_trap%5C%22
  [3] http://logs.openstack.org/46/321146/2/gate/gate-grenade-dsvm-neutron-multinode/988f952/logs/grenade.sh.txt.gz#_2016-07-14_18_03_14_092

To manage notifications about this bug go to:
https://bugs.launchpad.net/neutron/+bug/1603268/+subscriptions


References