← Back to team overview

openstack team mailing list archive

Re: Rebooted, now can't ping my guest

 

On Tue, Mar 5, 2013 at 11:19 AM, Sylvain Bauza
<sylvain.bauza@xxxxxxxxxxxx>wrote:

>  You should be close to the solution. Looking at your GRE tunnels, I only
> see a one-to-one tunnel in between your compute node and your network node
> (provided your netnode is 10.10.10.1). Could you please confirm that your
> controller is either on the compute node or on the network node ?
>

My network node is 10.10.10.1.  My controller is an independent node, and
not on the 10.10.10.x network (following this
architecture<http://docs.openstack.org/folsom/basic-install/content/basic-install_requirements.html>although
I have changed the external subnets).


> One could suggest to restart nova-compute and check.
>

I tried, and it did nothing.

I think something else might be going on...I do seem to be getting the
correct ARP entries for 10.5.5.5 on my network node:

root@os-network:/var/log/quantum# arp -an
? (10.5.5.4) at <incomplete> on tap45ffdc5f-da
? (10.5.5.3) at <incomplete> on tap45ffdc5f-da
? (10.10.10.2) at 00:50:56:81:25:73 [ether] on eth1
? (192.168.0.1) at 00:50:56:81:48:02 [ether] on eth0
? (10.5.5.7) at fa:16:3e:06:17:3c [ether] on tap45ffdc5f-da
? (10.5.5.5) at fa:16:3e:8d:2c:f9 [ether] on qr-9f9041ce-65
? (10.5.5.5) at fa:16:3e:8d:2c:f9 [ether] on tap45ffdc5f-da
? (10.42.38.1) at 00:07:b4:01:b5:01 [ether] on eth3
root@os-network:/var/log/quantum# arp -i qr-9f9041ce-65 -d 10.5.5.5
root@os-network:/var/log/quantum# arp -i tap45ffdc5f-da -d 10.5.5.5
root@os-network:/var/log/quantum# arp -an
? (10.5.5.4) at <incomplete> on tap45ffdc5f-da
? (10.5.5.3) at <incomplete> on tap45ffdc5f-da
? (10.10.10.2) at 00:50:56:81:25:73 [ether] on eth1
? (192.168.0.1) at 00:50:56:81:48:02 [ether] on eth0
? (10.5.5.7) at fa:16:3e:06:17:3c [ether] on tap45ffdc5f-da
? (10.5.5.5) at fa:16:3e:8d:2c:f9 [ether] on tap45ffdc5f-da
? (10.42.38.1) at 00:07:b4:01:b5:01 [ether] on eth3
root@os-network:/var/log/quantum# arp -an
? (10.5.5.4) at <incomplete> on tap45ffdc5f-da
? (10.5.5.3) at <incomplete> on tap45ffdc5f-da
? (10.10.10.2) at 00:50:56:81:25:73 [ether] on eth1
? (192.168.0.1) at 00:50:56:81:48:02 [ether] on eth0
? (10.5.5.7) at fa:16:3e:06:17:3c [ether] on tap45ffdc5f-da
? (10.5.5.5) at fa:16:3e:8d:2c:f9 [ether] on qr-9f9041ce-65
? (10.5.5.5) at fa:16:3e:8d:2c:f9 [ether] on tap45ffdc5f-da
? (10.42.38.1) at 00:07:b4:01:b5:01 [ether] on eth3
root@os-network:/var/log/quantum#

I'm not clear on what the interfaces are, but q-9f9041ce-65 is 10.5.5.1 on
the network node, so he seems to be seeing the traffic.  tap45ffdc5f-da is
listed as 10.5.5.2, and I have no idea what function that is performing.

root@os-network:/var/log/quantum# ping 10.5.5.7
PING 10.5.5.7 (10.5.5.7) 56(84) bytes of data.
64 bytes from 10.5.5.7: icmp_req=1 ttl=64 time=1.93 ms
64 bytes from 10.5.5.7: icmp_req=2 ttl=64 time=2.08 ms
^C
--- 10.5.5.7 ping statistics ---
2 packets transmitted, 2 received, 0% packet loss, time 1001ms
rtt min/avg/max/mdev = 1.931/2.007/2.083/0.076 ms
root@os-network:/var/log/quantum# arp -an
? (10.5.5.4) at fa:16:3e:e0:17:f0 [ether] on tap45ffdc5f-da
? (10.5.5.3) at <incomplete> on tap45ffdc5f-da
? (10.10.10.2) at 00:50:56:81:25:73 [ether] on eth1
? (192.168.0.1) at 00:50:56:81:48:02 [ether] on eth0
? (10.5.5.7) at fa:16:3e:06:17:3c [ether] on tap45ffdc5f-da
? (10.5.5.5) at fa:16:3e:8d:2c:f9 [ether] on tap45ffdc5f-da
? (10.42.38.1) at 00:07:b4:01:b5:01 [ether] on eth3
root@os-network:/var/log/quantum# ping 10.5.5.5
PING 10.5.5.5 (10.5.5.5) 56(84) bytes of data.
^C
--- 10.5.5.5 ping statistics ---
9 packets transmitted, 0 received, 100% packet loss, time 8062ms

root@os-network:/var/log/quantum#


> Also, could you please tcpdump your network node on your management IP and
> check if you see GRE packets coming from your compute node (while pinging
> or trying to get a lease) ?
>

Threw a sniff up at http://pastebin.com/giwZysxW.  There were 4 pings from
10.5.5.7 (starting line 47), followed by 4 pings from 10.5.5.5.
Interesting to see the 10.5.5.3 and .4 references...I don't have passwords
for those images (sshed in with the keys), so I rebooted them while
sniffing here: http://pastebin.com/xpbgnhxu  The network node ARP table
never populated with .3 or .4, either.

It looks like quantum-openvswitch-agent is started:

root@os-compute-01:~# ps -ef | egrep quantum | egrep -v grep
quantum  11504     1  1 09:27 ?        00:01:44 python
/usr/bin/quantum-openvswitch-agent --config-file=/etc/quantum/quantum.conf
--config-file=/etc/quantum/plugins/openvswitch/ovs_quantum_plugin.ini
--log-file=/var/log/quantum/openvswitch-agent.log
root@os-compute-01:~#

Follow ups

References