kernel-packages team mailing list archive

[Bug 996369] Re: bond slave interface sometimes does not come up on boot

 

We can confirm similar issues with Precise on its stock 3.2 kernel. However,
a default Precise 12.04.3 installation running either the generic Quantal
(3.5) or Raring (3.8) LTS kernel seems to work fine.

Tests performed:
Continuous reboot loop using a script similar to the one mentioned by Chris J Arges. Counted the number of successful boots (working bond0 with both slaves up) with different kernels.

Symptoms after a failed boot (one of):
- bond0 is down with no slaves
- bond0 comes up with only 1 slave
- bond0 comes up with 2 slaves, but 1 interface is marked down
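
For anyone who wants to repeat the boot-loop test, here is a minimal sketch of how the per-boot check could be automated. This is not the script we actually ran. The function names (parse_bond_status, classify_boot) and the result strings are made up for illustration, and it assumes the usual "Slave Interface:" / "MII Status:" layout of /proc/net/bonding/bond0.

```python
import re


def parse_bond_status(text):
    """Return (bond MII status, {slave name: MII status}) from bonding proc text."""
    bond_mii = None
    slaves = {}
    current = None  # slave section being read; None while in the bond header
    for raw in text.splitlines():
        line = raw.strip()
        m = re.match(r"Slave Interface:\s*(\S+)", line)
        if m:
            current = m.group(1)
            continue
        m = re.match(r"MII Status:\s*(\S+)", line)
        if m:
            if current is None:
                bond_mii = m.group(1)
            else:
                slaves[current] = m.group(1)
    return bond_mii, slaves


def classify_boot(text, expected=("eth0", "eth1")):
    """Map one boot's bonding status onto the symptoms listed above."""
    bond_mii, slaves = parse_bond_status(text)
    if not slaves:
        return "no slaves"
    missing = [s for s in expected if s not in slaves]
    if missing:
        return "missing slave(s): " + ", ".join(missing)
    down = [s for s in expected if slaves[s] != "up"]
    if down:
        return "slave(s) marked down: " + ", ".join(down)
    if bond_mii != "up":
        return "bond down"
    return "ok"
```

On a test machine, the text would come from reading /proc/net/bonding/bond0 once the network is up. The result would be appended to a log file before the next reboot, and a boot counts as successful only when classify_boot returns "ok".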

Hardware:
Dell M610-II
01:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709S Gigabit Ethernet (rev 20)
01:00.1 Ethernet controller: Broadcom Corporation NetXtreme II BCM5709S Gigabit Ethernet (rev 20)

Network configuration:
root@test:~# cat /etc/network/interfaces
auto lo
iface lo inet loopback

auto eth0
iface eth0 inet manual
  bond-master bond0
  bond-primary eth0 eth1

auto eth1
iface eth1 inet manual
  bond-master bond0
  bond-primary eth0 eth1

auto bond0
iface bond0 inet static
  address 10.1.1.2
  netmask 255.255.255.0
  broadcast 10.1.1.255
  gateway 10.1.1.1
  bond_arp_validate 3
  bond_mode active-backup
  bond_arp_interval 200
  bond_arp_ip_target 10.1.1.1
  bond_slaves none

root@test:~# cat /etc/modprobe.d/bonding.conf
alias bond0 bonding
options bonding mode=1 arp_interval=200 arp_ip_target=10.1.1.1

Test results:
Ubuntu 12.04.3 LTS with linux-image-3.2.0-54-generic (3.2.0-54.82) - several boots failed with the symptoms listed above
Ubuntu 12.04.3 LTS with linux-image-3.5.0-41-generic (3.5.0-41.64~precise1) - 300+ successful boots
Ubuntu 12.04.3 LTS with linux-image-3.8.0-31-generic (3.8.0-31.46~precise1) - 300+ successful boots

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/996369

Title:
  bond slave interface sometimes does not come up on boot

Status in “linux” package in Ubuntu:
  Incomplete
Status in “linux” source package in Precise:
  Confirmed
Status in “linux” source package in Quantal:
  Confirmed
Status in “linux” source package in Raring:
  Confirmed
Status in “linux” source package in Saucy:
  Incomplete

Bug description:
  bug report:
  Hi guys,

  I'm running Ubuntu 12.04 Server on an HP DL380 G7.
  server01 ~ # lsb_release -rd
  Description:	Ubuntu 12.04 LTS
  Release:	12.04

  I've got 2 Ethernet cards providing 4x 10G interfaces in total.
  All four interfaces are in one bond, but only eth4 and eth6 are patched.

  server01 ~ # ethtool -i eth4
  driver: be2net
  version: 4.0.100u
  firmware-version: 4.0.360.15
  bus-info: 0000:0e:00.0
  supports-statistics: yes
  supports-test: yes
  supports-eeprom-access: yes
  supports-register-dump: no

  server01 ~ # ethtool -i eth6
  driver: be2net
  version: 4.0.100u
  firmware-version: 4.0.360.15
  bus-info: 0000:15:00.0
  supports-statistics: yes
  supports-test: yes
  supports-eeprom-access: yes
  supports-register-dump: no

  After a reboot I expect to see a bond0 interface with 2 RUNNING SLAVE
  interfaces.

  However, sometimes only 1 interface comes up in the bond after a
  reboot. When I manually bring the other interface up (ifconfig
  eth6 up), it works:

  # cat /proc/net/bonding/bond0 
  Ethernet Channel Bonding Driver: v3.7.1 (April 27, 2011)

  Bonding Mode: IEEE 802.3ad Dynamic link aggregation
  Transmit Hash Policy: layer2 (0)
  MII Status: up
  MII Polling Interval (ms): 100
  Up Delay (ms): 0
  Down Delay (ms): 0

  802.3ad info
  LACP rate: slow
  Min links: 0
  Aggregator selection policy (ad_select): stable
  Active Aggregator Info:
  	Aggregator ID: 2
  	Number of ports: 1
  	Actor Key: 33
  	Partner Key: 32773
  	Partner Mac Address: 00:23:04:ee:be:01

  Slave Interface: eth6
  MII Status: down      <<<<<<<<<<<< SHOULD BE UP
  Speed: 10000 Mbps
  Duplex: full
  Link Failure Count: 1
  Permanent HW addr: 00:9c:02:3c:c9:70
  Aggregator ID: 1
  Slave queue ID: 0

  Slave Interface: eth4
  MII Status: up        <<<<<<<<<<<< ONLY INTERFACE THAT IS UP
  Speed: 10000 Mbps
  Duplex: full
  Link Failure Count: 0
  Permanent HW addr: 00:9c:02:3c:99:98
  Aggregator ID: 2
  Slave queue ID: 0

  Slave Interface: eth7
  MII Status: down
  Speed: Unknown
  Duplex: Unknown
  Link Failure Count: 0
  Permanent HW addr: 00:9c:02:3c:c9:74
  Aggregator ID: 3
  Slave queue ID: 0

  Slave Interface: eth5
  MII Status: down
  Speed: Unknown
  Duplex: Unknown
  Link Failure Count: 0
  Permanent HW addr: 00:9c:02:3c:99:9c
  Aggregator ID: 4
  Slave queue ID: 0

  server01 ~ # ifconfig
  bond0     Link encap:Ethernet  HWaddr 00:9c:02:3c:c9:70  
            inet6 addr: fe80::29c:2ff:fe3c:c970/64 Scope:Link
            UP BROADCAST RUNNING MASTER MULTICAST  MTU:1500  Metric:1
            RX packets:169071 errors:0 dropped:54 overruns:0 frame:0
            TX packets:1236 errors:0 dropped:0 overruns:0 carrier:0
            collisions:0 txqueuelen:0 
            RX bytes:18435358 (18.4 MB)  TX bytes:174727 (174.7 KB)

  eth4      Link encap:Ethernet  HWaddr 00:9c:02:3c:c9:70  
            UP BROADCAST RUNNING SLAVE MULTICAST  MTU:1500  Metric:1
            RX packets:169071 errors:0 dropped:45 overruns:0 frame:0
            TX packets:1235 errors:0 dropped:0 overruns:0 carrier:0
            collisions:0 txqueuelen:1000 
            RX bytes:18435358 (18.4 MB)  TX bytes:174637 (174.6 KB)

  eth5      Link encap:Ethernet  HWaddr 00:9c:02:3c:c9:70  
            UP BROADCAST SLAVE MULTICAST  MTU:1500  Metric:1
            RX packets:0 errors:0 dropped:0 overruns:0 frame:0
            TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
            collisions:0 txqueuelen:1000 
            RX bytes:0 (0.0 B)  TX bytes:0 (0.0 B)

  eth7      Link encap:Ethernet  HWaddr 00:9c:02:3c:c9:70  
            UP BROADCAST SLAVE MULTICAST  MTU:1500  Metric:1
            RX packets:0 errors:0 dropped:0 overruns:0 frame:0
            TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
            collisions:0 txqueuelen:1000 
            RX bytes:0 (0.0 B)  TX bytes:0 (0.0 B)

  lo        Link encap:Local Loopback  
            inet addr:127.0.0.1  Mask:255.0.0.0
            inet6 addr: ::1/128 Scope:Host
            UP LOOPBACK RUNNING  MTU:16436  Metric:1
            RX packets:1333 errors:0 dropped:0 overruns:0 frame:0
            TX packets:1333 errors:0 dropped:0 overruns:0 carrier:0
            collisions:0 txqueuelen:0 
            RX bytes:102784 (102.7 KB)  TX bytes:102784 (102.7 KB)

  vlan888   Link encap:Ethernet  HWaddr 00:9c:02:3c:c9:70  
            inet addr:1.1.0.50  Bcast:1.1.0.63  Mask:255.255.255.240
            inet6 addr: 2222:2222:ffff::11/124 Scope:Global
            inet6 addr: fe80::29c:2ff:fe3c:c970/64 Scope:Link
            UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
            RX packets:168250 errors:0 dropped:0 overruns:0 frame:0
            TX packets:661 errors:0 dropped:0 overruns:0 carrier:0
            collisions:0 txqueuelen:0 
            RX bytes:15995674 (15.9 MB)  TX bytes:41938 (41.9 KB)

  vlan889   Link encap:Ethernet  HWaddr 00:9c:02:3c:c9:70  
            inet addr:1.1.0.5  Bcast:1.1.0.15  Mask:255.255.255.240
            inet6 addr: 2222:2222:ffff::105/120 Scope:Global
            inet6 addr: fe80::29c:2ff:fe3c:c970/64 Scope:Link
            UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
            RX packets:708 errors:0 dropped:0 overruns:0 frame:0
            TX packets:571 errors:0 dropped:0 overruns:0 carrier:0
            collisions:0 txqueuelen:0 
            RX bytes:59088 (59.0 KB)  TX bytes:126565 (126.5 KB)

  
  server01 ~ # ifconfig eth6 up

  Logged in /var/log/syslog:
  May  8 07:35:14 server01 kernel: [  201.620795] 8021q: adding VLAN 0 to HW filter on device eth6
  May  8 07:35:14 server01 kernel: [  201.627183] bonding: bond0: link status definitely up for interface eth6, 10000 Mbps full duplex.

  # cat /proc/net/bonding/bond0 
  Ethernet Channel Bonding Driver: v3.7.1 (April 27, 2011)

  Bonding Mode: IEEE 802.3ad Dynamic link aggregation
  Transmit Hash Policy: layer2 (0)
  MII Status: up
  MII Polling Interval (ms): 100
  Up Delay (ms): 0
  Down Delay (ms): 0

  802.3ad info
  LACP rate: slow
  Min links: 0
  Aggregator selection policy (ad_select): stable
  Active Aggregator Info:
  	Aggregator ID: 2
  	Number of ports: 2
  	Actor Key: 33
  	Partner Key: 32773
  	Partner Mac Address: 00:23:04:ee:be:01

  Slave Interface: eth6
  MII Status: up
  Speed: 10000 Mbps
  Duplex: full
  Link Failure Count: 1
  Permanent HW addr: 00:9c:02:3c:c9:70
  Aggregator ID: 2
  Slave queue ID: 0

  Slave Interface: eth4
  MII Status: up
  Speed: 10000 Mbps
  Duplex: full
  Link Failure Count: 0
  Permanent HW addr: 00:9c:02:3c:99:98
  Aggregator ID: 2
  Slave queue ID: 0

  Slave Interface: eth7
  MII Status: down
  Speed: Unknown
  Duplex: Unknown
  Link Failure Count: 0
  Permanent HW addr: 00:9c:02:3c:c9:74
  Aggregator ID: 3
  Slave queue ID: 0

  Slave Interface: eth5
  MII Status: down
  Speed: Unknown
  Duplex: Unknown
  Link Failure Count: 0
  Permanent HW addr: 00:9c:02:3c:99:9c
  Aggregator ID: 4
  Slave queue ID: 0

  Network configuration:
  # cat /etc/network/interfaces 
  # This file describes the network interfaces available on your system
  # and how to activate them. For more information, see interfaces(5).

  # The loopback network interface
  auto lo
  iface lo inet loopback

  # The primary network interface
  auto eth4 eth5 eth6 eth7
  iface eth4 inet manual
    bond-master bond0
  iface eth5 inet manual
    bond-master bond0
  iface eth6 inet manual
    bond-master bond0
  iface eth7 inet manual
    bond-master bond0

  auto bond0
  iface bond0 inet static
    bond-slaves none
    bond-mode 802.3ad  
    bond-miimon 100
    address 0.0.0.0
    netmask 0.0.0.0
    ip-proxy-arp 0

  auto vlan888
  iface vlan888 inet static
    vlan_raw_device bond0
    address 1.1.0.50
    netmask 255.255.255.240
    ip-proxy-arp 0
    post-up sysctl -w net.ipv4.conf.${IFACE}.forwarding=1
    post-up ip route add 192.168.1.0/24 via 1.1.0.53
    post-up ip route add 1.1.0.32/28 via 1.1.0.53
    post-up ip route add 0.0.0.0/0 via 1.1.0.53

  iface vlan888 inet6 static
    vlan_raw_device bond0
    address 2222:2222:ffff::11
    netmask 124
    post-up sysctl -w net.ipv6.conf.${IFACE}.forwarding=1
    post-up ip -6 route add 2222:2222:ffff::/124 via 2222:2222:ffff::14
    post-up ip -6 route add ::/0 via 2222:2222:ffff::14

  auto vlan889
  iface vlan889 inet static
    vlan_raw_device bond0
    address 1.1.0.5
    netmask 255.255.255.240
    ip-proxy-arp 0
    post-up sysctl -w net.ipv4.conf.${IFACE}.forwarding=1
    post-up ip route add 10.0.0.0/8 via 1.1.0.1
    post-up ip route add 192.168.0.0/12 via 1.1.0.1
    post-up ip route add 1.1.0.0/20 via 1.1.0.1
    post-up ip route add 2.2.0.0/24 via 1.1.0.1
    post-up ip route add 3.3.0.0/24 via 1.1.0.1

  iface vlan889 inet6 static
    vlan_raw_device bond0
    address 2222:2222:ffff::105
    netmask 120
    post-up sysctl -w net.ipv6.conf.${IFACE}.forwarding=1
    post-up ip -6 route add 2222:2222::/32 via 2222:2222:ffff::101
  --- 
  AlsaDevices:
   total 0
   crw-rw---T 1 root audio 116,  1 May  8 07:31 seq
   crw-rw---T 1 root audio 116, 33 May  8 07:31 timer
  AplayDevices: Error: [Errno 2] No such file or directory
  ApportVersion: 2.0.1-0ubuntu7
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
  CRDA: Error: [Errno 2] No such file or directory
  DistroRelease: Ubuntu 12.04
  HibernationDevice: RESUME=UUID=3a88e785-42be-4d09-a3d2-6509e148b49a
  InstallationMedia: Ubuntu-Server 12.04 LTS "Precise Pangolin" - Beta amd64 (20120327)
  MachineType: HP ProLiant DL380 G7
  Package: linux (not installed)
  PciMultimedia:
   
  ProcFB: 0 radeondrmfb
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.2.0-24-generic root=UUID=db7e17c0-5861-4074-9467-cffce02483c0 ro
  ProcVersionSignature: Ubuntu 3.2.0-24.37-generic 3.2.14
  RelatedPackageVersions:
   linux-restricted-modules-3.2.0-24-generic N/A
   linux-backports-modules-3.2.0-24-generic  N/A
   linux-firmware                            1.79
  RfKill: Error: [Errno 2] No such file or directory
  Tags:  precise
  Uname: Linux 3.2.0-24-generic x86_64
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups:
   
  dmi.bios.date: 05/05/2011
  dmi.bios.vendor: HP
  dmi.bios.version: P67
  dmi.chassis.type: 23
  dmi.chassis.vendor: HP
  dmi.modalias: dmi:bvnHP:bvrP67:bd05/05/2011:svnHP:pnProLiantDL380G7:pvr:cvnHP:ct23:cvr:
  dmi.product.name: ProLiant DL380 G7
  dmi.sys.vendor: HP
