← Back to team overview

group.of.nepali.translators team mailing list archive

[Bug 1840789] Re: bnx2x: fatal hardware error/reboot/tx timeout with LLDP enabled

 

This fix is already present in Eoan and Unstable:

~/git/ubuntu-eoan$ git log --oneline origin/master-next -- drivers/net/ethernet/broadcom/bnx2x/ | head | grep cos
1c41d7b7cf60 bnx2x: Disable multi-cos feature.

~/git/ubuntu-eoan$ git describe --contains 1c41d7b7cf60
Ubuntu-5.2.0-12.13~51

~/git/ubuntu-unstable$ git log --oneline origin/master -- drivers/net/ethernet/broadcom/bnx2x/ | head | grep cos
d1f0b5dce8fd bnx2x: Disable multi-cos feature.
~/git/ubuntu-unstable$ git describe --contains d1f0b5dce8fd
Ubuntu-5.3.0-4.5~313^2~91


** Description changed:

- Description/patches to be provided this week.
+ [Impact]
+ 
+  * The bnx2x driver may cause hardware faults (leading to
+    panic/reboot) and other behaviors as transmit timeouts,
+    after commit 3968d38917eb ("bnx2x: Fix Multi-Cos.") is
+    introduced.
+ 
+  * This issue has been observed by an user shortly
+    after starting docker & kubelet, with adapters:
+    - Broadcom NetXtreme II BCM57800 [14e4:168a] from Dell [1028:1f5c]
+    - Broadcom NetXtreme II BCM57840 [14e4:16a1] from Dell [1028:1f79]
+ 
+  * If options to ignore hardware faults are used
+    (erst_disable=1 hest_disable=1 ghes.disable=1)
+    the system doesn't panic/reboot and continues
+    on to timeout on adapter stats, then transmit
+    timeouts, spewing some adapter firmware dumps,
+    but the network interface is non-functional.
+ 
+  * The issue only happened when LLDP is enabled
+    on the network switches, and crashdump shows
+    the bnx2x driver is stuck/waits for firmware
+    to complete the stop traffic command in LLDP
+    handling. Workaround used is to disable LLDP
+    in the network switches/ports.
+ 
+  * Analysis of the driver and firmware dumps
+    didn't help significantly towards finding
+    the root cause.
+ 
+  * Upstream/mainline recently just reverted the
+    patch, due to similar problem reports, while
+    looking for the root cause/proper fix.
+ 
+ [Test Case]
+ 
+  * No reproducible test case found outside
+    the user's systems/cluster, where it is
+    enough to start docker & kubelet & wait.
+ 
+  * The user verified test kernels for Xenial
+    and Bionic - the problem does not happen.
+ 
+ [Regression Potential]
+ 
+  * Users who significantly use/apply the non-default
+    traffic class (tc) / class of service (cos) might
+    possibly see performance changes (if any at all)
+    in such applications, however that's unclear now.
+ 
+  * This is a recent revert upstream (v5.3-rc'ish),
+    so there's chance things might change in this area.
+ 
+  * Nonetheless, the patch is authored by the driver
+    vendor, and made its way into stable kernels
+    (e.g., v5.2.8 which made Eoan/19.10 recently).

** Also affects: linux (Ubuntu Bionic)
   Importance: Undecided
       Status: New

** Also affects: linux (Ubuntu Xenial)
   Importance: Undecided
       Status: New

** Also affects: linux (Ubuntu Eoan)
   Importance: Undecided
     Assignee: Mauricio Faria de Oliveira (mfo)
       Status: In Progress

** Also affects: linux (Ubuntu Disco)
   Importance: Undecided
       Status: New

** Changed in: linux (Ubuntu Eoan)
       Status: In Progress => Fix Released

** Changed in: linux (Ubuntu Disco)
       Status: New => In Progress

** Changed in: linux (Ubuntu Bionic)
       Status: New => In Progress

** Changed in: linux (Ubuntu Xenial)
       Status: New => In Progress

** Changed in: linux (Ubuntu Disco)
     Assignee: (unassigned) => Mauricio Faria de Oliveira (mfo)

** Changed in: linux (Ubuntu Bionic)
     Assignee: (unassigned) => Mauricio Faria de Oliveira (mfo)

** Changed in: linux (Ubuntu Xenial)
     Assignee: (unassigned) => Mauricio Faria de Oliveira (mfo)

** Description changed:

  [Impact]
  
-  * The bnx2x driver may cause hardware faults (leading to
-    panic/reboot) and other behaviors as transmit timeouts,
-    after commit 3968d38917eb ("bnx2x: Fix Multi-Cos.") is
-    introduced.
+  * The bnx2x driver may cause hardware faults (leading to
+    panic/reboot) and other behaviors as transmit timeouts,
+    after commit 3968d38917eb ("bnx2x: Fix Multi-Cos.") is
+    introduced.
  
-  * This issue has been observed by an user shortly
-    after starting docker & kubelet, with adapters:
-    - Broadcom NetXtreme II BCM57800 [14e4:168a] from Dell [1028:1f5c]
-    - Broadcom NetXtreme II BCM57840 [14e4:16a1] from Dell [1028:1f79]
+  * This issue has been observed by an user shortly
+    after starting docker & kubelet, with adapters:
+    - Broadcom NetXtreme II BCM57800 [14e4:168a] from Dell [1028:1f5c]
+    - Broadcom NetXtreme II BCM57840 [14e4:16a1] from Dell [1028:1f79]
  
-  * If options to ignore hardware faults are used
-    (erst_disable=1 hest_disable=1 ghes.disable=1)
-    the system doesn't panic/reboot and continues
-    on to timeout on adapter stats, then transmit
-    timeouts, spewing some adapter firmware dumps,
-    but the network interface is non-functional.
+  * If options to ignore hardware faults are used
+    (erst_disable=1 hest_disable=1 ghes.disable=1)
+    the system doesn't panic/reboot and continues
+    on to timeout on adapter stats, then transmit
+    timeouts, spewing some adapter firmware dumps,
+    but the network interface is non-functional.
  
-  * The issue only happened when LLDP is enabled
-    on the network switches, and crashdump shows
-    the bnx2x driver is stuck/waits for firmware
-    to complete the stop traffic command in LLDP
-    handling. Workaround used is to disable LLDP
-    in the network switches/ports.
+  * The issue only happened when LLDP is enabled
+    on the network switches, and crashdump shows
+    the bnx2x driver is stuck/waits for firmware
+    to complete the stop traffic command in LLDP
+    handling. Workaround used is to disable LLDP
+    in the network switches/ports.
  
-  * Analysis of the driver and firmware dumps
-    didn't help significantly towards finding
-    the root cause.
+  * Analysis of the driver and firmware dumps
+    didn't help significantly towards finding
+    the root cause.
  
-  * Upstream/mainline recently just reverted the
-    patch, due to similar problem reports, while
-    looking for the root cause/proper fix.
+  * Upstream/mainline recently just reverted the
+    patch, due to similar problem reports, while
+    looking for the root cause/proper fix.
  
  [Test Case]
  
-  * No reproducible test case found outside
-    the user's systems/cluster, where it is
-    enough to start docker & kubelet & wait.
+  * No reproducible test case found outside
+    the user's systems/cluster, where it is
+    enough to start docker & kubelet & wait.
  
-  * The user verified test kernels for Xenial
-    and Bionic - the problem does not happen.
+  * The user verified test kernels for Xenial
+    and Bionic - the problem does not happen;
+    build-tested on Disco.
  
  [Regression Potential]
  
-  * Users who significantly use/apply the non-default
-    traffic class (tc) / class of service (cos) might
-    possibly see performance changes (if any at all)
-    in such applications, however that's unclear now.
+  * Users who significantly use/apply the non-default
+    traffic class (tc) / class of service (cos) might
+    possibly see performance changes (if any at all)
+    in such applications, however that's unclear now.
  
-  * This is a recent revert upstream (v5.3-rc'ish),
-    so there's chance things might change in this area.
+  * This is a recent revert upstream (v5.3-rc'ish),
+    so there's chance things might change in this area.
  
-  * Nonetheless, the patch is authored by the driver
-    vendor, and made its way into stable kernels
-    (e.g., v5.2.8 which made Eoan/19.10 recently).
+  * Nonetheless, the patch is authored by the driver
+    vendor, and made its way into stable kernels
+    (e.g., v5.2.8 which made Eoan/19.10 recently).

-- 
You received this bug notification because you are a member of नेपाली
भाषा समायोजकहरुको समूह, which is subscribed to Xenial.
Matching subscriptions: Ubuntu 16.04 Bugs
https://bugs.launchpad.net/bugs/1840789

Title:
  bnx2x: fatal hardware error/reboot/tx timeout with LLDP enabled

Status in linux package in Ubuntu:
  Fix Released
Status in linux source package in Xenial:
  In Progress
Status in linux source package in Bionic:
  In Progress
Status in linux source package in Disco:
  In Progress
Status in linux source package in Eoan:
  Fix Released

Bug description:
  [Impact]

   * The bnx2x driver may cause hardware faults (leading to
     panic/reboot) and other behaviors as transmit timeouts,
     after commit 3968d38917eb ("bnx2x: Fix Multi-Cos.") is
     introduced.

   * This issue has been observed by an user shortly
     after starting docker & kubelet, with adapters:
     - Broadcom NetXtreme II BCM57800 [14e4:168a] from Dell [1028:1f5c]
     - Broadcom NetXtreme II BCM57840 [14e4:16a1] from Dell [1028:1f79]

   * If options to ignore hardware faults are used
     (erst_disable=1 hest_disable=1 ghes.disable=1)
     the system doesn't panic/reboot and continues
     on to timeout on adapter stats, then transmit
     timeouts, spewing some adapter firmware dumps,
     but the network interface is non-functional.

   * The issue only happened when LLDP is enabled
     on the network switches, and crashdump shows
     the bnx2x driver is stuck/waits for firmware
     to complete the stop traffic command in LLDP
     handling. Workaround used is to disable LLDP
     in the network switches/ports.

   * Analysis of the driver and firmware dumps
     didn't help significantly towards finding
     the root cause.

   * Upstream/mainline recently just reverted the
     patch, due to similar problem reports, while
     looking for the root cause/proper fix.

  [Test Case]

   * No reproducible test case found outside
     the user's systems/cluster, where it is
     enough to start docker & kubelet & wait.

   * The user verified test kernels for Xenial
     and Bionic - the problem does not happen;
     build-tested on Disco.

  [Regression Potential]

   * Users who significantly use/apply the non-default
     traffic class (tc) / class of service (cos) might
     possibly see performance changes (if any at all)
     in such applications, however that's unclear now.

   * This is a recent revert upstream (v5.3-rc'ish),
     so there's chance things might change in this area.

   * Nonetheless, the patch is authored by the driver
     vendor, and made its way into stable kernels
     (e.g., v5.2.8 which made Eoan/19.10 recently).

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1840789/+subscriptions