← Back to team overview

kernel-packages team mailing list archive

[Bug 1500739] Re: CPU lockup on HP Proliant DL380 Gen9 servers

 

Next time this happens, please check the ilo for a massive backlog of
nbd kernel messages. I suspect it's nbd's insane logging rate (tens of
thousands of lines per second) endlessly growing the serial console
backlog. "dmesg -D" appears to be a quick way to fix machines in this
state, or prevent it from ever happening. There is probably a more
elegant solution (such as, get the nbd module to calm down)

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1500739

Title:
  CPU lockup on HP Proliant DL380 Gen9 servers

Status in linux package in Ubuntu:
  Confirmed

Bug description:
  Over the past 3-ish weeks we've had 3 seperate HP Proliant DL380 Gen9
  servers lock up with a similar looking cpu lockup bug.  All 3 of these
  servers are nova-compute nodes in an OpenStack cluster, with a
  reasonable amount of load on them.  The symptoms are the load shoots
  up into the hundreds, and ps stops returning.

  I've attached lspci -vnvn and 3 sets of syslog message traces that we
  grabbed on each of the 3 times it has crashed.

  $ cat /proc/version_signature 
  Ubuntu 3.16.0-49.65~14.04.1-generic 3.16.7-ckt15

  Please let us know if you need any further information.
  --- 
  AlsaDevices:
   total 0
   crw-rw---- 1 root audio 116,  1 Sep 29 06:47 seq
   crw-rw---- 1 root audio 116, 33 Sep 29 06:47 timer
  AplayDevices: Error: [Errno 2] No such file or directory
  ApportVersion: 2.14.1-0ubuntu3.15
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
  CRDA: Error: [Errno 2] No such file or directory
  DistroRelease: Ubuntu 14.04
  HibernationDevice: RESUME=UUID=c2111986-9219-44f2-ad09-8d367ce8b30b
  MachineType: HP ProLiant DL380 Gen9
  Package: linux (not installed)
  PciMultimedia:
   
  ProcEnviron:
   TERM=xterm
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=<set>
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB: 0 EFI VGA
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.16.0-49-generic root=UUID=038cefb6-3559-4104-9754-5308497961f5 ro console=tty0 console=ttyS1,115200 BOOTIF=01-3c:a8:2a:23:f5:00 quiet
  ProcVersionSignature: Ubuntu 3.16.0-49.65~14.04.1-generic 3.16.7-ckt15
  RelatedPackageVersions:
   linux-restricted-modules-3.16.0-49-generic N/A
   linux-backports-modules-3.16.0-49-generic  N/A
   linux-firmware                             1.127.15
  RfKill: Error: [Errno 2] No such file or directory
  Tags:  trusty uec-images
  Uname: Linux 3.16.0-49-generic x86_64
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups: adm cdrom dialout libvirtd lpadmin plugdev sambashare sudo
  _MarkForUpload: True
  dmi.bios.date: 05/06/2015
  dmi.bios.vendor: HP
  dmi.bios.version: P89
  dmi.chassis.type: 23
  dmi.chassis.vendor: HP
  dmi.modalias: dmi:bvnHP:bvrP89:bd05/06/2015:svnHP:pnProLiantDL380Gen9:pvr:cvnHP:ct23:cvr:
  dmi.product.name: ProLiant DL380 Gen9
  dmi.sys.vendor: HP

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1500739/+subscriptions


References