← Back to team overview

kernel-packages team mailing list archive

[Bug 1315736] Re: Machine Check Exception

 

mesg also has following message:

[ 9441.626809] kernel BUG at /build/buildd/linux-3.13.0/mm/memory.c:3756!
[ 9441.628777] invalid opcode: 0000 [#1] SMP 
[ 9441.630053] Modules linked in: ip6t_REJECT xt_hl ip6t_rt nf_conntrack_ipv6 nf
_defrag_ipv6 ipt_REJECT xt_limit xt_tcpudp xt_addrtype nf_conntrack_ipv4 nf_defr
ag_ipv4 xt_conntrack ip6table_filter ip6_tables rpcsec_gss_krb5 nfsv4 dcdbas nfsd auth_rpcgss nfs_acl nfs lockd sunrpc fscache nf_conntrack_netbios_ns nf_conntrack_broadcast nf_nat_ftp nf_nat nf_conntrack_ftp nf_conntrack gpio_ich iptable_filter ip_tables x_tables x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm joydev crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd shpchp mei_me lpc_ich mei wmi ipmi_si lp mac_hid acpi_power_meter parport hid_generic tg3 ahci usbhid ptp hid libahci megaraid_sas pps_core
[ 9441.651872] CPU: 29 PID: 12407 Comm: java Not tainted 3.13.0-24-generic #46-Ubuntu
[ 9441.654159] Hardware name: Dell Inc. PowerEdge R720/0DCWD1, BIOS 2.2.2 01/16/2014
[ 9441.656421] task: ffff8817c7115fc0 ti: ffff880294c74000 task.ti: ffff880294c74000
[ 9441.658682] RIP: 0010:[<ffffffff81179051>]  [<ffffffff81179051>] handle_mm_fault+0xe61/0xf10
[ 9441.661268] RSP: 0018:ffff880294c75d98  EFLAGS: 00010246
[ 9441.662863] RAX: 0000000000000100 RBX: 00000007ff258010 RCX: ffff880294c75b18
[ 9441.665019] RDX: ffff8817c7115fc0 RSI: 0000000000000000 RDI: 8000000182e009e6
[ 9441.667173] RBP: ffff880294c75e20 R08: 0000000000000000 R09: 00000000000000a9
[ 9441.669329] R10: 0000000000000001 R11: 0000000000000000 R12: ffff8801057acfc8
[ 9441.671483] R13: ffff881238353800 R14: ffff8817f8fdd080 R15: 0000000000000080
[ 9441.673640] FS:  00007fd475705700(0000) GS:ffff88301f3c0000(0000) knlGS:0000000000000000
[ 9441.676088] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 9441.677817] CR2: 00007f9cd00451b8 CR3: 00000002cd12c000 CR4: 00000000001407e0
[ 9441.679973] Stack:
[ 9441.680557]  0000000000000001 ffff880294c75db0 ffffffff8109a780 ffff880294c75dd0
[ 9441.682897]  ffffffff810d7ad6 0000000000000001 ffffffff81f1e948 ffff880294c75e78
[ 9441.685238]  ffffffff810d983d ffff880294c75e48 ffff8800000000a9 00000001ffffffff
[ 9441.687580] Call Trace:
[ 9441.688309]  [<ffffffff8109a780>] ? wake_up_state+0x10/0x20
[ 9441.689986]  [<ffffffff810d7ad6>] ? wake_futex+0x66/0x90
[ 9441.691583]  [<ffffffff810d983d>] ? futex_wake_op+0x4ed/0x620
[ 9441.693314]  [<ffffffff817219a4>] __do_page_fault+0x184/0x560
[ 9441.695046]  [<ffffffff811112fc>] ? acct_account_cputime+0x1c/0x20
[ 9441.696912]  [<ffffffff8109d76b>] ? account_user_time+0x8b/0xa0
[ 9441.698697]  [<ffffffff8109dd84>] ? vtime_account_user+0x54/0x60
[ 9441.700505]  [<ffffffff81721d9a>] do_page_fault+0x1a/0x70
[ 9441.702132]  [<ffffffff8171e208>] page_fault+0x28/0x30
[ 9441.703673] Code: ff 48 89 d9 4c 89 e2 4c 89 ee 4c 89 f7 44 89 4d c8 e8 34 c1 ff ff 85 c0 0f 85 94 f5 ff ff 49 8b 3c 24 44 8b 4d c8 e9 68 f3 ff ff <0f> 0b be 8e 00 00 00 48 c7 c7 18 25 a6 81 44 89 4d c8 e8 18 e7 
[ 9441.711148] RIP  [<ffffffff81179051>] handle_mm_fault+0xe61/0xf10
[ 9441.713013]  RSP <ffff880294c75d98>
[ 9441.877706] ---[ end trace e9d74187101c55ae ]---

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1315736

Title:
  Machine Check Exception

Status in “linux” package in Ubuntu:
  Confirmed

Bug description:
  Dell PowerEdge 720 on ubuntu 14.04 shows MCE errors on dmesg. Dell
  support instructed to run DSET and BIOS hardware diagnostics. Neither
  of the tools showed any errors. Dell support said that if there was a
  hardware error it would have been shown on Dell logs and the probable
  reason for the dmesg log is a bug in ubuntu kernel MCE reporting.

  So, is it that following dmesg is because of a kernel bug in ubuntu
  14.04 server?

  [11562.171040] Please check user daemon is running.
  [94953.306404] sbridge: HANDLING MCE MEMORY ERROR
  [94953.306415] CPU 1: Machine Check Exception: 0 Bank 9: 8c00004b000800c0
  [94953.306416] TSC 0 ADDR 2dfa0e1000 MISC 90000800080168c PROCESSOR 0:306e4 TIME 1399142359 SOCKET 1 APIC 20
  [94953.306422] sbridge: HANDLING MCE MEMORY ERROR
  [94953.306423] CPU 1: Machine Check Exception: 0 Bank 10: 8c000050000800c1
  [94953.306424] TSC 0 ADDR 2dfa0e1000 MISC 90000000000208c PROCESSOR 0:306e4 TIME 1399142359 SOCKET 1 APIC 20
  [94953.532217] EDAC MC1: 1 CE memory scrubbing error on CPU_SrcID#1_Channel#0_DIMM#0 (channel:0 slot:0 page:0x2dfa0e1 offset:0x0 grain:32 syndrome:0x0 -  area:DRAM err_code:0008:00c0 socket:1 channel_mask:3 rank:0)
  [94953.532226] EDAC MC1: 1 CE memory scrubbing error on CPU_SrcID#1_Channel#1_DIMM#0 (channel:1 slot:0 page:0x2dfa0e1 offset:0x0 grain:32 syndrome:0x0 -  area:DRAM err_code:0008:00c1 socket:1 channel_mask:3 rank:0)

  ProblemType: Bug
  DistroRelease: Ubuntu 14.04
  Package: linux-image-3.13.0-24-generic 3.13.0-24.46
  ProcVersionSignature: Ubuntu 3.13.0-24.46-generic 3.13.9
  Uname: Linux 3.13.0-24-generic x86_64
  AlsaDevices:
   total 0
   crw-rw---- 1 root audio 116,  1 touko  2 19:15 seq
   crw-rw---- 1 root audio 116, 33 touko  2 19:15 timer
  AplayDevices: Error: [Errno 2] No such file or directory: 'aplay'
  ApportVersion: 2.14.1-0ubuntu3
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord'
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
  CRDA: Error: [Errno 2] No such file or directory: 'iw'
  CurrentDmesg: Error: command ['sh', '-c', 'dmesg | comm -13 --nocheck-order /var/log/dmesg -'] failed with exit code 1: comm: /var/log/dmesg: Permission denied
  Date: Sat May  3 21:52:07 2014
  InstallationDate: Installed on 2014-02-26 (66 days ago)
  InstallationMedia: Ubuntu-Server 14.04 LTS "Trusty Tahr" - Alpha amd64 (20140219)
  MachineType: Dell Inc. PowerEdge R720
  PciMultimedia:
   
  ProcFB: 0 VESA VGA
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.13.0-24-generic root=UUID=c03eb237-955a-4dee-bba1-deded53df372 ro
  RfKill: Error: [Errno 2] No such file or directory: 'rfkill'
  SourcePackage: linux
  UpgradeStatus: No upgrade log present (probably fresh install)
  WifiSyslog:
   
  dmi.bios.date: 01/16/2014
  dmi.bios.vendor: Dell Inc.
  dmi.bios.version: 2.2.2
  dmi.board.name: 0DCWD1
  dmi.board.vendor: Dell Inc.
  dmi.board.version: A01
  dmi.chassis.type: 23
  dmi.chassis.vendor: Dell Inc.
  dmi.modalias: dmi:bvnDellInc.:bvr2.2.2:bd01/16/2014:svnDellInc.:pnPowerEdgeR720:pvr:rvnDellInc.:rn0DCWD1:rvrA01:cvnDellInc.:ct23:cvr:
  dmi.product.name: PowerEdge R720
  dmi.sys.vendor: Dell Inc.
  --- 
  AlsaDevices:
   total 0
   crw-rw---- 1 root audio 116,  1 touko  2 19:15 seq
   crw-rw---- 1 root audio 116, 33 touko  2 19:15 timer
  AplayDevices: Error: [Errno 2] No such file or directory
  ApportVersion: 2.14.1-0ubuntu3
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
  CRDA: Error: [Errno 2] No such file or directory
  CurrentDmesg:
   Error: command ['sh', '-c', 'dmesg | comm -13 --nocheck-order /var/log/dmesg -'] failed with exit code 1: comm: /var/log/dmesg: Permission denied
   dmesg: write failed: Broken pipe
  DistroRelease: Ubuntu 14.04
  InstallationDate: Installed on 2014-02-26 (66 days ago)
  InstallationMedia: Ubuntu-Server 14.04 LTS "Trusty Tahr" - Alpha amd64 (20140219)
  MachineType: Dell Inc. PowerEdge R720
  Package: linux (not installed)
  PciMultimedia:
   
  ProcFB: 0 VESA VGA
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.13.0-24-generic root=UUID=c03eb237-955a-4dee-bba1-deded53df372 ro
  ProcVersionSignature: Ubuntu 3.13.0-24.46-generic 3.13.9
  RfKill: Error: [Errno 2] No such file or directory
  Tags:  trusty
  Uname: Linux 3.13.0-24-generic x86_64
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups:
   
  WifiSyslog:
   
  _MarkForUpload: True
  dmi.bios.date: 01/16/2014
  dmi.bios.vendor: Dell Inc.
  dmi.bios.version: 2.2.2
  dmi.board.name: 0DCWD1
  dmi.board.vendor: Dell Inc.
  dmi.board.version: A01
  dmi.chassis.type: 23
  dmi.chassis.vendor: Dell Inc.
  dmi.modalias: dmi:bvnDellInc.:bvr2.2.2:bd01/16/2014:svnDellInc.:pnPowerEdgeR720:pvr:rvnDellInc.:rn0DCWD1:rvrA01:cvnDellInc.:ct23:cvr:
  dmi.product.name: PowerEdge R720
  dmi.sys.vendor: Dell Inc.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1315736/+subscriptions


References