← Back to team overview

kernel-packages team mailing list archive

[Bug 1301496] Re: kernel crash: Unable to handle kernel paging request for data

 

regarding stack 4k page kernel, this just happened on wolfe-02. running (I believe) a 4k page kernel.
$ grep CONFIG_PPC.*.*PAGES /boot/config-3.13.0-8-generic 
CONFIG_PPC_4K_PAGES=y
# CONFIG_PPC_64K_PAGES is not set


wolfe-02 login: [241848.101690] Unable to handle kernel paging request for data at address 0x2001400000044
[241848.112613] Faulting instruction address: 0xc000000000954b60
[241848.112704] Oops: Kernel access of bad area, sig: 11 [#1]
[241848.112777] SMP NR_CPUS=2048 NUMA pSeries
[241848.112871] Modules linked in: btrfs xor raid6_pq libcrc32c veth xt_CHECKSUM iptable_mangle ipt_MASQUERADE iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack xt_tcpudp bridge stp llc iptable_filter ip_tables x_tables dm_crypt
[241848.113353] CPU: 1 PID: 10355 Comm: kworker/u4:0 Not tainted 3.13.0-8-generic #28-Ubuntu
[241848.113465] Workqueue: netns .cleanup_net
[241848.113540] task: c0000002192621f0 ti: c000000253eec000 task.ti: c000000253eec000
[241848.113655] NIP: c000000000954b60 LR: c000000000954b68 CTR: c000000000954b00
[241848.113768] REGS: c000000253eef760 TRAP: 0300   Not tainted  (3.13.0-8-generic)
[241848.113871] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 44002024  XER: 00000000
[241848.114117] CFAR: c0000000001d2820 DAR: 0002001400000044 DSISR: 40000000 SOFTE: 1
GPR00: c000000000954b68 c000000253eef9e0 c0000000010b0dd0 0002001400000044
GPR04: f00000000b7698d8 c00000034674d900 c000000000954b68 c0000003fe023508
GPR08: 0000000000010000 c0000002536a0000 000000000000000e 0000000000000001
GPR12: 0000000044002028 c00000000fe80300 c0000000000c3f00 c000000363207c40
GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
GPR20: 0000000000000000 0000000000000000 0000000000000001 c000000000f630fc
GPR24: 0000000000000001 fffffffffffffef7 0000000000000000 c000000000f58638
GPR28: 0000000000000001 c0000003506f3900 0000000000002000 0000000000000000
[241848.115585] NIP [c000000000954b60] .tcp_net_metrics_exit+0x60/0x110
[241848.115675] LR [c000000000954b68] .tcp_net_metrics_exit+0x68/0x110
[241848.115762] Call Trace:
[241848.115805] [c000000253eef9e0] [c000000000954b68] .tcp_net_metrics_exit+0x68/0x110 (unreliable)
[241848.115945] [c000000253eefa70] [c0000000008cc49c] .ops_exit_list.isra.2+0x6c/0xd0
[241848.116078] [c000000253eefb00] [c0000000008ccef0] .cleanup_net+0x150/0x250
[241848.116198] [c000000253eefbc0] [c0000000000b9e28] .process_one_work+0x1a8/0x4d0
[241848.116320] [c000000253eefc60] [c0000000000baaf0] .worker_thread+0x180/0x4a0
[241848.116429] [c000000253eefd30] [c0000000000c4010] .kthread+0x110/0x130
[241848.116538] [c000000253eefe30] [c00000000000a160] .ret_from_kernel_thread+0x5c/0x7c
[241848.116659] Instruction dump:
[241848.116731] 7d295030 2f890000 e93d0288 419e0058 3bc00000 3b800001 60000000 60420000
[241848.116920] 7bc81f24 7c69402a 2fa30000 419e0024 <ebe30000> 4b8b809d 60000000 2fbf0000
[241848.117115] ---[ end trace 531dcfc8ed4b2948 ]---
[241848.124367]
[241848.129853] Unable to handle kernel paging request for data at address 0xffffffffffffffd8
[241848.129968] Faulting instruction address: 0xc0000000000c49c0
[241848.130056] Oops: Kernel access of bad area, sig: 11 [#2]
[241848.130128] SMP NR_CPUS=2048 NUMA pSeries
[241848.130220] Modules linked in: btrfs xor raid6_pq libcrc32c veth xt_CHECKSUM iptable_mangle ipt_MASQUERADE iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack xt_tcpudp bridge stp llc iptable_filter ip_tables x_tables dm_crypt
[241848.130695] CPU: 0 PID: 10355 Comm: kworker/u4:0 Tainted: G      D      3.13.0-8-generic #28-Ubuntu
[241848.130833] task: c0000002192621f0 ti: c000000253eec000 task.ti: c000000253eec000
[241848.130938] NIP: c0000000000c49c0 LR: c0000000000bb278 CTR: c0000000000ded80
[241848.131042] REGS: c000000253eeeed0 TRAP: 0300   Tainted: G      D       (3.13.0-8-generic)
[241848.131145] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 42004028  XER: 00000000
[241848.131391] CFAR: 00003fff90fa7250 DAR: ffffffffffffffd8 DSISR: 40000000 SOFTE: 0
GPR00: c0000000000bb278 c000000253eef150 c0000000010b0dd0 c0000002192621f0
GPR04: 0000000000000000 0000000000000001 0000000000000800 0000000000000000
GPR08: 0000000000000001 0000000000000000 0000000000000001 0000000000000018
GPR12: 0000000022004088 c00000000fe80000 000000000075a000 0000000000000001
GPR16: 0000000000000000 c000000000eee780 c000000000096c48 c00000000110df34
GPR20: c000000000eee780 0000000000000001 0000000000000000 0000000000000004
GPR24: c000000000eee780 c000000001109a60 c000000253eec000 c000000000eee780
GPR28: c000000219262690 0000000000000000 c0000002192621f0 c0000002192621f0
[241848.132789] NIP [c0000000000c49c0] .kthread_data+0x20/0x40
[241848.132861] LR [c0000000000bb278] .wq_worker_sleeping+0x28/0xf0
[241848.132947] Call Trace:
[241848.132988] [c000000253eef150] [c0000002192632d4] 0xc0000002192632d4 (unreliable)
[241848.133109] [c000000253eef1d0] [c0000000000bb278] .wq_worker_sleeping+0x28/0xf0
[241848.133231] [c000000253eef260] [c000000000a14a7c] .__schedule+0x65c/0x8c0
[241848.133338] [c000000253eef4e0] [c000000000096c48] .do_exit+0x738/0xb30
[241848.133443] [c000000253eef5d0] [c000000000021c60] .die+0x2f0/0x450
[241848.133550] [c000000253eef670] [c0000000000474e0] .bad_page_fault+0xe0/0x130
[241848.133656] [c000000253eef6f0] [c000000000009284] handle_page_fault+0x2c/0x30
[241848.133780] --- Exception: 300 at .tcp_net_metrics_exit+0x60/0x110
[241848.133780]     LR = .tcp_net_metrics_exit+0x68/0x110
[241848.133938] [c000000253eefa70] [c0000000008cc49c] .ops_exit_list.isra.2+0x6c/0xd0
[241848.134057] [c000000253eefb00] [c0000000008ccef0] .cleanup_net+0x150/0x250
[241848.134161] [c000000253eefbc0] [c0000000000b9e28] .process_one_work+0x1a8/0x4d0
[241848.134280] [c000000253eefc60] [c0000000000baaf0] .worker_thread+0x180/0x4a0
[241848.134385] [c000000253eefd30] [c0000000000c4010] .kthread+0x110/0x130
[241848.134489] [c000000253eefe30] [c00000000000a160] .ret_from_kernel_thread+0x5c/0x7c
[241848.134607] Instruction dump:
[241848.134677] 4e800020 60000000 60000000 60420000 7c0802a6 fbe1fff8 f8010010 f821ff81
[241848.134850] 7c7f1b78 60000000 60000000 e93f0458 <e869ffd8> 38210080 e8010010 ebe1fff8
[241848.135026] ---[ end trace 531dcfc8ed4b2949 ]---
[241848.140660]
[241848.140706] Fixing recursive fault but reboot is needed!

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1301496

Title:
  kernel crash: Unable to handle kernel paging request for data

Status in “linux” package in Ubuntu:
  Confirmed

Bug description:
  We've seen this happen twice now on ppc64el guests that are probably
  under load.  I don't have a lot of the details on what was going on
  when they failed, but I have the stack traces.

  [101168.836780] Unable to handle kernel paging request for data at address 0x00010001
  [101168.836886] Faulting instruction address: 0xc000000000954b60
  [101168.836934] Oops: Kernel access of bad area, sig: 11 [#1]
  [101168.836971] SMP NR_CPUS=2048 NUMA pSeries
  [101168.837020] Modules linked in: veth xt_CHECKSUM iptable_mangle ipt_MASQUERADE iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack xt_tcpudp bridge stp llc iptable_filter ip_tables x_tables dm_crypt
  [101168.837234] CPU: 1 PID: 19760 Comm: kworker/u4:0 Not tainted 3.13.0-8-generic #28-Ubuntu
  [101168.837294] Workqueue: netns .cleanup_net
  [101168.837332] task: c0000003f99d43e0 ti: c0000001cce44000 task.ti: c0000001cce44000
  [101168.837386] NIP: c000000000954b60 LR: c000000000954b68 CTR: c000000000954b00
  [101168.837439] REGS: c0000001cce47760 TRAP: 0300   Not tainted  (3.13.0-8-generic)
  [101168.837493] MSR: 8000000000009033 <SF,EE,ME,IR,DR,RI,LE>  CR: 24002024  XER: 00000000
  [101168.837620] CFAR: 000000001063ea4c DAR: 0000000000010001 DSISR: 40000000 SOFTE: 1
  GPR00: c000000000954b68 c0000001cce479e0 c0000000010b0dd0 0000000000010001
  GPR04: f0000000099918f0 c0000002be072380 c000000000954b68 c0000003fe023508
  GPR08: 0000000000010000 c000000209fc0000 000000000000000e 0000000000000001
  GPR12: 0000000044002028 c00000000fe80300 c0000000000c3f00 c0000002be1e8bc0
  GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
  GPR20: 0000000000000000 0000000000000000 0000000000000001 c000000000f630fc
  GPR24: 0000000000000001 fffffffffffffef7 0000000000000000 c000000000f58638
  GPR28: 0000000000000001 c0000003fbdc0000 0000000000002000 0000000000000000
  [101168.838355] NIP [c000000000954b60] .tcp_net_metrics_exit+0x60/0x110
  [101168.838402] LR [c000000000954b68] .tcp_net_metrics_exit+0x68/0x110
  [101168.838448] Call Trace:
  [101168.838469] [c0000001cce479e0] [c000000000954b68] .tcp_net_metrics_exit+0x68/0x110 (unreliable)
  [101168.838542] [c0000001cce47a70] [c0000000008cc49c] .ops_exit_list.isra.2+0x6c/0xd0
  [101168.838605] [c0000001cce47b00] [c0000000008ccef0] .cleanup_net+0x150/0x250
  [101168.838662] [c0000001cce47bc0] [c0000000000b9e28] .process_one_work+0x1a8/0x4d0
  [101168.838726] [c0000001cce47c60] [c0000000000baaf0] .worker_thread+0x180/0x4a0
  [101168.838783] [c0000001cce47d30] [c0000000000c4010] .kthread+0x110/0x130
  [101168.838841] [c0000001cce47e30] [c00000000000a160] .ret_from_kernel_thread+0x5c/0x7c
  [101168.838903] Instruction dump:
  [101168.838940] 7d295030 2f890000 e93d0288 419e0058 3bc00000 3b800001 60000000 60420000
  [101168.839031] 7bc81f24 7c69402a 2fa30000 419e0024 <ebe30000> 4b8b809d 60000000 2fbf0000
  [101168.839127] ---[ end trace fb028b2b5c006a6a ]---
  --- 
  AlsaDevices: Error: command ['ls', '-l', '/dev/snd/'] failed with exit code 2: ls: cannot access /dev/snd/: No such file or directory
  AplayDevices: Error: [Errno 2] No such file or directory
  ApportVersion: 2.14-0ubuntu1
  Architecture: ppc64el
  ArecordDevices: Error: [Errno 2] No such file or directory
  CRDA: Error: [Errno 2] No such file or directory
  DistroRelease: Ubuntu 14.04
  Lspci:
   
  Lsusb: Error: command ['lsusb'] failed with exit code 1: unable to initialize libusb: -99
  Package: linux (not installed)
  PciMultimedia:
   
  ProcEnviron:
   TERM=xterm
   PATH=(custom, no user)
   XDG_RUNTIME_DIR=<set>
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB:
   
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinux-3.13.0-19-generic root=UUID=19eaa2f9-0f24-49b9-ba48-24879242481c ro console=hvc0 earlyprintk
  ProcVersionSignature: User Name 3.13.0-19.40-generic 3.13.6
  RelatedPackageVersions:
   linux-restricted-modules-3.13.0-19-generic N/A
   linux-backports-modules-3.13.0-19-generic  N/A
   linux-firmware                             N/A
  RfKill: Error: [Errno 2] No such file or directory
  Tags:  trusty uec-images
  Uname: Linux 3.13.0-19-generic ppc64le
  UpgradeStatus: No upgrade log present (probably fresh install)
  UserGroups: adm audio cdrom dialout dip floppy netdev plugdev sudo video
  WifiSyslog:
   
  _MarkForUpload: True

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1301496/+subscriptions


References