kernel-packages team mailing list archive
-
kernel-packages team
-
Mailing list archive
-
Message #105232
[Bug 1424275] Re: Ubuntu 15.04 (non-virtualized): System hung after kdump is trigger from Power NV system
** Tags added: kernel-da-key
--
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1424275
Title:
Ubuntu 15.04 (non-virtualized): System hung after kdump is trigger
from Power NV system
Status in linux package in Ubuntu:
New
Bug description:
== Comment: #0 ==
---Problem Description---
Ubuntu 15.04 (non-virtualized): System hung after kdump is trigger from Power NV system
---uname output---
ubuntu 15.04 kernel
uname -a:
Linux lep8d 3.18.0-12-generic #13-Ubuntu SMP Thu Jan 29 13:44:26 UTC 2015 ppc64le ppc64le ppc64le GNU/Linux
Machine Type = Power 8 NV
Steps to reproduce:
1) root@lep8d:~# cat /proc/cmdline
root=UUID=0d90e79d-3d3f-4d8e-9de9-af000b6f285d ro quiet splash crashkernel=4096M
2) update-grub
3) reboot
4) root@lep8d:~# kdump-config show
USE_KDUMP: 1
KDUMP_SYSCTL: kernel.panic_on_oops=1
KDUMP_COREDIR: /var/crash
crashkernel addr:
current state: ready to kdump
kexec command:
/sbin/kexec -p --args-linux --command-line="root=UUID=0d90e79d-3d3f-4d8e-9de9-af000b6f285d ro quiet splash irqpoll maxcpus=1 nousb systemd.unit=kdump-tools.service" --initrd=/boot/initrd.img-3.18.0-12-generic /boot/vmlinux-3.18.0-12-generic
root@lep8d:~#
5) root@lep8d:/var/crash# echo 'c' | sudo tee /proc/sysrq-trigger
[ 300.997178] SysRq : Trigger a crash
[ 300.997221] Unable to handle kernel paging request for data at address 0x00000000
[ 300.997264] Faulting instruction address: 0xc0000000005f9314
[ 300.997328] Oops: Kernel access of bad area, sig: 11 [#1]
[ 300.997378] SMP NR_CPUS=2048 NUMA PowerNV
[ 300.997430] Modules linked in: uio_pdrv_genirq uio powernv_rng ses enclosure lpfc ipr scsi_transport_fc
[ 300.997595] CPU: 64 PID: 2219 Comm: tee Not tainted 3.18.0-12-generic #13-Ubuntu
[ 300.997669] task: c000001fdf647e60 ti: c000001fdf698000 task.ti: c000001fdf698000
[ 300.997742] NIP: c0000000005f9314 LR: c0000000005fa3ac CTR: c0000000005f92e0
[ 300.997816] REGS: c000001fdf69b9d0 TRAP: 0300 Not tainted (3.18.0-12-generic)
[ 300.997889] MSR: 9000000000009033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 28004022 XER: 20000000
[ 300.998075] CFAR: c0000000000084d8 DAR: 0000000000000000 DSISR: 42000000 SOFTE: 1
GPR00: c0000000005fa3ac c000001fdf69bc50 c000000001426990 0000000000000063
GPR04: c00000010c0083f8 c00000010c018db0 0000000000000599 c0000000041dfdf0
GPR08: c000000000e76990 0000000000000001 0000000000000000 0000000000000001
GPR12: c0000000005f92e0 c000000007b44000 0000000000000000 0000000000000000
GPR16: 0000000000000000 0000000000000000 0000000000000000 0000000000000000
GPR20: 0000000000000000 0000000000000000 0000000010009dd8 0000000000000001
GPR24: 0000000010009dd8 00003fffc9d48270 c000000001375708 0000000000000004
GPR28: 0000000000000063 c00000000133f598 c000000001375ac8 0000000000000000
[ 300.999053] NIP [c0000000005f9314] sysrq_handle_crash+0x34/0x50
[ 300.999116] LR [c0000000005fa3ac] __handle_sysrq+0xec/0x280
[ 300.999166] Call Trace:
[ 300.999192] [c000001fdf69bc50] [c000001fdf69bd90] 0xc000001fdf69bd90 (unreliable)
[ 300.999279] [c000001fdf69bc70] [c0000000005fa3ac] __handle_sysrq+0xec/0x280
[ 300.999354] [c000001fdf69bd10] [c0000000005faba8] write_sysrq_trigger+0x78/0xa0
[ 300.999443] [c000001fdf69bd40] [c000000000345b90] proc_reg_write+0xb0/0x110
[ 300.999519] [c000001fdf69bd90] [c0000000002b96ac] vfs_write+0xdc/0x260
[ 300.999594] [c000001fdf69bde0] [c0000000002ba24c] SyS_write+0x6c/0x110
[ 300.999669] [c000001fdf69be30] [c00000000000927c] syscall_exit+0x0/0x7c
[ 300.999768] Instruction dump:
[ 300.999839] 3842d6b0 7c0802a6 f8010010 f821ffe1 60000000 60000000 3d22001b 39491bdc
[ 301.000068] 39200001 912a0000 7c0004ac 39400000 <992a0000> 38210020 e8010010 7c0803a6
[ 301.000300] ---[ end trace 93f659e78f1526bc ]---
[ 301.000390]
[ 301.000484] Sending IPI to other CPUs
[ 301.001675] IPI complete
-> smp_release_cpus()
spinning_secondaries = 159
<- smp_release_cpus()
<- setup_system()
[ 1.828216] /build/buildd/linux-3.18.0/drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
starting version 218
[ 5.230190] lpfc 0001:03:00.0: 0:1303 Link Up Event x1 received Data: x1 x1 x10 x0 x0 x0 0
root@lep8d:~# dpkg --list | grep -i kdump
ii kdump-tools 1:1.5.7-4 all scripts and tools for automating kdump (Linux crash dumps)
root@lep8d:~#
root@lep8d:~# dpkg --list | grep -i linux-crashdump
ii linux-crashdump 3.18.0.12.12 ppc64el Linux kernel crashdump setup for the latest generic kernel
root@lep8d:~#
============================
Steps to reproduce:
1) Installed kdump tools and linux-crashdump
root@lep8d:~# dpkg --list | grep -i dump
ii kdump-tools 1:1.5.7-4 all scripts and tools for automating kdump (Linux crash dumps)
ii linux-crashdump 3.18.0.12.12 ppc64el Linux kernel crashdump setup for the latest generic kernel
2) Enable kdump
USE_KDUMP=1
KDUMP_COREDIR="/var/crash/Feb8"
3) root@lep8d:~# kdump-config status
current state : ready to kdump
root@lep8d:~#
4) root@lep8d:~# cat /proc/cmdline
root=UUID=0d90e79d-3d3f-4d8e-9de9-af000b6f285d ro quiet splash crashkernel=4096M
root@lep8d:~#
5) root@lep8d:~# kdump-config show
USE_KDUMP: 1
KDUMP_SYSCTL: kernel.panic_on_oops=1
KDUMP_COREDIR: /var/crash
crashkernel addr:
current state: ready to kdump
kexec command:
/sbin/kexec -p --args-linux --command-line="root=UUID=0d90e79d-3d3f-4d8e-9de9-af000b6f285d ro quiet splash irqpoll maxcpus=1 nousb systemd.unit=kdump-tools.service" --initrd=/boot/initrd.img-3.18.0-12-generic /boot/vmlinux-3.18.0-12-generic
6) root@lep8d:~# cat /proc/sys/kernel/sysrq
176
7) trigger dump
root@lep8d:~# echo c > /proc/sysrq-trigger
[ 702.915680] SysRq : Trigger a crash
[ 702.915743] Unable to handle kernel paging request for data at address 0x00000000
[ 702.915816] Faulting instruction address: 0xc0000000005f9314
[ 702.915879] Oops: Kernel access of bad area, sig: 11 [#1]
[ 702.915927] SMP NR_CPUS=2048 NUMA PowerNV
[ 702.915978] Modules linked in: uio_pdrv_genirq uio powernv_rng ses enclosure lpfc ipr scsi_transport_fc
[ 702.916142] CPU: 12 PID: 2045 Comm: bash Not tainted 3.18.0-12-generic #13-Ubuntu
[ 702.916215] task: c000000fdc499030 ti: c000000fdc54c000 task.ti: c000000fdc54c000
[ 702.916288] NIP: c0000000005f9314 LR: c0000000005fa3ac CTR: c0000000005f92e0
[ 702.916360] REGS: c000000fdc54f9d0 TRAP: 0300 Not tainted (3.18.0-12-generic)
[ 702.916432] MSR: 9000000000009033 <SF,HV,EE,ME,IR,DR,RI,LE> CR: 28242822 XER: 20000000
[ 702.916615] CFAR: c0000000000084d8 DAR: 0000000000000000 DSISR: 42000000 SOFTE: 1
GPR00: c0000000005fa3ac c000000fdc54fc50 c000000001426990 0000000000000063
GPR04: c00000010b3083f8 c00000010b318db0 0000000000000599 c0000000041dfda0
GPR08: c000000000e76990 0000000000000001 0000000000000000 0000000000000001
GPR12: c0000000005f92e0 c000000007b26c00 0000000000000000 0000000022000000
GPR16: 000000001016d6e8 0000010032868ab8 0000000010143eb8 00000000100c9390
GPR20: 0000000000000000 000000001017b008 0000000010143d18 0000000000000000
GPR24: 0000000010156c00 0000000010178868 c000000001375708 0000000000000004
GPR28: 0000000000000063 c00000000133f598 c000000001375ac8 0000000000000000
[ 702.917579] NIP [c0000000005f9314] sysrq_handle_crash+0x34/0x50
[ 702.917640] LR [c0000000005fa3ac] __handle_sysrq+0xec/0x280
[ 702.917688] Call Trace:
[ 702.917715] [c000000fdc54fc70] [c0000000005fa3ac] __handle_sysrq+0xec/0x280
[ 702.917789] [c000000fdc54fd10] [c0000000005faba8] write_sysrq_trigger+0x78/0xa0
[ 702.917876] [c000000fdc54fd40] [c000000000345b90] proc_reg_write+0xb0/0x110
[ 702.917951] [c000000fdc54fd90] [c0000000002b96ac] vfs_write+0xdc/0x260
[ 702.918025] [c000000fdc54fde0] [c0000000002ba24c] SyS_write+0x6c/0x110
[ 702.918099] [c000000fdc54fe30] [c00000000000927c] syscall_exit+0x0/0x7c
[ 702.918171] Instruction dump:
[ 702.918208] 3842d6b0 7c0802a6 f8010010 f821ffe1 60000000 60000000 3d22001b 39491bdc
[ 702.918330] 39200001 912a0000 7c0004ac 39400000 <992a0000> 38210020 e8010010 7c0803a6
[ 702.918455] ---[ end trace de0be178fae811c7 ]---
[ 702.918504]
[ 702.918548] Sending IPI to other CPUs
[ 702.919695] IPI complete
-> smp_release_cpus()
spinning_secondaries = 159
<- smp_release_cpus()
<- setup_system()
[ 1.790146] /build/buildd/linux-3.18.0/drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
starting version 218
[ 5.119894] lpfc 0001:03:00.0: 0:1303 Link Up Event x1 received Data: x1 x1 x10 x0 x0 x0 0
[ 5.682631] blk_update_request: I/O error, dev sda, sector 0
[ 5.682740] Buffer I/O error on dev sda, logical block 0, async page read
[ 6.232603] blk_update_request: I/O error, dev sda, sector 0
[ 6.232713] Buffer I/O error on dev sda, logical block 0, async page read
[ 6.749296] blk_update_request: I/O error, dev sda, sector 0
[ 6.749404] Buffer I/O error on dev sda, logical block 0, async page read
[ 7.265979] blk_update_request: I/O error, dev sda, sector 0
[ 7.266089] Buffer I/O error on dev sda, logical block 0, async page read
[ 7.556728] lpfc 0001:03:00.1: 1:1303 Link Up Event x1 received Data: x1 x1 x10 x0 x0 x0 0
[ 7.773261] blk_update_request: I/O error, dev sda, sector 0
[ 7.773370] Buffer I/O error on dev sda, logical block 0, async page read
[ 8.149349] blk_update_request: I/O error, dev sdc, sector 0
[ 8.149458] Buffer I/O error on dev sdc, logical block 0, async page read
[ 8.282652] blk_update_request: I/O error, dev sda, sector 0
[ 8.282760] Buffer I/O error on dev sda, logical block 0, async page read
[ 8.666008] blk_update_request: I/O error, dev sdc, sector 0
[ 8.666117] Buffer I/O error on dev sdc, logical block 0, async page read
[ 8.799327] blk_update_request: I/O error, dev sda, sector 0
[ 8.799435] Buffer I/O error on dev sda, logical block 0, async page read
[ 9.182675] blk_update_request: I/O error, dev sdc, sector 0
[ 9.182783] Buffer I/O error on dev sdc, logical block 0, async page read
[ 10.732722] blk_update_request: I/O error, dev sdc, sector 0
[ 10.732833] Buffer I/O error on dev sdc, logical block 0, async page read
[ 10.866031] blk_update_request: I/O error, dev sda, sector 0
[ 10.866138] Buffer I/O error on dev sda, logical block 0, async page read
[ 11.249409] blk_update_request: I/O error, dev sdc, sector 0
[ 11.249516] Buffer I/O error on dev sdc, logical block 0, async page read
[ 11.382717] blk_update_request: I/O error, dev sda, sector 0
[ 11.382824] Buffer I/O error on dev sda, logical block 0, async page read
[ 11.382900] Dev sda: unable to read RDB block 0
[ 11.766089] blk_update_request: I/O error, dev sdc, sector 0
[ 11.766197] Buffer I/O error on dev sdc, logical block 0, async page read
[ 11.899388] blk_update_request: I/O error, dev sda, sector 0
[ 11.899496] Buffer I/O error on dev sda, logical block 0, async page read
[ 12.282748] blk_update_request: I/O error, dev sdc, sector 0
[ 12.282857] Buffer I/O error on dev sdc, logical block 0, async page read
[ 12.416063] blk_update_request: I/O error, dev sda, sector 0
[ 12.416171] Buffer I/O error on dev sda, logical block 0, async page read
[ 12.799430] blk_update_request: I/O error, dev sdc, sector 0
[ 12.799538] Buffer I/O error on dev sdc, logical block 0, async page read
[ 12.932746] blk_update_request: I/O error, dev sda, sector 0
[ 12.932854] Buffer I/O error on dev sda, logical block 0, async page read
[ 13.832806] Dev sdc: unable to read RDB block 0
[ 15.899500] blk_update_request: I/O error, dev sdc, sector 0
[ 15.899613] Buffer I/O error on dev sdc, logical block 0, async page read
[ 16.416167] blk_update_request: I/O error, dev sdc, sector 0
[ 16.416275] Buffer I/O error on dev sdc, logical block 0, async page read
[ 16.932859] blk_update_request: I/O error, dev sdc, sector 230686592
[ 17.449539] blk_update_request: I/O error, dev sdc, sector 230686592
[ 17.449648] Buffer I/O error on dev sdc, logical block 1802239, async page read
[ 28.268922] sd 0:2:0:0: [sde] Assuming drive cache: write through
[ 28.284920] sd 0:2:1:0: [sdf] Assuming drive cache: write through
[ 28.405468] scsi 1:0:0:31: Failed to get diagnostic page 0x8000002
[ 28.405580] scsi 1:0:0:31: Failed to bind enclosure -19
[ 28.412464] scsi 1:0:1:31: Failed to get diagnostic page 0x8000002
[ 28.412563] scsi 1:0:1:31: Failed to bind enclosure -19
[ 28.419518] scsi 2:0:0:31: Failed to get diagnostic page 0x8000002
[ 28.419629] scsi 2:0:0:31: Failed to bind enclosure -19
[ 28.425512] scsi 2:0:1:31: Failed to get diagnostic page 0x8000002
[ 28.425636] scsi 2:0:1:31: Failed to bind enclosure -19
[-1;-1f[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[ 63.081279] reboot: Restarting system
[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbuntu 15.04[-1;-1f. . . .[-1;-1fUbun
System does not get rebooted and console just stay at above screen
message and I need to do reboot from ASMI console to get system booted
up.
After reboot I don't see crash file
root@lep8d:~# cd /var/crash/Feb8/
root@lep8d:/var/crash/Feb8# ls
197001010530 kexec_cmd
root@lep8d:/var/crash/Feb8/197001010530# ls
dmesg.197001010530 dump.197001010530
root@lep8d:/var/crash/Feb8/197001010530# dmesg | grep -i crash
[ 0.000000] Reserving 4096MB of memory at 128MB for crashkernel (System RAM: 196608MB)
[ 0.000000] Kernel command line: root=UUID=0d90e79d-3d3f-4d8e-9de9-af000b6f285d ro quiet splash crashkernel=4096M
== Comment: #3 ==
The problem still can be recreate with latest daily ISO builds.
After a dump, system does not reboot automatically. One has to force a
reboot.
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1424275/+subscriptions