group.of.nepali.translators team mailing list archive
-
group.of.nepali.translators team
-
Mailing list archive
-
Message #16428
[Bug 1708399] Re: kernel panic -not syncing: Fatal exception: panic_on_oops
** Changed in: linux (Ubuntu)
Status: Fix Committed => Fix Released
** Changed in: ubuntu-z-systems
Status: Fix Committed => Fix Released
--
You received this bug notification because you are a member of नेपाली
भाषा समायोजकहरुको समूह, which is subscribed to Xenial.
Matching subscriptions: Ubuntu 16.04 Bugs
https://bugs.launchpad.net/bugs/1708399
Title:
kernel panic -not syncing: Fatal exception: panic_on_oops
Status in Ubuntu on IBM z Systems:
Fix Released
Status in linux package in Ubuntu:
Fix Released
Status in linux source package in Xenial:
Fix Released
Status in linux source package in Zesty:
Fix Released
Bug description:
SRU justification:
Impact: A race in context flushing is causing a kernel panic on the
s390x architecture.
Fix: Using a set of 3 patches (all restricted to arch code), one
already upstream and the other 2 pending on linux-next. Regression
risk should be low (limited to arch code and tested).
Testcase: see below
---
== Comment: #0 - QI YE <yeqi@xxxxxxxxxx> - 2017-08-02 04:11:25 ==
---Problem Description---
Ubuntu got kernel panic
---uname output---
#110-Ubuntu SMP Tue Jul 18 12:56:43 UTC 2017 s390x s390x s390x GNU/Linux
---Debugger Data---
PID: 10991 TASK: 19872a0e8 CPU: 2 COMMAND: "hyperkube"
LOWCORE INFO:
-psw : 0x0004c00180000000 0x0000000000115fa6
-function : pcpu_delegate at 115fa6
-prefix : 0x7fe42000
-cpu timer: 0x7ffab2827828aa50
-clock cmp: 0xd2eb8b31445e4200
-general registers:
0x0004e00100000000 0x00000000001283b6
0x0000c00100000000 0x000000008380fcb8
0x0000000000115f9e 0x000000000056f6e2
0x0000000000000004 0x0000000000cf9070
0x00000001f3bfc000 0x0000000000112fd8
0x00000001c72bb400 0x0000000000000002
0x000000007fffc000 0x00000000007c9ef0
0x0000000000115f9e 0x000000008380fc18
-access registers:
0x000003ff 0x7ffff910 0000000000 0000000000
0000000000 0000000000 0000000000 0000000000
0000000000 0000000000 0000000000 0000000000
0000000000 0000000000 0000000000 0000000000
-control registers:
0x0000000014066a12 0x000000007e6d81c7
0x0000000000011140 000000000000000000
0x0000000000002aef 0x0000000000000400
0x0000000050000000 0x000000007e6d81c7
000000000000000000 000000000000000000
000000000000000000 000000000000000000
000000000000000000 0x0000000000cfc007
0x00000000db000000 0x0000000000011280
-floating point registers:
0x409c7e2580000000 0x401de4e000000000
000000000000000000 0x3fd24407ab0e073a
0x3ff0000000000000 0x3fee666666666666
0x3fef218f8a7a41a0 0x3fee666666666666
0x0000000000800000 000000000000000000
0x000003ff7f800000 0x000002aa4940e9e0
0x000000000000d401 0x000003ffe81fe110
000000000000000000 0x000003fff2cfe638
#0 [8380fc78] smp_find_processor_id at 1160f8
#1 [8380fc90] machine_kexec at 1135d4
#2 [8380fcb8] crash_kexec at 1fbb8a
#3 [8380fd88] panic at 27d0e0
#4 [8380fe28] die at 1142cc
#5 [8380fe90] do_low_address at 12215e
#6 [8380fea8] pgm_check_handler at 7c2ab4
PSW: 0705200180000000 000002aa267e0e42 (user space)
GPRS: 0000000000000000 0000000000000000 000002aa2c4fd690 0000000000000001
000002aa2c4fd690 000003ff7fffee38 0000000000000000 0000000000000002
0000000000029c0f 000000c42001ea00 0000000000000001 0000000000000001
000000c42001c5c8 000000c42082c1a0 000002aa2666325e 000003ff7fffed90
Contact Information = Chee Ye / yeqi@xxxxxxxxxx
Stack trace output:
no
Oops output:
[43200.761465] docker0: port 10(vethb9132e9) entered forwarding state
[50008.560926] hrtimer: interrupt took 1698076 ns
[123483.768984] systemd[1]: apt-daily.timer: Adding 7h 34min 22.582204s random time.
[123483.930058] systemd[1]: apt-daily.timer: Adding 2h 18min 14.857162s random time.
[123484.064879] systemd[1]: apt-daily.timer: Adding 10h 46min 2.301756s random time.
[123484.824760] systemd[1]: apt-daily.timer: Adding 6h 16min 22.178655s random time.
[153113.703126] conntrack: generic helper won't handle protocol 47. Please consider loading the specific helper module.
[477085.704538] Low-address protection: 0004 ilc:2 [#1] SMP
[477085.704551] Modules linked in: xt_physdev veth xt_recent xt_comment xt_mark xt_nat ipt_MASQUERADE nf_nat_masquerade_ipv4 xfrm_user xfrm_algo iptable_nat nf_nat_ipv4 xt_addrtype nf_nat br_netfilter bridge stp llc aufs ipt_REJECT nf_reject_ipv4 xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack iptable_filter ip_tables x_tables ghash_s390 prng aes_s390 des_s390 des_generic sha512_s390 qeth_l2 sha256_s390 qeth sha1_s390 qdio sha_common ccwgroup vmur dasd_eckd_mod dasd_mod
[477085.705522] CPU: 2 PID: 10991 Comm: hyperkube Not tainted 4.4.0-87-generic #110-Ubuntu
[477085.705525] task: 000000019872a0e8 ti: 000000008380c000 task.ti: 000000008380c000
[477085.705529] User PSW : 0705200180000000 000002aa267e0e42
[477085.705532] R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:1 AS:0 CC:2 PM:0 EA:3
User GPRS: 0000000000000000 0000000000000000 000002aa2c4fd690 0000000000000001
[477085.705539] 000002aa2c4fd690 000003ff7fffee38 0000000000000000 0000000000000002
[477085.705553] 0000000000029c0f 000000c42001ea00 0000000000000001 0000000000000001
[477085.705554] 000000c42001c5c8 000000c42082c1a0 000002aa2666325e 000003ff7fffed90
[477085.705578] User Code: 000002aa267e0e30: e340f0080004 lg %r4,8(%r15)
000002aa267e0e36: e330f0100014 lgf %r3,16(%r15)
#000002aa267e0e3c: e36040000014 lgf %r6,0(%r4)
>000002aa267e0e42: ba634000 cs %r6,%r3,0(%r4)
000002aa267e0e46: a774fffe brc 7,2aa267e0e42
000002aa267e0e4a: e360f0180050 sty %r6,24(%r15)
000002aa267e0e50: 07fe bcr 15,%r14
000002aa267e0e52: 0000 unknown
[477085.705596] Last Breaking-Event-Address:
[477085.705599] [<000002aa26663258>] 0x2aa26663258
[477085.705600]
[477085.705602] Kernel panic - not syncing: Fatal exception: panic_on_oops
System Dump Location:
There are 4 vCPU defined. I can see hyperkube executed on two CPUs and then got kernel panic. It may be related to the TLB entry flush on the two CPUs.
CPU 0 RUNQUEUE: 1ea5a8c00
CURRENT: PID: 0 TASK: bb1528 COMMAND: "swapper/0"
RT PRIO_ARRAY: 1ea5a8db0
[no tasks queued]
CFS RB_ROOT: 1ea5a8c98
[no tasks queued]
CPU 1 RUNQUEUE: 1ea5b9c00
CURRENT: PID: 0 TASK: 1e94162b8 COMMAND: "swapper/1"
RT PRIO_ARRAY: 1ea5b9db0
[no tasks queued]
CFS RB_ROOT: 1ea5b9c98
[120] PID: 23421 TASK: 1c9368af8 COMMAND: "PipelineService"
[120] PID: 10957 TASK: 1987336d8 COMMAND: "hyperkube"
CPU 2 RUNQUEUE: 1ea5cac00
CURRENT: PID: 10991 TASK: 19872a0e8 COMMAND: "hyperkube"
RT PRIO_ARRAY: 1ea5cadb0
[no tasks queued]
CFS RB_ROOT: 1ea5cac98
[no tasks queued]
CPU 3 RUNQUEUE: 1ea5dbc00
CURRENT: PID: 10975 TASK: 198a30000 COMMAND: "hyperkube"
RT PRIO_ARRAY: 1ea5dbdb0
[no tasks queued]
CFS RB_ROOT: 1ea5dbc98
[120] PID: 21614 TASK: 1cbee57c0 COMMAND: "IngestServiceCl"
== Comment: #1 - QI YE <yeqi@xxxxxxxxxx> - 2017-08-02 04:20:02 ==
The problem happened randomly. Not pattern has been figured out yet.
It also happens on below kernel levels.
- 4.4.0-78-generic #99
- 4.4.0-83-generic
== Comment: #2 - Heinz-Werner Seeck <heinz-werner_seeck@xxxxxxxxxx> - 2017-08-02 08:25:06 ==
@QI YE: Please provide the use case of this problem report. And add dumps and dbginfo , sosreports as attachment. For me it is not clear which use case this problems generates.
Many thanks in advance
== Comment: #3 - QI YE <yeqi@xxxxxxxxxx> - 2017-08-02 08:44:01 ==
(In reply to comment #2)
> @QI YE: Please provide the use case of this problem report. And add dumps
> and dbginfo , sosreports as attachment. For me it is not clear which use
> case this problems generates.
> Many thanks in advance
Heinz-Werner, what do you mean by "use case"? Could you elaborate it?
If you are referring to what application caused this problem. We have
machine learning running on Ubuntu on the IBM Z community cloud.
The dump file is big, any suggestion of the location to upload the
dump file?
== Comment: #4 - QI YE <yeqi@xxxxxxxxxx> - 2017-08-02 08:50:32 ==
sosreport
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-z-systems/+bug/1708399/+subscriptions