← Back to team overview

lttng team mailing list archive

[Bug 1841766] Re: ubuntu_lttng_smoke_test crash with B-5.0 AWS/Azure/GCP

 

** Description changed:

+ [Impact]
  Test will crash and hang, until timeout on jenkins.
  
  This test passed with B-4.15 AWS, and D-5.0 AWS, just not B-5.0 AWS
  
  There is no report and no obvious failure on the jenkins output (just
  marked as build aborted), this can only be found by tailing the syslog
  when you run the test.
  
  Test output:
  Running '/home/ubuntu/autotest/client/tests/ubuntu_lttng_smoke_test/ubuntu_lttng_smoke_test.sh'
  [stdout] == lttng smoke test of session create/destroy ==
  [stdout] Session test-kernel-session created.
  [stdout] Traces will be written in /tmp/lttng-kernel-trace-6325-session
  [stdout] PASSED (lttng create)
  [stdout] Session test-kernel-session destroyed
  [stdout] PASSED (lttng destroy)
  [stdout]
  [stdout] == lttng smoke test trace context switches ==
  [stdout] Session test-kernel-session created.
  [stdout] Traces will be written in /tmp/lttng-kernel-trace-6325-session
  [stdout] PASSED (lttng create)
  
  syslog output:
   kernel: [  253.282158] lttng_kretprobes: loading out-of-tree module taints kernel.
   kernel: [  253.282191] lttng_kretprobes: module verification failed: signature and/or required key missing - tainting kernel
   kernel: [  253.293676] BUG: unable to handle kernel paging request at 0000000000025534
   kernel: [  253.296613] #PF error: [normal kernel read fault]
   kernel: [  253.298623] PGD 0 P4D 0
   kernel: [  253.299783] Oops: 0000 [#1] SMP PTI
   kernel: [  253.301313] CPU: 3 PID: 955 Comm: lttng-sessiond Tainted: G           OE     5.0.0-1014-aws #16~18.04.1-Ubuntu
   kernel: [  253.305391] Hardware name: Amazon EC2 m5.xlarge/, BIOS 1.0 10/16/2017
   kernel: [  253.308102] RIP: 0010:lttng_tracepoint_notify+0x172/0x210 [lttng_tracer]
   kernel: [  253.310886] Code: eb 1a 49 39 c6 0f 85 ab 00 00 00 49 8b 55 10 41 83 c4 01 44 39 a2 8c 02 00 00 76 a3 48 8b 92 90 02 00 00 49 63 c4 4c 63 34 82 <49> 8b 1e 48 89 df e8 33 fb ff ff 48 85 c0 49 89 c7 74 52 49 8b 47
   kernel: [  253.318387] RSP: 0018:ffffb9c80232fbd8 EFLAGS: 00010246
   kernel: [  253.320596] RAX: 0000000000000000 RBX: ffff9a27cbec9c00 RCX: 0000000000000041
   kernel: [  253.323556] RDX: ffffffffc0456a0c RSI: 0000000000000001 RDI: ffffffffc0898100
   kernel: [  253.326507] RBP: ffffb9c80232fc08 R08: ffff9a27d2ba70c0 R09: ffff9a27d2403680
   kernel: [  253.329460] R10: ffffb9c80232fb48 R11: 00000000ffffffff R12: 0000000000000000
   kernel: [  253.332420] R13: ffff9a27cbec9c00 R14: 0000000000025534 R15: ffffffffc0898740
   kernel: [  253.335375] FS:  00007f26e1d49700(0000) GS:ffff9a27d2b80000(0000) knlGS:0000000000000000
   kernel: [  253.338700] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
   kernel: [  253.341118] CR2: 0000000000025534 CR3: 000000040c86c003 CR4: 00000000007606e0
   kernel: [  253.345464] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
   kernel: [  253.349798] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
   kernel: [  253.354126] PKRU: 55555554
   kernel: [  253.356719] Call Trace:
   kernel: [  253.359210]  register_tracepoint_module_notifier+0x57/0x80
   kernel: [  253.362856]  ? 0xffffffffc08d7000
   kernel: [  253.365675]  lttng_tracepoint_init+0x45/0xc40 [lttng_tracer]
   kernel: [  253.369380]  lttng_events_init+0xbc/0x233 [lttng_tracer]
   kernel: [  253.372952]  ? 0xffffffffc08d7000
   kernel: [  253.375739]  do_one_initcall+0x4a/0x1c9
   kernel: [  253.378733]  ? _cond_resched+0x19/0x40
   kernel: [  253.381693]  ? kmem_cache_alloc_trace+0x151/0x1c0
   kernel: [  253.385037]  do_init_module+0x5f/0x216
   kernel: [  253.387999]  load_module+0x19f6/0x20a0
   kernel: [  253.390956]  __do_sys_finit_module+0xfc/0x120
   kernel: [  253.394153]  ? __do_sys_finit_module+0xfc/0x120
   kernel: [  253.397416]  __x64_sys_finit_module+0x1a/0x20
   kernel: [  253.400618]  do_syscall_64+0x5a/0x120
   kernel: [  253.403547]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
   kernel: [  253.407035] RIP: 0033:0x7f26e738d839
   kernel: [  253.409949] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
   kernel: [  253.421539] RSP: 002b:00007f26e1d38758 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
   kernel: [  253.427373] RAX: ffffffffffffffda RBX: 00007f26cc004d70 RCX: 00007f26e738d839
   kernel: [  253.431692] RDX: 0000000000000000 RSI: 00007f26e8d0a145 RDI: 0000000000000030
   kernel: [  253.435978] RBP: 00007f26e8d0a145 R08: 0000000000000000 R09: 00007f26cc004b00
   kernel: [  253.440270] R10: 0000000000000030 R11: 0000000000000246 R12: 0000000000000000
   kernel: [  253.444581] R13: 00007f26cc004570 R14: 0000000000000000 R15: 00007f26cc004b00
   kernel: [  253.448853] Modules linked in: lttng_tracer(OE+) lttng_statedump(OE) lttng_kprobes(OE) lttng_clock(OE) lttng_lib_ring_buffer(OE) lttng_kretprobes(OE) ppdev parport_pc parport serio_raw sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 crypto_simd cryptd glue_helper ena
   kernel: [  253.478768] CR2: 0000000000025534
   kernel: [  253.481575] ---[ end trace 9c47de0da4f80343 ]---
   kernel: [  253.484885] RIP: 0010:lttng_tracepoint_notify+0x172/0x210 [lttng_tracer]
   kernel: [  253.489142] Code: eb 1a 49 39 c6 0f 85 ab 00 00 00 49 8b 55 10 41 83 c4 01 44 39 a2 8c 02 00 00 76 a3 48 8b 92 90 02 00 00 49 63 c4 4c 63 34 82 <49> 8b 1e 48 89 df e8 33 fb ff ff 48 85 c0 49 89 c7 74 52 49 8b 47
   kernel: [  253.500676] RSP: 0018:ffffb9c80232fbd8 EFLAGS: 00010246
   kernel: [  253.504223] RAX: 0000000000000000 RBX: ffff9a27cbec9c00 RCX: 0000000000000041
   kernel: [  253.508498] RDX: ffffffffc0456a0c RSI: 0000000000000001 RDI: ffffffffc0898100
   kernel: [  253.512783] RBP: ffffb9c80232fc08 R08: ffff9a27d2ba70c0 R09: ffff9a27d2403680
   kernel: [  253.517125] R10: ffffb9c80232fb48 R11: 00000000ffffffff R12: 0000000000000000
   kernel: [  253.521421] R13: ffff9a27cbec9c00 R14: 0000000000025534 R15: ffffffffc0898740
   kernel: [  253.525702] FS:  00007f26e1d49700(0000) GS:ffff9a27d2b80000(0000) knlGS:0000000000000000
   kernel: [  253.531708] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
   kernel: [  253.535444] CR2: 0000000000025534 CR3: 000000040c86c003 CR4: 00000000007606e0
   kernel: [  253.539722] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
   kernel: [  253.544020] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
   kernel: [  253.548300] PKRU: 55555554
+ 
+ 
+ I am proposing we backport the lttng-modules from Disco (2.10.8-1ubuntu1) to fix the issues with most recent kernel.
+ 
+ [Test Case]
+ - Install lttng-modules-dkms
+ - load lttng-tracer module
+ 
+ I have built the package on bionic and load-tested with linux 4.15 and
+ some 5.0 backports.
+ 
+ [Regression Potential] 
+ Backporting a newer version to Bionic has the potential to break compilation or users with not up-to-date kernels 4.15 kernels.

** Changed in: linux-aws-edge (Ubuntu)
       Status: New => Invalid

** Changed in: linux-aws-edge (Ubuntu Bionic)
       Status: New => Invalid

** Changed in: lttng-modules (Ubuntu)
       Status: New => In Progress

** Changed in: lttng-modules (Ubuntu Bionic)
       Status: New => In Progress

-- 
You received this bug notification because you are a member of Ubuntu
LTTng, which is subscribed to lttng-modules in Ubuntu.
Matching subscriptions: lttng-modules-bugs
https://bugs.launchpad.net/bugs/1841766

Title:
  ubuntu_lttng_smoke_test crash with B-5.0 AWS/Azure/GCP

Status in ubuntu-kernel-tests:
  New
Status in linux-aws-edge package in Ubuntu:
  Invalid
Status in lttng-modules package in Ubuntu:
  In Progress
Status in linux-aws-edge source package in Bionic:
  Invalid
Status in lttng-modules source package in Bionic:
  In Progress

Bug description:
  [Impact]
  Test will crash and hang, until timeout on jenkins.

  This test passed with B-4.15 AWS, and D-5.0 AWS, just not B-5.0 AWS

  There is no report and no obvious failure on the jenkins output (just
  marked as build aborted), this can only be found by tailing the syslog
  when you run the test.

  Test output:
  Running '/home/ubuntu/autotest/client/tests/ubuntu_lttng_smoke_test/ubuntu_lttng_smoke_test.sh'
  [stdout] == lttng smoke test of session create/destroy ==
  [stdout] Session test-kernel-session created.
  [stdout] Traces will be written in /tmp/lttng-kernel-trace-6325-session
  [stdout] PASSED (lttng create)
  [stdout] Session test-kernel-session destroyed
  [stdout] PASSED (lttng destroy)
  [stdout]
  [stdout] == lttng smoke test trace context switches ==
  [stdout] Session test-kernel-session created.
  [stdout] Traces will be written in /tmp/lttng-kernel-trace-6325-session
  [stdout] PASSED (lttng create)

  syslog output:
   kernel: [  253.282158] lttng_kretprobes: loading out-of-tree module taints kernel.
   kernel: [  253.282191] lttng_kretprobes: module verification failed: signature and/or required key missing - tainting kernel
   kernel: [  253.293676] BUG: unable to handle kernel paging request at 0000000000025534
   kernel: [  253.296613] #PF error: [normal kernel read fault]
   kernel: [  253.298623] PGD 0 P4D 0
   kernel: [  253.299783] Oops: 0000 [#1] SMP PTI
   kernel: [  253.301313] CPU: 3 PID: 955 Comm: lttng-sessiond Tainted: G           OE     5.0.0-1014-aws #16~18.04.1-Ubuntu
   kernel: [  253.305391] Hardware name: Amazon EC2 m5.xlarge/, BIOS 1.0 10/16/2017
   kernel: [  253.308102] RIP: 0010:lttng_tracepoint_notify+0x172/0x210 [lttng_tracer]
   kernel: [  253.310886] Code: eb 1a 49 39 c6 0f 85 ab 00 00 00 49 8b 55 10 41 83 c4 01 44 39 a2 8c 02 00 00 76 a3 48 8b 92 90 02 00 00 49 63 c4 4c 63 34 82 <49> 8b 1e 48 89 df e8 33 fb ff ff 48 85 c0 49 89 c7 74 52 49 8b 47
   kernel: [  253.318387] RSP: 0018:ffffb9c80232fbd8 EFLAGS: 00010246
   kernel: [  253.320596] RAX: 0000000000000000 RBX: ffff9a27cbec9c00 RCX: 0000000000000041
   kernel: [  253.323556] RDX: ffffffffc0456a0c RSI: 0000000000000001 RDI: ffffffffc0898100
   kernel: [  253.326507] RBP: ffffb9c80232fc08 R08: ffff9a27d2ba70c0 R09: ffff9a27d2403680
   kernel: [  253.329460] R10: ffffb9c80232fb48 R11: 00000000ffffffff R12: 0000000000000000
   kernel: [  253.332420] R13: ffff9a27cbec9c00 R14: 0000000000025534 R15: ffffffffc0898740
   kernel: [  253.335375] FS:  00007f26e1d49700(0000) GS:ffff9a27d2b80000(0000) knlGS:0000000000000000
   kernel: [  253.338700] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
   kernel: [  253.341118] CR2: 0000000000025534 CR3: 000000040c86c003 CR4: 00000000007606e0
   kernel: [  253.345464] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
   kernel: [  253.349798] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
   kernel: [  253.354126] PKRU: 55555554
   kernel: [  253.356719] Call Trace:
   kernel: [  253.359210]  register_tracepoint_module_notifier+0x57/0x80
   kernel: [  253.362856]  ? 0xffffffffc08d7000
   kernel: [  253.365675]  lttng_tracepoint_init+0x45/0xc40 [lttng_tracer]
   kernel: [  253.369380]  lttng_events_init+0xbc/0x233 [lttng_tracer]
   kernel: [  253.372952]  ? 0xffffffffc08d7000
   kernel: [  253.375739]  do_one_initcall+0x4a/0x1c9
   kernel: [  253.378733]  ? _cond_resched+0x19/0x40
   kernel: [  253.381693]  ? kmem_cache_alloc_trace+0x151/0x1c0
   kernel: [  253.385037]  do_init_module+0x5f/0x216
   kernel: [  253.387999]  load_module+0x19f6/0x20a0
   kernel: [  253.390956]  __do_sys_finit_module+0xfc/0x120
   kernel: [  253.394153]  ? __do_sys_finit_module+0xfc/0x120
   kernel: [  253.397416]  __x64_sys_finit_module+0x1a/0x20
   kernel: [  253.400618]  do_syscall_64+0x5a/0x120
   kernel: [  253.403547]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
   kernel: [  253.407035] RIP: 0033:0x7f26e738d839
   kernel: [  253.409949] Code: 00 f3 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 1f f6 2c 00 f7 d8 64 89 01 48
   kernel: [  253.421539] RSP: 002b:00007f26e1d38758 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
   kernel: [  253.427373] RAX: ffffffffffffffda RBX: 00007f26cc004d70 RCX: 00007f26e738d839
   kernel: [  253.431692] RDX: 0000000000000000 RSI: 00007f26e8d0a145 RDI: 0000000000000030
   kernel: [  253.435978] RBP: 00007f26e8d0a145 R08: 0000000000000000 R09: 00007f26cc004b00
   kernel: [  253.440270] R10: 0000000000000030 R11: 0000000000000246 R12: 0000000000000000
   kernel: [  253.444581] R13: 00007f26cc004570 R14: 0000000000000000 R15: 00007f26cc004b00
   kernel: [  253.448853] Modules linked in: lttng_tracer(OE+) lttng_statedump(OE) lttng_kprobes(OE) lttng_clock(OE) lttng_lib_ring_buffer(OE) lttng_kretprobes(OE) ppdev parport_pc parport serio_raw sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 crypto_simd cryptd glue_helper ena
   kernel: [  253.478768] CR2: 0000000000025534
   kernel: [  253.481575] ---[ end trace 9c47de0da4f80343 ]---
   kernel: [  253.484885] RIP: 0010:lttng_tracepoint_notify+0x172/0x210 [lttng_tracer]
   kernel: [  253.489142] Code: eb 1a 49 39 c6 0f 85 ab 00 00 00 49 8b 55 10 41 83 c4 01 44 39 a2 8c 02 00 00 76 a3 48 8b 92 90 02 00 00 49 63 c4 4c 63 34 82 <49> 8b 1e 48 89 df e8 33 fb ff ff 48 85 c0 49 89 c7 74 52 49 8b 47
   kernel: [  253.500676] RSP: 0018:ffffb9c80232fbd8 EFLAGS: 00010246
   kernel: [  253.504223] RAX: 0000000000000000 RBX: ffff9a27cbec9c00 RCX: 0000000000000041
   kernel: [  253.508498] RDX: ffffffffc0456a0c RSI: 0000000000000001 RDI: ffffffffc0898100
   kernel: [  253.512783] RBP: ffffb9c80232fc08 R08: ffff9a27d2ba70c0 R09: ffff9a27d2403680
   kernel: [  253.517125] R10: ffffb9c80232fb48 R11: 00000000ffffffff R12: 0000000000000000
   kernel: [  253.521421] R13: ffff9a27cbec9c00 R14: 0000000000025534 R15: ffffffffc0898740
   kernel: [  253.525702] FS:  00007f26e1d49700(0000) GS:ffff9a27d2b80000(0000) knlGS:0000000000000000
   kernel: [  253.531708] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
   kernel: [  253.535444] CR2: 0000000000025534 CR3: 000000040c86c003 CR4: 00000000007606e0
   kernel: [  253.539722] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
   kernel: [  253.544020] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
   kernel: [  253.548300] PKRU: 55555554

  
  I am proposing we backport the lttng-modules from Disco (2.10.8-1ubuntu1) to fix the issues with most recent kernel.

  [Test Case]
  - Install lttng-modules-dkms
  - load lttng-tracer module

  I have built the package on bionic and load-tested with linux 4.15 and
  some 5.0 backports.

  [Regression Potential] 
  Backporting a newer version to Bionic has the potential to break compilation or users with not up-to-date kernels 4.15 kernels.

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1841766/+subscriptions


References