← Back to team overview

kernel-packages team mailing list archive

[Bug 1183862] Re: rbd driver crashes when ceph OSD down

 

Is this still an issue?  Moving to "Incomplete" for now.

** Changed in: linux (Ubuntu)
       Status: In Progress => Incomplete

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1183862

Title:
  rbd driver crashes when ceph OSD down

Status in “linux” package in Ubuntu:
  Incomplete

Bug description:
  [ 1117.016022] libceph: mon2 192.168.64.102:6789 socket closed
  [ 1117.379611] libceph: osd0 192.168.64.102:6800 socket closed
  [ 1117.381986] libceph: osd0 192.168.64.102:6800 connection failed
  [ 1117.816788] libceph: osd0 192.168.64.102:6800 connection failed
  [ 1118.816840] libceph: osd0 192.168.64.102:6800 connection failed
  [ 1222.374004] libceph: mon2 192.168.64.104:6789 socket closed
  [ 1222.375608] libceph: mon2 192.168.64.104:6789 connection failed
  [ 1222.478901] libceph: osd2 192.168.64.104:6800 socket closed
  [ 1222.481166] libceph: osd2 192.168.64.104:6800 connection failed
  [ 1222.816681] libceph: osd2 192.168.64.104:6800 connection failed
  [ 1223.816851] libceph: osd2 192.168.64.104:6800 connection failed
  [ 1247.072582] libceph: osd0 192.168.64.102:6800 socket closed
  [ 1247.081463] BUG: unable to handle kernel NULL pointer dereference at 0000000000000048
  [ 1247.083837] IP: [<ffffffffa0035e40>] write_partial_msg_pages+0x3f0/0x4b0 [libceph]
  [ 1247.084021] PGD 11665d067 PUD 114f74067 PMD 0
  [ 1247.084021] Oops: 0000 [#1] SMP
  [ 1247.084021] CPU 0
  [ 1247.084021] Modules linked in: psmouse serio_raw virtio_balloon mac_hid i2c_piix4 rbd libceph libcrc32c lp parport floppy [last unloaded: ipmi_msghandler]
  [ 1247.084021]
  [ 1247.084021] Pid: 3429, comm: kworker/0:2 Not tainted 3.2.0-40-generic #64-Ubuntu Bochs Bochs
  [ 1247.084021] RIP: 0010:[<ffffffffa0035e40>]  [<ffffffffa0035e40>] write_partial_msg_pages+0x3f0/0x4b0 [libceph]
  [ 1247.084021] RSP: 0018:ffff8800db8cbd10  EFLAGS: 00010246
  [ 1247.084021] RAX: 0000000000000000 RBX: ffff880115caa830 RCX: 0000000000000080
  [ 1247.084021] RDX: 0000000000030000 RSI: ffff8800db80a2a0 RDI: 0000000000000000
  [ 1247.084021] RBP: ffff8800db8cbd70 R08: 0000000000000000 R09: 0000000000000001
  [ 1247.084021] R10: ffff8800dba40af0 R11: 0000000000000000 R12: ffffea000332b280
  [ 1247.084021] R13: 0000000000050000 R14: ffff8800db80a200 R15: 0000000000080000
  [ 1247.084021] FS:  0000000000000000(0000) GS:ffff88011fc00000(0000) knlGS:0000000000000000
  [ 1247.084021] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
  [ 1247.084021] CR2: 0000000000000048 CR3: 0000000116bc4000 CR4: 00000000000006f0
  [ 1247.084021] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
  [ 1247.084021] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
  [ 1247.084021] Process kworker/0:2 (pid: 3429, threadinfo ffff8800db8ca000, task ffff8800dba7dc00)
  [ 1247.084021] Stack:
  [ 1247.084021]  ffff8800dba7dc48 0000000000000000 ffff8800db80a2a0 00080000db8cbd50
  [ 1247.084021]  0000000000000000 0000000000080000 0000000000000000 ffff880115caa830
  [ 1247.084021]  0000160000000000 ffff880000000000 0000000000001000 ffff880115caab08
  [ 1247.084021] Call Trace:
  [ 1247.084021]  [<ffffffffa00365ff>] try_write+0x11f/0x360 [libceph]
  [ 1247.084021]  [<ffffffffa0037c5c>] con_work+0x11c/0x160 [libceph]
  [ 1247.084021]  [<ffffffff81085aa7>] process_one_work+0x127/0x470
  [ 1247.084021]  [<ffffffff81086bb4>] worker_thread+0x164/0x370
  [ 1247.084021]  [<ffffffff81086a50>] ? manage_workers.isra.31+0x130/0x130
  [ 1247.084021]  [<ffffffff8108b41c>] kthread+0x8c/0xa0
  [ 1247.084021]  [<ffffffff81668bf4>] kernel_thread_helper+0x4/0x10
  [ 1247.084021]  [<ffffffff8108b390>] ? flush_kthread_worker+0xa0/0xa0
  [ 1247.084021]  [<ffffffff81668bf0>] ? gs_change+0x13/0x13
  [ 1247.084021] Code: c7 fc ff ff 0f 1f 80 00 00 00 00 49 83 be 90 00 00 00 00 0f 84 aa 00 00 00 49 63 be a0 00 00 00 49 8b 86 98 00 00 00 48 c1 e7 04 <48> 03 78 48 8b 45 c4 85 c0 4c 8b 27 8b
  4f 0c 75 31 8b 93 c8 02
  [ 1247.084021] RIP  [<ffffffffa0035e40>] write_partial_msg_pages+0x3f0/0x4b0 [libceph]
  [ 1247.084021]  RSP <ffff8800db8cbd10>
  [ 1247.084021] CR2: 0000000000000048
  [ 1247.161802] ---[ end trace 11a3b7333b7132d7 ]---
  [ 1247.165785] BUG: unable to handle kernel paging request at fffffffffffffff8
  [ 1247.168020] IP: [<ffffffff8108b8b1>] kthread_data+0x11/0x20
  [ 1247.168020] PGD 1c07067 PUD 1c08067 PMD 0
  [ 1247.168020] Oops: 0000 [#2] SMP
  [ 1247.168020] CPU 0
  [ 1247.168020] Modules linked in: psmouse serio_raw virtio_balloon mac_hid i2c_piix4 rbd libceph libcrc32c lp parport floppy [last unloaded: ipmi_msghandler]
  [ 1247.168020]
  [ 1247.168020] Pid: 3429, comm: kworker/0:2 Tainted: G      D      3.2.0-40-generic #64-Ubuntu Bochs Bochs
  [ 1247.168020] RIP: 0010:[<ffffffff8108b8b1>]  [<ffffffff8108b8b1>] kthread_data+0x11/0x20
  [ 1247.168020] Pid: 3429, comm: kworker/0:2 Tainted: G      D      3.2.0-40-generic #64-Ubuntu Bochs Bochs
  [ 1247.168020] RIP: 0010:[<ffffffff8108b8b1>]  [<ffffffff8108b8b1>] kthread_data+0x11/0x20
  [ 1247.168020] RSP: 0018:ffff8800db8cb930  EFLAGS: 00010092
  [ 1247.168020] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
  [ 1247.168020] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff8800dba7dc00
  [ 1247.168020] RBP: ffff8800db8cb948 R08: 0000000000989680 R09: 0000000000000000
  [ 1247.168020] R10: ffff880114e85248 R11: 0000000000000000 R12: 0000000000000000
  [ 1247.168020] R13: ffff8800dba7dfd0 R14: 0000000000000001 R15: 0000000000000000
  [ 1247.168020] FS:  0000000000000000(0000) GS:ffff88011fc00000(0000) knlGS:0000000000000000
  [ 1247.168020] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
  [ 1247.168020] CR2: fffffffffffffff8 CR3: 0000000116808000 CR4: 00000000000006f0
  [ 1247.168020] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
  [ 1247.168020] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
  [ 1247.168020] Process kworker/0:2 (pid: 3429, threadinfo ffff8800db8ca000, task ffff8800dba7dc00)
  [ 1247.168020] Stack:
  [ 1247.168020]  ffffffff81087165 ffff8800db8cb948 ffff88011fc137c0 ffff8800db8cb9c8
  [ 1247.168020]  ffffffff8165c147 ffff8800db8cb988 ffff8800dba7dc00 ffff8800db8cbfd8
  [ 1247.168020]  ffff8800db8cbfd8 ffff8800db8cbfd8 00000000000137c0 ffff8800db8cb9b8
  [ 1247.168020] Call Trace:
  [ 1247.168020]  [<ffffffff81087165>] ? wq_worker_sleeping+0x15/0xa0
  [ 1247.168020]  [<ffffffff8165c147>] __schedule+0x5d7/0x6f0
  [ 1247.168020]  [<ffffffff8165c58f>] schedule+0x3f/0x60
  [ 1247.168020]  [<ffffffff8106c89c>] do_exit+0x28c/0x450
  [ 1247.168020]  [<ffffffff8165f620>] oops_end+0xb0/0xf0
  [ 1247.168020]  [<ffffffff81644907>] no_context+0x150/0x15d
  [ 1247.168020]  [<ffffffff81644adf>] __bad_area_nosemaphore+0x1cb/0x1ea
  [ 1247.168020]  [<ffffffff81644b11>] bad_area_nosemaphore+0x13/0x15
  [ 1247.168020]  [<ffffffff81662266>] do_page_fault+0x426/0x520
  [ 1247.168020]  [<ffffffff81597206>] ? __tcp_push_pending_frames+0x26/0x90
  [ 1247.168020]  [<ffffffff81588bf0>] ? do_tcp_sendpages+0x5c0/0x6f0
  [ 1247.168020]  [<ffffffff81661aa5>] do_async_page_fault+0x35/0x80
  [ 1247.168020]  [<ffffffff8165ec25>] async_page_fault+0x25/0x30
  [ 1247.168020]  [<ffffffffa0035e40>] ? write_partial_msg_pages+0x3f0/0x4b0 [libceph]
  [ 1247.168020]  [<ffffffffa00365ff>] try_write+0x11f/0x360 [libceph]
  [ 1247.168020]  [<ffffffffa0037c5c>] con_work+0x11c/0x160 [libceph]
  [ 1247.168020]  [<ffffffff81085aa7>] process_one_work+0x127/0x470
  [ 1247.168020]  [<ffffffff81086bb4>] worker_thread+0x164/0x370
  [ 1247.168020]  [<ffffffff81086a50>] ? manage_workers.isra.31+0x130/0x130
  [ 1247.168020]  [<ffffffff8108b41c>] kthread+0x8c/0xa0
  [ 1247.168020]  [<ffffffff81668bf4>] kernel_thread_helper+0x4/0x10
  [ 1247.168020]  [<ffffffff8108b390>] ? flush_kthread_worker+0xa0/0xa0
  [ 1247.168020]  [<ffffffff81668bf0>] ? gs_change+0x13/0x13
  [ 1247.168020] Code: 41 5f 5d c3 be 3e 01 00 00 48 c7 c7 78 a7 a0 81 e8 25 c6 fd ff e9 74 fe ff ff 55 48 89 e5 0f 1f 44 00 00 48 8b 87 78 03 00 00 5d <48> 8b 40 f8 c3 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 0f 1f
  [ 1247.168020] RIP  [<ffffffff8108b8b1>] kthread_data+0x11/0x20
  [ 1247.168020]  RSP <ffff8800db8cb930>
  [ 1247.168020] CR2: fffffffffffffff8
  [ 1247.168020] ---[ end trace 11a3b7333b7132d8 ]---
  [ 1247.168020] Fixing recursive fault but reboot is needed!

  ProblemType: Bug
  DistroRelease: Ubuntu 12.04
  Package: linux-image-3.2.0-43-generic 3.2.0-43.68
  ProcVersionSignature: Ubuntu 3.2.0-43.68-generic 3.2.42
  Uname: Linux 3.2.0-43-generic x86_64
  AlsaDevices:
   total 0
   crw-rw---T 1 root audio 116,  1 May 23 17:26 seq
   crw-rw---T 1 root audio 116, 33 May 23 17:26 timer
  AplayDevices: Error: [Errno 2] No such file or directory
  ApportVersion: 2.0.1-0ubuntu17.2
  Architecture: amd64
  ArecordDevices: Error: [Errno 2] No such file or directory
  AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
  CRDA: Error: [Errno 2] No such file or directory
  Date: Fri May 24 11:34:00 2013
  HibernationDevice: RESUME=UUID=ec08ae22-c549-4f54-ae16-5f93ebcff467
  IwConfig:
   lo        no wireless extensions.
   
   eth1      no wireless extensions.
   
   eth0      no wireless extensions.
  Lsusb: Bus 001 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
  MachineType: Bochs Bochs
  MarkForUpload: True
  PciMultimedia:
   
  ProcEnviron:
   TERM=xterm
   PATH=(custom, no user)
   LANG=en_US.UTF-8
   SHELL=/bin/bash
  ProcFB:
   
  ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.2.0-43-generic root=UUID=2cabf957-646b-4fc0-80d0-d23c580f9187 ro console=ttyS0,115200 initrd=amd64/generic/precise/install/initrd.gz BOOTIF=01-52-54-00-17-bb-0e quiet
  RelatedPackageVersions:
   linux-restricted-modules-3.2.0-43-generic N/A
   linux-backports-modules-3.2.0-43-generic  N/A
   linux-firmware                            1.79.4
  RfKill: Error: [Errno 2] No such file or directory
  SourcePackage: linux
  UpgradeStatus: No upgrade log present (probably fresh install)
  WifiSyslog:
   May 24 09:02:51 inst-027 dhclient: DHCPREQUEST of 192.168.64.132 on eth0 to 192.168.64.1 port 67
   May 24 09:02:51 inst-027 dhclient: DHCPACK of 192.168.64.132 from 192.168.64.1
   May 24 09:02:51 inst-027 dhclient: bound to 192.168.64.132 -- renewal in 20745 seconds.
  dmi.bios.date: 01/01/2007
  dmi.bios.vendor: Bochs
  dmi.bios.version: Bochs
  dmi.chassis.type: 1
  dmi.chassis.vendor: Bochs
  dmi.modalias: dmi:bvnBochs:bvrBochs:bd01/01/2007:svnBochs:pnBochs:pvr:cvnBochs:ct1:cvr:
  dmi.product.name: Bochs
  dmi.sys.vendor: Bochs

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1183862/+subscriptions