← Back to team overview

kernel-packages team mailing list archive

[Bug 1212818] [NEW] Using fscache with NFS causes random kernel oops

 

Public bug reported:

It is hard to get an oops because the entire FS system seems to hang and
/var/log/kern.log won't capture it.  I have managed to get a kdump
capture. I think I told apport to submit the crash dump, but I'm not
really sure if it worked, and I don't know where to look to see if it
made it anyplace..

fscache was working fine in oneiric at least, so this is a regression.

I'm thinking this is probably the fix that needs to be backported:

http://www.redhat.com/archives/linux-cachefs/2013-May/msg00008.html

crash reports:

[178619.012128] BUG: unable to handle kernel NULL pointer dereference at 000000000000000c
[178619.012224] IP: [<ffffffff81044a99>] __ticket_spin_lock+0x9/0x30
[178619.012303] PGD 0 
[178619.012329] Oops: 0002 [#1] SMP 
[178619.012369] Modules linked in: nfsv4(F) cachefiles(F) rfcomm bnep bluetooth autofs4(F) nfsd(F) auth_rpcgss(F) nfs_acl(F) nfs(F) lockd(F) sunrpc(F) fscache(F) nls_iso8859_1(F) snd_hda_codec_realtek snd_hda_codec_hdmi coretemp snd_hda_intel snd_hda_codec kvm_intel snd_hwdep(F) snd_pcm(F) kvm snd_page_alloc(F) snd_seq_midi(F) snd_seq_midi_event(F) snd_rawmidi(F) snd_seq(F) snd_seq_device(F) snd_timer(F) snd(F) soundcore(F) tpm_infineon ghash_clmulni_intel(F) aesni_intel(F) aes_x86_64(F) i915 xts(F) lrw(F) gf128mul(F) hp_wmi psmouse(F) drm_kms_helper drm ablk_helper(F) cryptd(F) microcode(F) i2c_algo_bit serio_raw(F) sparse_keymap mac_hid wmi tpm_tis lpc_ich ppdev(F) video(F) parport_pc(F) iTCO_wdt iTCO_vendor_support lp(F) parport(F) hid_generic usbhid hid ahci(F) libahci(F) e1000e(F)
[178619.013231] CPU 4 
[178619.013257] Pid: 21461, comm: kworker/u:0 Tainted: GF          O 3.8.0-27-generic #40-Ubuntu Hewlett-Packard HP EliteDesk 800 G1 SFF/1998
[178619.013372] RIP: 0010:[<ffffffff81044a99>]  [<ffffffff81044a99>] __ticket_spin_lock+0x9/0x30
[178619.013460] RSP: 0018:ffff88034e9c9ce0  EFLAGS: 00010287
[178619.013512] RAX: 0000000000010000 RBX: ffff8803ef5c92c0 RCX: ffffffff81e4fc10
[178619.013579] RDX: 0000000000000001 RSI: 0000000000000005 RDI: 000000000000000c
[178619.013669] RBP: ffff88034e9c9ce0 R08: 0000000000000000 R09: ffffffff81e4fb70
[178619.013749] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000030
[178619.013816] R13: 000000000000000c R14: ffffffff81e4fb60 R15: ffffffff81e4fa00
[178619.013895] FS:  0000000000000000(0000) GS:ffff88041eb00000(0000) knlGS:0000000000000000
[178619.013997] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[178619.014064] CR2: 000000000000000c CR3: 0000000002c0d000 CR4: 00000000001407e0
[178619.014132] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[178619.014199] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[178619.014267] Process kworker/u:0 (pid: 21461, threadinfo ffff88034e9c8000, task ffff8803b99cdd00)
[178619.014364] Stack:
[178619.014398]  ffff88034e9c9cf0 ffffffff816cc97e ffff88034e9c9da0 ffffffffa02cc598
[178619.014483]  ffff880406319c00 ffff88034e9c9d20 ffffffff8108f393 ffff8803ef5c9340
[178619.014576]  ffff88034e9c9d30 ffffffff81074c74 ffff88034e9c9d40 ffffffff81074d14
[178619.014660] Call Trace:
[178619.014698]  [<ffffffff816cc97e>] _raw_spin_lock+0xe/0x20
[178619.014762]  [<ffffffffa02cc598>] fscache_invalidate_writes+0x28/0xc0 [fscache]
[178619.014839]  [<ffffffff8108f393>] ? wake_up_process+0x23/0x40
[178619.014927]  [<ffffffff81074c74>] ? wake_up_worker+0x24/0x30
[178619.014985]  [<ffffffff81074d14>] ? insert_work+0x94/0xb0
[178619.015043]  [<ffffffff8107725a>] ? __queue_work+0xea/0x370
[178619.015111]  [<ffffffffa02c8f6f>] fscache_object_state_machine+0x6f/0xa30 [fscache]
[178619.015192]  [<ffffffffa02c995b>] fscache_object_work_func+0x2b/0x60 [fscache]
[178619.015263]  [<ffffffff8107682c>] process_one_work+0x15c/0x4d0
[178619.015325]  [<ffffffff8107808e>] worker_thread+0x16e/0x480
[178619.015418]  [<ffffffff81077f20>] ? manage_workers+0x2a0/0x2a0
[178619.015491]  [<ffffffff8107d420>] kthread+0xc0/0xd0
[178619.015558]  [<ffffffff8107d360>] ? kthread_create_on_node+0x120/0x120
[178619.015662]  [<ffffffff816d516c>] ret_from_fork+0x7c/0xb0
[178619.015728]  [<ffffffff8107d360>] ? kthread_create_on_node+0x120/0x120
[178619.015790] Code: 00 00 48 c7 c1 a1 48 04 81 48 c7 c2 9e 48 04 81 e9 dd fe ff ff 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 b8 00 00 01 00 48 89 e5 <f0> 0f c1 07 89 c1 c1 e9 10 66 39 c1 89 ca 74 11 0f 1f 80 00 00 
[178619.016207] RIP  [<ffffffff81044a99>] __ticket_spin_lock+0x9/0x30
[178619.016302]  RSP <ffff88034e9c9ce0>
[178619.016338] CR2: 000000000000000c

** Affects: linux (Ubuntu)
     Importance: Undecided
         Status: New

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/1212818

Title:
  Using fscache with NFS causes random kernel oops

Status in “linux” package in Ubuntu:
  New

Bug description:
  It is hard to get an oops because the entire FS system seems to hang
  and /var/log/kern.log won't capture it.  I have managed to get a kdump
  capture. I think I told apport to submit the crash dump, but I'm not
  really sure if it worked, and I don't know where to look to see if it
  made it anyplace..

  fscache was working fine in oneiric at least, so this is a regression.

  I'm thinking this is probably the fix that needs to be backported:

  http://www.redhat.com/archives/linux-cachefs/2013-May/msg00008.html

  crash reports:

  [178619.012128] BUG: unable to handle kernel NULL pointer dereference at 000000000000000c
  [178619.012224] IP: [<ffffffff81044a99>] __ticket_spin_lock+0x9/0x30
  [178619.012303] PGD 0 
  [178619.012329] Oops: 0002 [#1] SMP 
  [178619.012369] Modules linked in: nfsv4(F) cachefiles(F) rfcomm bnep bluetooth autofs4(F) nfsd(F) auth_rpcgss(F) nfs_acl(F) nfs(F) lockd(F) sunrpc(F) fscache(F) nls_iso8859_1(F) snd_hda_codec_realtek snd_hda_codec_hdmi coretemp snd_hda_intel snd_hda_codec kvm_intel snd_hwdep(F) snd_pcm(F) kvm snd_page_alloc(F) snd_seq_midi(F) snd_seq_midi_event(F) snd_rawmidi(F) snd_seq(F) snd_seq_device(F) snd_timer(F) snd(F) soundcore(F) tpm_infineon ghash_clmulni_intel(F) aesni_intel(F) aes_x86_64(F) i915 xts(F) lrw(F) gf128mul(F) hp_wmi psmouse(F) drm_kms_helper drm ablk_helper(F) cryptd(F) microcode(F) i2c_algo_bit serio_raw(F) sparse_keymap mac_hid wmi tpm_tis lpc_ich ppdev(F) video(F) parport_pc(F) iTCO_wdt iTCO_vendor_support lp(F) parport(F) hid_generic usbhid hid ahci(F) libahci(F) e1000e(F)
  [178619.013231] CPU 4 
  [178619.013257] Pid: 21461, comm: kworker/u:0 Tainted: GF          O 3.8.0-27-generic #40-Ubuntu Hewlett-Packard HP EliteDesk 800 G1 SFF/1998
  [178619.013372] RIP: 0010:[<ffffffff81044a99>]  [<ffffffff81044a99>] __ticket_spin_lock+0x9/0x30
  [178619.013460] RSP: 0018:ffff88034e9c9ce0  EFLAGS: 00010287
  [178619.013512] RAX: 0000000000010000 RBX: ffff8803ef5c92c0 RCX: ffffffff81e4fc10
  [178619.013579] RDX: 0000000000000001 RSI: 0000000000000005 RDI: 000000000000000c
  [178619.013669] RBP: ffff88034e9c9ce0 R08: 0000000000000000 R09: ffffffff81e4fb70
  [178619.013749] R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000030
  [178619.013816] R13: 000000000000000c R14: ffffffff81e4fb60 R15: ffffffff81e4fa00
  [178619.013895] FS:  0000000000000000(0000) GS:ffff88041eb00000(0000) knlGS:0000000000000000
  [178619.013997] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
  [178619.014064] CR2: 000000000000000c CR3: 0000000002c0d000 CR4: 00000000001407e0
  [178619.014132] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
  [178619.014199] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
  [178619.014267] Process kworker/u:0 (pid: 21461, threadinfo ffff88034e9c8000, task ffff8803b99cdd00)
  [178619.014364] Stack:
  [178619.014398]  ffff88034e9c9cf0 ffffffff816cc97e ffff88034e9c9da0 ffffffffa02cc598
  [178619.014483]  ffff880406319c00 ffff88034e9c9d20 ffffffff8108f393 ffff8803ef5c9340
  [178619.014576]  ffff88034e9c9d30 ffffffff81074c74 ffff88034e9c9d40 ffffffff81074d14
  [178619.014660] Call Trace:
  [178619.014698]  [<ffffffff816cc97e>] _raw_spin_lock+0xe/0x20
  [178619.014762]  [<ffffffffa02cc598>] fscache_invalidate_writes+0x28/0xc0 [fscache]
  [178619.014839]  [<ffffffff8108f393>] ? wake_up_process+0x23/0x40
  [178619.014927]  [<ffffffff81074c74>] ? wake_up_worker+0x24/0x30
  [178619.014985]  [<ffffffff81074d14>] ? insert_work+0x94/0xb0
  [178619.015043]  [<ffffffff8107725a>] ? __queue_work+0xea/0x370
  [178619.015111]  [<ffffffffa02c8f6f>] fscache_object_state_machine+0x6f/0xa30 [fscache]
  [178619.015192]  [<ffffffffa02c995b>] fscache_object_work_func+0x2b/0x60 [fscache]
  [178619.015263]  [<ffffffff8107682c>] process_one_work+0x15c/0x4d0
  [178619.015325]  [<ffffffff8107808e>] worker_thread+0x16e/0x480
  [178619.015418]  [<ffffffff81077f20>] ? manage_workers+0x2a0/0x2a0
  [178619.015491]  [<ffffffff8107d420>] kthread+0xc0/0xd0
  [178619.015558]  [<ffffffff8107d360>] ? kthread_create_on_node+0x120/0x120
  [178619.015662]  [<ffffffff816d516c>] ret_from_fork+0x7c/0xb0
  [178619.015728]  [<ffffffff8107d360>] ? kthread_create_on_node+0x120/0x120
  [178619.015790] Code: 00 00 48 c7 c1 a1 48 04 81 48 c7 c2 9e 48 04 81 e9 dd fe ff ff 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 00 55 b8 00 00 01 00 48 89 e5 <f0> 0f c1 07 89 c1 c1 e9 10 66 39 c1 89 ca 74 11 0f 1f 80 00 00 
  [178619.016207] RIP  [<ffffffff81044a99>] __ticket_spin_lock+0x9/0x30
  [178619.016302]  RSP <ffff88034e9c9ce0>
  [178619.016338] CR2: 000000000000000c

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1212818/+subscriptions


Follow ups

References