[Lustre-discuss] Kernel BUG - ll_intent_drop_lock

Wojciech Turek wjt27 at cam.ac.uk
Thu Jul 12 03:35:05 PDT 2012


This bug happened on the client running Lustre-2.1.2, has anybody seen
that before?

BUG: unable to handle kernel paging request at 00000000ffffffff
IP: [<ffffffffa0895d05>] ll_intent_drop_lock+0x15/0xb0 [lustre]
PGD 488b52067 PUD 0
Oops: 0000 [#1] SMP
last sysfs file: /sys/module/ipmi_si/initstate
CPU 0
Modules linked in: lmv(U) ipmi_si ipmi_devintf ipmi_msghandler mgc(U)
lustre(U) lov(U) osc(U) lquota(U) mdc(U) fid(U) fld(U) ko2iblnd(U)
ptlrpc(U) obdclass(U) lnet(U) lvfs(U) libcfs(U) nfsd exportfs
acpi_cpufreq freq_table mperf rdma_ucm(U) rdma_cm(U) iw_cm(U)
ib_addr(U) ib_ipoib(U) ib_cm(U) ib_sa(U) ib_uverbs(U) ib_umad(U)
mlx4_ib(U) ib_mad(U) ib_core(U) mlx4_core(U) ext4 mbcache jbd2
dm_mirror dm_region_hash dm_log dm_mod vhost_net macvtap macvlan tun
kvm wmi microcode dcdbas sg sb_edac edac_core i2c_i801 i2c_core
iTCO_wdt iTCO_vendor_support shpchp ioatdma ipv6 sd_mod crc_t10dif
ahci igb dca nfs lockd fscache nfs_acl auth_rpcgss sunrpc [last
unloaded: scsi_wait_scan]

Pid: 89623, comm: ks_spectrum_his Tainted: G        W
----------------   2.6.32-220.23.1.el6.x86_64 #1 Dell Inc. PowerEdge
C6220/0W6W6G
RIP: 0010:[<ffffffffa0895d05>]  [<ffffffffa0895d05>]
ll_intent_drop_lock+0x15/0xb0 [lustre]
RSP: 0018:ffff880c6eb21c78  EFLAGS: 00010286
RAX: 00000000030e0580 RBX: 00000000ffffffff RCX: 0000000000000002
RDX: ffff88069193b348 RSI: ffffffff81ace670 RDI: 00000000ffffffff
RBP: ffff880c6eb21cb8 R08: ffff88069193b348 R09: 000000000000000b
R10: 0000000000000003 R11: 0000000000000086 R12: ffff88071f2eb840
R13: ffff88062e3daeb8 R14: ffff88071f2eb840 R15: ffff8807078b81c0
FS:  00007fb5ffdd9700(0000) GS:ffff880044600000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00000000ffffffff CR3: 0000000c6ea7a000 CR4: 00000000000406f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process ks_spectrum_his (pid: 89623, threadinfo ffff880c6eb20000, task
ffff880ec182b580)
Stack:
 ffff880c6eb21ce8 ffff8808690cd000 ffff88062e3dad80 ffffffffa0895000
<0> ffff88071f2eb840 ffff88062e3daeb8 00000000ffffffff ffff88071f2eb840
<0> ffff880c6eb21cf8 ffffffffa08962a1 ffff880c6eb21d28 ffff88062e3daeb8
Call Trace:
 [<ffffffffa0895000>] ? return_if_equal+0x0/0x30 [lustre]
 [<ffffffffa08962a1>] ll_intent_release+0x41/0x1c0 [lustre]
 [<ffffffffa0896475>] ll_release+0x55/0x470 [lustre]
 [<ffffffff81190a72>] ? iput+0x62/0x70
 [<ffffffffa0897a01>] ? ll_d_iput+0x31/0x60 [lustre]
 [<ffffffff8118d6c8>] d_free+0x28/0x60
 [<ffffffff8118d749>] d_kill+0x49/0x60
 [<ffffffff8118f15c>] dput+0x7c/0x150
 [<ffffffff81178379>] __fput+0x189/0x210
 [<ffffffff81178425>] fput+0x25/0x30
 [<ffffffff81173e6d>] filp_close+0x5d/0x90
 [<ffffffff8106c7bf>] put_files_struct+0x7f/0xf0
 [<ffffffff8106c883>] exit_files+0x53/0x70
 [<ffffffff8106e8f5>] do_exit+0x185/0x870
 [<ffffffff8106f038>] do_group_exit+0x58/0xd0
 [<ffffffff8106f0c7>] sys_exit_group+0x17/0x20
 [<ffffffff8100b0f2>] system_call_fastpath+0x16/0x1b
Code: 24 00 00 00 00 e8 ec 14 bc ff e9 ec fe ff ff 0f 1f 80 00 00 00
00 55 48 89 e5 48 83 ec 40 48 89 5d f0 4c 89 65 f8 0f 1f 44 00 00 <8b>
0f 48 89 fb 85 c9 74 7e 8b 77 28 85 f6 74 77 f6 05 ca 69 bd
RIP  [<ffffffffa0895d05>] ll_intent_drop_lock+0x15/0xb0 [lustre]
 RSP <ffff880c6eb21c78>
CR2: 00000000ffffffff
---[ end trace 7e2bd11f85dd477d ]---
Kernel panic - not syncing: Fatal exception
Pid: 89623, comm: ks_spectrum_his Tainted: G      D W
----------------   2.6.32-220.23.1.el6.x86_64 #1
Call Trace:
 [<ffffffff814ecb34>] ? panic+0x78/0x143
 [<ffffffff814f0cd4>] ? oops_end+0xe4/0x100
 [<ffffffff810423fb>] ? no_context+0xfb/0x260
 [<ffffffff81042685>] ? __bad_area_nosemaphore+0x125/0x1e0
 [<ffffffff81042753>] ? bad_area_nosemaphore+0x13/0x20
 [<ffffffff81042e0d>] ? __do_page_fault+0x31d/0x480
 [<ffffffffa045bc8f>] ? cfs_hash_bd_from_key+0x3f/0xc0 [libcfs]
 [<ffffffffa0895000>] ? return_if_equal+0x0/0x30 [lustre]
 [<ffffffffa045c317>] ? cfs_hash_bd_get+0x37/0x90 [libcfs]
 [<ffffffffa0614735>] ? ldlm_resource_foreach+0x85/0x3e0 [ptlrpc]
 [<ffffffffa06004e7>] ? ldlm_resource_putref+0x67/0x270 [ptlrpc]
 [<ffffffff814f2c8e>] ? do_page_fault+0x3e/0xa0
 [<ffffffff814f0045>] ? page_fault+0x25/0x30
 [<ffffffffa0895d05>] ? ll_intent_drop_lock+0x15/0xb0 [lustre]
 [<ffffffffa0895000>] ? return_if_equal+0x0/0x30 [lustre]
 [<ffffffffa0895000>] ? return_if_equal+0x0/0x30 [lustre]
 [<ffffffffa08962a1>] ? ll_intent_release+0x41/0x1c0 [lustre]
 [<ffffffffa0896475>] ? ll_release+0x55/0x470 [lustre]
 [<ffffffff81190a72>] ? iput+0x62/0x70
 [<ffffffffa0897a01>] ? ll_d_iput+0x31/0x60 [lustre]
 [<ffffffff8118d6c8>] ? d_free+0x28/0x60
 [<ffffffff8118d749>] ? d_kill+0x49/0x60
 [<ffffffff8118f15c>] ? dput+0x7c/0x150
 [<ffffffff81178379>] ? __fput+0x189/0x210
 [<ffffffff81178425>] ? fput+0x25/0x30
 [<ffffffff81173e6d>] ? filp_close+0x5d/0x90
 [<ffffffff8106c7bf>] ? put_files_struct+0x7f/0xf0
 [<ffffffff8106c883>] ? exit_files+0x53/0x70
 [<ffffffff8106e8f5>] ? do_exit+0x185/0x870
 [<ffffffff8106f038>] ? do_group_exit+0x58/0xd0
 [<ffffffff8106f0c7>] ? sys_exit_group+0x17/0x20
 [<ffffffff8100b0f2>] ? system_call_fastpath+0x16/0x1b

-- 
Wojciech Turek

Senior System Architect

High Performance Computing Service
University of Cambridge
Email: wjt27 at cam.ac.uk
Tel: (+)44 1223 763517



More information about the lustre-discuss mailing list