[Lustre-discuss] Kernel Dump

Chan Ching Yu, Patrick cychan at clustertech.com
Fri Jul 26 07:18:51 PDT 2013


Hi all,

Our Lustre 2.1.5 client rebooted itself, and a kernel dump was generated in /var/crash.
The backtrace shows ldlm functions, which are Lustre-related. Any ideas? Thanks very much.

# crash /usr/lib/debug/lib/modules/2.6.32-279.19.1.el6_lustre.x86_64/vmlinux vmcore

      KERNEL: /usr/lib/debug/lib/modules/2.6.32-279.19.1.el6_lustre.x86_64/vmlinux
    DUMPFILE: vmcore-lustre-client-2013-07-26-14:04:43  [PARTIAL DUMP]
        CPUS: 16
        DATE: Fri Jul 26 14:03:36 2013
      UPTIME: 01:56:27
LOAD AVERAGE: 0.25, 0.30, 0.18
       TASKS: 840
    NODENAME: lustre-client
     RELEASE: 2.6.32-279.19.1.el6_lustre.x86_64
     VERSION: #1 SMP Wed Mar 20 16:37:18 PDT 2013
     MACHINE: x86_64  (2600 Mhz)
      MEMORY: 64 GB
       PANIC: "Oops: 0002 [#1] SMP " (check log for details)
         PID: 3200
     COMMAND: "ldlm_cb_01"
        TASK: ffff881062dfd500  [THREAD_INFO: ffff880f250c2000]
         CPU: 0
       STATE: TASK_RUNNING (PANIC)

crash> bt
PID: 3200   TASK: ffff881062dfd500  CPU: 0   COMMAND: "ldlm_cb_01"
#0 [ffff880f250c3900] machine_kexec at ffffffff81031f7b
#1 [ffff880f250c3960] crash_kexec at ffffffff810b8c22
#2 [ffff880f250c3a30] oops_end at ffffffff814ed980
#3 [ffff880f250c3a60] no_context at ffffffff81042a0b
#4 [ffff880f250c3ab0] __bad_area_nosemaphore at ffffffff81042c95
#5 [ffff880f250c3b00] bad_area_nosemaphore at ffffffff81042d63
#6 [ffff880f250c3b10] __do_page_fault at ffffffff810434c1
#7 [ffff880f250c3c30] do_page_fault at ffffffff814ef95e
#8 [ffff880f250c3c60] page_fault at ffffffff814ecd15
    [exception RIP: _spin_lock+14]
    RIP: ffffffff814ec7fe  RSP: ffff880f250c3d10  RFLAGS: 00010206
    RAX: 0000000000010000  RBX: ffff880e437d66c0  RCX: ffff880e2c963370
    RDX: 000000000000000d  RSI: 0000000000000000  RDI: 0000000000000018
    RBP: ffff880f250c3d10   R8: 0000000000000000   R9: 5a5a5a5a5a5a5a5a
    R10: 5a5a5a5a5a5a5a5a  R11: 0000000000000000  R12: ffff880e2c963370
    R13: ffff880e437d66c0  R14: ffff880f61b24688  R15: ffff88086659ce00
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
#9 [ffff880f250c3d18] lock_res_and_lock at ffffffffa0841060 [ptlrpc]
#10 [ffff880f250c3d38] ldlm_callback_handler at ffffffffa0869e2e [ptlrpc]
#11 [ffff880f250c3dc8] ptlrpc_main at ffffffffa089bbae [ptlrpc]
#12 [ffff880f250c3f48] kernel_thread at ffffffff8100c0ca