[Lustre-discuss] can't umount an OST

jrs botemout at gmail.com
Fri May 2 11:56:04 PDT 2008


For reasons unknown to me, I'm unable to umount an OST on one of my two
OSSes.  This just started.

oss02:~ # ps -ef|grep umount
root      8335  6373  0 11:45 pts/0    00:00:00 umount -t lustre -a
root     10016  8463  0 12:05 pts/1    00:00:00 grep mount

oss02:~ # lctl dl
   0 UP mgc MGC10.200.20.59 at tcp 52c2288f-b787-aa06-8956-b56a0b1f38cb 5
   1 UP ost OSS OSS_uuid 3
   2 ST obdfilter i3_lfs3-OST0001 i3_lfs3-OST0001_UUID 1

oss02:~ # tail -5 /var/log/messages
May  2 11:54:21 oss02 kernel: LustreError: 6919:0:(ldlm_lib.c:1442:target_send_reply_msg()) Skipped 20 previous similar messages
May  2 12:02:55 oss02 kernel: LustreError: 137-5: UUID 'i3_lfs3-OST0001_UUID' is not available  for connect (stopping)
May  2 12:02:55 oss02 kernel: LustreError: Skipped 40 previous similar messages
May  2 12:02:55 oss02 kernel: LustreError: 6960:0:(ldlm_lib.c:1442:target_send_reply_msg()) @@@ processing error (-19)  req at ffff810228532400 x45949/t0 
o8-><?>@<?>:-1 lens 304/0 ref 0 fl Interpret:/0/0 rc -19/0
May  2 12:02:55 oss02 kernel: LustreError: 6960:0:(ldlm_lib.c:1442:target_send_reply_msg()) Skipped 40 previous similar messages

The above was available and in use before the attempt to umount.

The OSS knows that something is amiss:

oss02:~ # cat /proc/fs/lustre/health_check
LBUG
NOT HEALTHY

The client, of course, is hanging now but I get this error.

samba02:~ # lfs df
UUID                 1K-blocks      Used Available  Use% Mounted on
i3_lfs3-MDT0000_UUID 5127239040 293477480 4833761560    5% /mnt/lustre/i3_lfs3[MDT:0]
error: llapi_obd_statfs failed: Bad address (-14)
samba02:~ #

After I power cycle oss02 (the OSS which had the mount that wouldn't release), I
still get the same:
samba02:~ # lfs df
UUID                 1K-blocks      Used Available  Use% Mounted on
i3_lfs3-MDT0000_UUID 5127239040 293477480 4833761560    5% /mnt/lustre/i3_lfs3[MDT:0]
error: llapi_obd_statfs failed: Bad address (-14)

and I can't run 'lfs check osts'; in fact it produces:

May  2 12:53:59 samba02 kernel: LustreError: 18105:0:(client.c:504:ptlrpc_import_delay_req()) @@@ Uninitialized import.  req at ffff8101e6c7c200
x46820/t0 o400->i3_lfs3-OST0000_UUID@<NULL>:6 lens 64/64 ref 1 fl Rpc:N/0/0 rc 0/0
May  2 12:53:59 samba02 kernel: LustreError: 18105:0:(client.c:506:ptlrpc_import_delay_req()) LBUG
May  2 12:53:59 samba02 kernel: Lustre: 18105:0:(linux-debug.c:168:libcfs_debug_dumpstack()) showing stack for process 18105
May  2 12:53:59 samba02 kernel: lfs           R  running task       0 18105  12088                     (NOTLB)
May  2 12:53:59 samba02 kernel: 0000000000000000 ffffffff8840741c 0000000000000004 ffff8101d7642ac0
May  2 12:53:59 samba02 kernel:        ffff8101efe779a0 ffffffff8016079c ffff8101e6c7c200 ffff8101e6c7c200
May  2 12:53:59 samba02 kernel:        ffff8101d3a609b8 ffffffff883fb72b
May  2 12:53:59 samba02 kernel: Call Trace: <ffffffff8840741c>{:ptlrpc:lustre_pack_request+1052}
May  2 12:53:59 samba02 kernel:        <ffffffff8016079c>{filemap_nopage+387} <ffffffff883fb72b>{:ptlrpc:ptlrpc_queue_wait+571}
May  2 12:53:59 samba02 kernel:        <ffffffff883f92c4>{:ptlrpc:ptlrpc_prep_req_pool+1796}
May  2 12:53:59 samba02 kernel:        <ffffffff88426d4c>{:ptlrpc:lprocfs_wr_ping+428} <ffffffff802d968d>{__down_read+18}
May  2 12:53:59 samba02 kernel:        <ffffffff8836f29b>{:obdclass:lprocfs_fops_write+91}
May  2 12:53:59 samba02 kernel:        <ffffffff80181803>{vfs_write+215} <ffffffff80181dca>{sys_write+69}
May  2 12:53:59 samba02 kernel:        <ffffffff8010ad3e>{system_call+126}
May  2 12:53:59 samba02 kernel: LustreError: dumping log to /tmp/lustre-log.1209754439.18105

thanks
JR




More information about the lustre-discuss mailing list