[Lustre-discuss] big problem: read-only fs
Papp Tamás
tompos at martos.bme.hu
Tue Sep 16 09:47:43 PDT 2008
hi All,
This morning we see on some client, it cannot connect to one of our node.
I run fsck on the node, and remounted it. Fsck found a lot of errors.
After this I see this on the logs again:
Sep 16 18:16:08 node1 kernel: LustreError:
2538:0:(ldlm_resource.c:719:ldlm_resource_add()) lvbo_init failed for
resource 80132: rc -2
Sep 16 18:16:08 node1 kernel: LustreError:
2538:0:(ldlm_resource.c:719:ldlm_resource_add()) Skipped 15 previous
similar messages
Sep 16 18:27:15 node1 kernel: LustreError:
2487:0:(ldlm_resource.c:719:ldlm_resource_add()) lvbo_init failed for
resource 232490: rc -2
Sep 16 18:27:15 node1 kernel: LustreError:
2487:0:(ldlm_resource.c:719:ldlm_resource_add()) Skipped 16 previous
similar messages
Sep 16 18:30:08 node1 kernel: LDISKFS-fs error (device sdb1):
ldiskfs_ext_find_extent: bad header in inode #58262056: invalid magic -
magic 0, entries 0, max 0(0), depth 0(0)
Sep 16 18:30:08 node1 kernel: Remounting filesystem read-only
Sep 16 18:30:16 node1 kernel: Lustre: Skipped 1 previous similar message
Sep 16 18:30:16 node1 kernel: LustreError:
3986:0:(fsfilt-ldiskfs.c:1318:fsfilt_ldiskfs_write_record()) can't start
transaction for 37 blocks (128 bytes)
Sep 16 18:30:16 node1 kernel: LustreError:
3986:0:(fsfilt-ldiskfs.c:1318:fsfilt_ldiskfs_write_record()) Skipped 53
previous similar messages
Sep 16 18:30:16 node1 kernel: LustreError:
3986:0:(filter.c:360:filter_client_free()) zeroing out client
2bee00d4-c421-4c8e-bf27-8dd131e0bc55 at idx 51 (14720) in last_rcvd rc -30
Sep 16 18:34:32 node1 kernel: LustreError:
2426:0:(fsfilt-ldiskfs.c:281:fsfilt_ldiskfs_start()) error starting
handle for op 8 (71 credits): rc -30
Sep 16 18:34:32 node1 kernel: LustreError:
2426:0:(fsfilt-ldiskfs.c:281:fsfilt_ldiskfs_start()) Skipped 36 previous
similar messages
Sep 16 18:37:16 node1 kernel: LustreError:
2416:0:(ldlm_resource.c:719:ldlm_resource_add()) lvbo_init failed for
resource 212576: rc -2
Sep 16 18:37:16 node1 kernel: LustreError:
2416:0:(ldlm_resource.c:719:ldlm_resource_add()) Skipped 15 previous
similar messages
Sep 16 18:43:20 node1 kernel: LustreError:
2341:0:(fsfilt-ldiskfs.c:281:fsfilt_ldiskfs_start()) error starting
handle for op 8 (71 credits): rc -30
Sep 16 18:43:20 node1 kernel: LustreError:
2341:0:(filter.c:273:filter_client_add()) unable to start transaction:
rc -30
Sep 16 18:43:20 node1 kernel: LustreError:
2341:0:(filter.c:273:filter_client_add()) Skipped 36 previous similar
messages
Sep 16 18:43:20 node1 kernel: LustreError:
2341:0:(filter.c:294:filter_client_add()) error writing last_rcvd client
idx 34: rc -30
Sep 16 18:43:20 node1 kernel: LustreError:
2341:0:(filter.c:294:filter_client_add()) Skipped 36 previous similar
messages
Sep 16 18:43:20 node1 kernel: LustreError:
2341:0:(ldlm_lib.c:1442:target_send_reply_msg()) @@@ processing error
(-30) req at ffff81000276de00 x8/t0 o8-><?>@<?>:-1 lens 240/144 ref 0 fl
Interpret:/0/0 rc -30/0
Sep 16 18:43:21 node1 kernel: LustreError:
2341:0:(ldlm_lib.c:1442:target_send_reply_msg()) Skipped 5 previous
similar messages
Sep 16 18:44:10 node1 kernel: LustreError:
2399:0:(ldlm_lib.c:1442:target_send_reply_msg()) @@@ processing error
(-30) req at ffff81005f5d0400 x38/t0 o8-><?>@<?>:-1 lens 240/144 ref 0 fl
Interpret:/0/0 rc -30/0
Sep 16 18:44:10 node1 kernel: LustreError:
2399:0:(ldlm_lib.c:1442:target_send_reply_msg()) Skipped 1 previous
similar message
Sep 16 18:44:35 node1 kernel: LustreError:
2341:0:(filter.c:273:filter_client_add()) unable to start transaction:
rc -30
Sep 16 18:44:35 node1 kernel: LustreError:
2341:0:(filter.c:273:filter_client_add()) Skipped 2 previous similar
messages
Sep 16 18:44:35 node1 kernel: LustreError:
2341:0:(filter.c:294:filter_client_add()) error writing last_rcvd client
idx 51: rc -30
Sep 16 18:44:35 node1 kernel: LustreError:
2341:0:(filter.c:294:filter_client_add()) Skipped 2 previous similar
messages
errno -2 was right after fsck, it's OK.
But why does -30 is here? I hoped, it will disappear after fsck, but I
see again. What could cause this problem? How can I solve it?
Thank you,
tamas
More information about the lustre-discuss
mailing list