[lustre-discuss] Wrong --index set for OST
rodger
rodger at csag.uct.ac.za
Mon Sep 25 02:21:13 PDT 2017
Dear All,
I'm still struggling with this. I am running an lfsck -A at present. The
status update is reporting:
layout_mdts_init: 0
layout_mdts_scanning-phase1: 1
layout_mdts_scanning-phase2: 0
layout_mdts_completed: 0
layout_mdts_failed: 0
layout_mdts_stopped: 0
layout_mdts_paused: 0
layout_mdts_crashed: 0
layout_mdts_partial: 0
layout_mdts_co-failed: 0
layout_mdts_co-stopped: 0
layout_mdts_co-paused: 0
layout_mdts_unknown: 0
layout_osts_init: 0
layout_osts_scanning-phase1: 0
layout_osts_scanning-phase2: 12
layout_osts_completed: 0
layout_osts_failed: 30
layout_osts_stopped: 0
layout_osts_paused: 0
layout_osts_crashed: 0
layout_osts_partial: 0
layout_osts_co-failed: 0
layout_osts_co-stopped: 0
layout_osts_co-paused: 0
layout_osts_unknown: 0
layout_repaired: 82358851
namespace_mdts_init: 0
namespace_mdts_scanning-phase1: 1
namespace_mdts_scanning-phase2: 0
namespace_mdts_completed: 0
namespace_mdts_failed: 0
namespace_mdts_stopped: 0
namespace_mdts_paused: 0
namespace_mdts_crashed: 0
namespace_mdts_partial: 0
namespace_mdts_co-failed: 0
namespace_mdts_co-stopped: 0
namespace_mdts_co-paused: 0
namespace_mdts_unknown: 0
namespace_osts_init: 0
namespace_osts_scanning-phase1: 0
namespace_osts_scanning-phase2: 0
namespace_osts_completed: 0
namespace_osts_failed: 0
namespace_osts_stopped: 0
namespace_osts_paused: 0
namespace_osts_crashed: 0
namespace_osts_partial: 0
namespace_osts_co-failed: 0
namespace_osts_co-stopped: 0
namespace_osts_co-paused: 0
namespace_osts_unknown: 0
namespace_repaired: 68265278
with the layout_repaired and namespace_repaired values ticking up at
about 10000 per second.
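To keep track of these counters between status checks, here is a minimal Python sketch. It assumes the status text has been saved as plain `name: value` lines exactly as pasted above. The helper names (`parse_lfsck_status`, `nonzero_states`, `repair_rate`) are made up for illustration and are not part of any Lustre tool.

```python
def parse_lfsck_status(text):
    """Parse 'name: value' lines into a dict of integer counters."""
    counters = {}
    for line in text.splitlines():
        name, sep, value = line.partition(":")
        value = value.strip()
        # Keep only lines that look like "<name>: <non-negative integer>".
        if sep and value.isdigit():
            counters[name.strip()] = int(value)
    return counters


def nonzero_states(counters, prefix):
    """Return the non-zero counters under a prefix, with the prefix stripped."""
    return {
        name[len(prefix):]: value
        for name, value in counters.items()
        if name.startswith(prefix) and value != 0
    }


def repair_rate(earlier, later, seconds, key="layout_repaired"):
    """Average increase per second of one counter between two samples."""
    return (later[key] - earlier[key]) / seconds


# A few lines copied from the status output above.
sample = """\
layout_osts_scanning-phase2: 12
layout_osts_failed: 30
layout_repaired: 82358851
"""
status = parse_lfsck_status(sample)
print(nonzero_states(status, "layout_osts_"))
```

Taking two samples a known number of seconds apart and passing them to `repair_rate` gives the per-second figure directly, rather than an estimate by eye.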
Is the layout_osts_failed value of 30 a concern?
Is there any way to know how far along it is?
I am also seeing many messages similar to the following in
/var/log/messages on the MDS and on the OSS that hosts OST0000:
Sep 25 10:48:00 mds0l210 kernel: LustreError: 5934:0:(osp_precreate.c:903:osp_precreate_cleanup_orphans()) terra-OST0000-osc-MDT0000: cannot cleanup orphans: rc = -22
Sep 25 10:48:00 mds0l210 kernel: LustreError: 5934:0:(osp_precreate.c:903:osp_precreate_cleanup_orphans()) Skipped 599 previous similar messages
Sep 25 10:48:30 mds0l210 kernel: LustreError: 6137:0:(fld_handler.c:256:fld_server_lookup()) srv-terra-MDT0000: Cannot find sequence 0x8: rc = -2
Sep 25 10:48:30 mds0l210 kernel: LustreError: 6137:0:(fld_handler.c:256:fld_server_lookup()) Skipped 16593 previous similar messages
Sep 25 10:58:01 mds0l210 kernel: LustreError: 5934:0:(osp_precreate.c:903:osp_precreate_cleanup_orphans()) terra-OST0000-osc-MDT0000: cannot cleanup orphans: rc = -22
Sep 25 10:58:01 mds0l210 kernel: LustreError: 5934:0:(osp_precreate.c:903:osp_precreate_cleanup_orphans()) Skipped 599 previous similar messages
Sep 25 10:58:57 mds0l210 kernel: LustreError: 6137:0:(fld_handler.c:256:fld_server_lookup()) srv-terra-MDT0000: Cannot find sequence 0x8: rc = -2
Sep 25 10:58:57 mds0l210 kernel: LustreError: 6137:0:(fld_handler.c:256:fld_server_lookup()) Skipped 40309 previous similar messages
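The rc values in these messages look like negated Linux errno codes. That is an assumption based on the usual kernel convention, not something the log states. Under that assumption, a small Python sketch for decoding them; the function name `decode_rc` is hypothetical:

```python
import errno
import os
import re

# Matches the trailing "rc = <number>" found in LustreError lines.
RC_PATTERN = re.compile(r"rc = (-?\d+)")


def decode_rc(line):
    """Return (rc, errno symbol, description) for a log line, or None if no rc is present."""
    match = RC_PATTERN.search(line)
    if match is None:
        return None
    rc = int(match.group(1))
    code = abs(rc)  # assumption: rc is a negated errno value
    symbol = errno.errorcode.get(code, "UNKNOWN")
    return rc, symbol, os.strerror(code)


for message in (
    "terra-OST0000-osc-MDT0000: cannot cleanup orphans: rc = -22",
    "srv-terra-MDT0000: Cannot find sequence 0x8: rc = -2",
):
    print(decode_rc(message))
```

On Linux this maps -22 to EINVAL and -2 to ENOENT. It only names the error codes and says nothing about why Lustre returned them.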
Do these indicate that the process is not working?
Regards,
Rodger
On 23/09/2017 15:07, rodger wrote:
> Dear All,
>
> In the process of upgrading 1.8.x to 2.x, I've messed up a number of the
> index values for OSTs by running tune2fs with the --index value set. To
> compound matters, while trying to get the OSTs to mount, I erased the
> last_rcvd files on the OSTs. I'm looking for a way to confirm what the
> index should be for each device. Part of the reason for my difficulty is
> that, as the filesystem evolved, some OSTs were decommissioned,
> so the full set no longer has a sequential set of index values. When
> practicing for the upgrade, the trial sets that I created did have nice
> neat sequential indexes, and the process I developed broke when I used
> the real data. :-(
>
> The result is that, although the Lustre filesystem mounts and all
> directories appear to be listed, files in directories mostly have
> question marks for attributes and are not available for access. I'm
> assuming this is because the index for the OST holding the file is wrong.
>
> Any pointers to recovery would be much appreciated!
>
> Regards,
> Rodger
> _______________________________________________
> lustre-discuss mailing list
> lustre-discuss at lists.lustre.org
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org