[lustre-discuss] OST doomed after e2fsck

Riccardo Veraldi Riccardo.Veraldi at cnaf.infn.it
Wed May 30 20:36:43 PDT 2018


Hello,

after a power outage I had one of my OSTs (total of 60) in an unhappy state.

Lustre version 2.4.1

I ran then a FS check and here follows:

e2fsck 1.42.7.wc1 (12-Apr-2013)
Pass 1: Checking inodes, blocks, and sizes
Pass 2: Checking directory structure
Pass 3: Checking directory connectivity
Pass 4: Checking reference counts
Unattached inode 25793
Connect to /lost+found<y>? yes
Inode 25793 ref count is 2, should be 1.  Fix<y>? yes
Unattached inode 29096
Connect to /lost+found<y>? yes
Inode 29096 ref count is 2, should be 1.  Fix<y>? yes
Unattached inode 29745
Connect to /lost+found<y>? yes
Inode 29745 ref count is 2, should be 1.  Fix<y>? yes
Unattached inode 29821
Connect to /lost+found<y>? yes
Inode 29821 ref count is 2, should be 1.  Fix<y>? yes
yPass 5: Checking group summary information
Inode bitmap differences:  +23902 +29082 +29096 +29130 +29459 +29497
-29530 +29552 +29566 +29596 +(29643--29644) +29655 +29668 +29675 +29696
+29701 +29720 +29736 +29739 +29745 +29751 +29778 +29787 -29795 +29808
+29821
Fix<y>? yes
Free inodes count wrong for group #70 (1, counted=0).
Fix<y>? yes
Free inodes count wrong for group #76 (1, counted=0).
Fix<y>? yes
Free inodes count wrong for group #90 (1, counted=0).
Fix<y>? yes
Free inodes count wrong for group #93 (3, counted=2).
Fix<y>? yes
Free inodes count wrong for group #100 (2, counted=0).
Fix<y>? yes
Free inodes count wrong for group #101 (1, counted=0).
Fix<y>? yes
Free inodes count wrong for group #113 (5, counted=2).
Fix<y>? yes
Free inodes count wrong for group #114 (1, counted=0).
Fix<y>? yes
Free inodes count wrong for group #115 (13, counted=4).
Fix<y>? yes
Free inodes count wrong for group #116 (149, counted=140).
Fix<y>? yes
Free inodes count wrong (30493545, counted=30493516).
Fix<y>? yes
[QUOTA WARNING] Usage inconsistent for ID 0:actual (2083721216, 841) !=
expected (2082398208, 678)
[QUOTA WARNING] Usage inconsistent for ID 9997:actual (1095815659520,
19800) != expected (664375967744, 19791)
[QUOTA WARNING] Usage inconsistent for ID -1597706240:actual (0, 0) !=
expected (90112, 1)
[QUOTA WARNING] Usage inconsistent for ID -1428439040:actual (0, 0) !=
expected (126976, 1)
[QUOTA WARNING] Usage inconsistent for ID -1936064512:actual (0, 0) !=
expected (12288, 1)
[QUOTA WARNING] Usage inconsistent for ID -1684783104:actual (0, 0) !=
expected (28672, 1)
[QUOTA WARNING] Usage inconsistent for ID -2131947520:actual (0, 0) !=
expected (4096, 1)
[QUOTA WARNING] Usage inconsistent for ID 963263424:actual (957718528,
49) != expected (957628416, 48)
[QUOTA WARNING] Usage inconsistent for ID 987173056:actual (1364516864,
158) != expected (1364426752, 157)
[QUOTA WARNING] Usage inconsistent for ID -1537871872:actual (0, 0) !=
expected (73728, 1)
[QUOTA WARNING] Usage inconsistent for ID -2105077760:actual (0, 0) !=
expected (49152, 1)
[QUOTA WARNING] Usage inconsistent for ID -2145202176:actual (0, 0) !=
expected (24576, 1)
[QUOTA WARNING] Usage inconsistent for ID -1422704640:actual (0, 0) !=
expected (65536, 1)
Update quota info for quota type 0<y>? yes
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (472).
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (507).
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (170).
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (435).
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (89).
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (5).
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (130).
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (435).
[ERROR] quotaio_tree.c:357:free_dqentry:: Quota structure has offset to
other block (0) than it should (251).
[QUOTA WARNING] Usage inconsistent for ID 0:actual (8301957120, 843) !=
expected (5880315904, 677)
[QUOTA WARNING] Usage inconsistent for ID 2279:actual (14819280969728,
21842) != expected (14298746671104, 21705)
Update quota info for quota type 1<y>? yes

ana01-OST000e: ***** FILE SYSTEM WAS MODIFIED *****
ana01-OST000e: 29876/30523392 files (22.3% non-contiguous),
3670668872/3906963456 blocks


After this when trying to mount the OST again, it makes the lustre
kernel module hang and Linux kernel goes to panic.
it is reproducible, every time I try to mount the OST fixed with e2fsck.
So basically I lost all data on the OST.
Any hints on how could I recover it ?
thank you.

Rick







More information about the lustre-discuss mailing list