[lustre-discuss] MDT quota problem / MDS crash 2.5.3
Guido Laubender
laubender at zib.de
Tue Jul 12 14:18:01 PDT 2016
English version (I'm sorry for my previous mail in German - but should
have been a personal mail to Thomas only :( ):
We were recently able to fix wrong Lustre inode quotas by disabling and
re-enabling quota support on the MDT by 'tune2fs -O ^quota /dev/mdt' and
'tunefs.lustre --quota /dev/mdt'.
Maybe it helps here as well.
On Tue, 12 Jul 2016, Guido Laubender wrote:
> Bei uns waren vor kurzem die Inode-Quoten nicht korrekt; durch Deaktivieren
> und anschließendes Aktivieren der Quoten-Unterstützung (mittels 'tune2fs -O
> ^quota' und anschließendem 'tunefs.lustre --quota') auf dem MDT konnten wir
> es wieder reparieren.
>
> Vielleicht hilft das bei Euch auch...
>
> On Tue, 12 Jul 2016, Thomas Roth wrote:
>
>> Hi all,
>>
>> we are running Lustre 2.5.3 on our servers. OSTs are on ZFS, MDS is on
>> ldiskfs.
>>
>> After a MDS crash and e2fsck 1.42.9.wc1 on the partition, the MDS mounts
>> but causes high-frequency log entries
>>
>> Jul 12 16:06:38 lxmds12 kernel: VFS: find_free_dqentry(): Data block full
>> but it shouldn't.
>> Jul 12 16:06:38 lxmds12 kernel: VFS: Error -5 occurred while creating
>> quota.
>>
>> interspersed with
>>
>> Jul 12 16:06:38 lxmds12 kernel: LustreError:
>> 13159:0:(qsd_handler.c:1155:qsd_op_adjust()) nyx-MDT0000: fail to locate
>> lqe for id:6763, type:0
>> Jul 12 16:06:38 lxmds12 kernel: LustreError:
>> 13159:0:(qsd_handler.c:1155:qsd_op_adjust()) Skipped 4973 previous similar
>> messages
>>
>> or
>>
>> Jul 12 15:59:26 lxmds12 kernel: LustreError:
>> 13414:0:(qsd_entry.c:211:qsd_refresh_usage()) $$$ failed to read disk
>> usage, rc:-3 qsd:nyx-MDT0000 qtype:usr id:7408 enforced:0 granted:0
>> pending:0 waiting:0 req:0 usage:0 qunit:0 qtune:0 edquot:0
>> Jul 12 15:59:26 lxmds12 kernel: LustreError:
>> 13414:0:(qsd_entry.c:211:qsd_refresh_usage()) Skipped 5166 previous similar
>> messages
>>
>>
>> According to our experience from the last few days, this will eventually
>> bring all Lustre operations to a halt.
>>
>>
>> Both the web and the e2fsck-messages ([QUOTA WARNING] Usage inconsistent
>> for ID 7989:actual (278528000, 738675) != expected (222507008, 531071))
>> hint towards quota issues.
>>
>> Therefore, we have 'switched off' quota by "lctl conf_param
>> fsname.quota.ost|mdt=u|g|ug|none", restarted, umounted and 'switched on'
>> quota again, restarted, unmounted.
>>
>> -> The VSF-Errors still appear.
>>
>> Is there anything else we could do?
>> Mount the MDT as ldiskfs and do nasty things on the disk?
>> Is there any command that recalculates / rewrites the quota files on the
>> MDT?
>>
>>
>>
>> (As long as Lustre is still accessible, 'lfs quota' gives results for both
>> users and groups, but at least the file count is entirely wrong (all of my
>> own Lustre files amount to exactle 0 files).
>>
>> And the update of the usage numbers does not work either - I managed to
>> copy a 1GB-file and still had the same kbytes used...)
>>
>>
>> Regards,
>> Thomas
>>
>> --
>> --------------------------------------------------------------------
>> Thomas Roth
>> Department: Informationstechnologie
>> Location: SB3 1.250
>> Phone: +49-6159-71 1453 Fax: +49-6159-71 2986
>>
>> GSI Helmholtzzentrum für Schwerionenforschung GmbH
>> Planckstraße 1
>> 64291 Darmstadt
>> www.gsi.de
>>
>> Gesellschaft mit beschränkter Haftung
>> Sitz der Gesellschaft: Darmstadt
>> Handelsregister: Amtsgericht Darmstadt, HRB 1528
>>
>> Geschäftsführung: Ursula Weyrich
>> Professor Dr. Karlheinz Langanke
>> Jörg Blaurock
>>
>> Vorsitzende des Aufsichtsrates: St Dr. Georg Schütte
>> Stellvertreter: Ministerialdirigent Dr. Rolf Bernhardt
>>
>> _______________________________________________
>> lustre-discuss mailing list
>> lustre-discuss at lists.lustre.org
>> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
>
More information about the lustre-discuss
mailing list