[lustre-discuss] How to activate an OST on a client ?
Haarst, Jan van
jan.vanhaarst at wur.nl
Tue Aug 27 05:39:55 PDT 2024
Since the last e-mail, the issue has popped up again.
Is there anything we can do on our end, other than rebooting the clients?
--
Jan van Haarst
HPC Administrator
Facilitair Bedrijf, part of Wageningen University & Research
Information Technology Department
P.O. Box 59, 6700 AB, Wageningen
Building 116, Akkermaalsbos 12, 6700 WB, Wageningen
http://www.wur.nl/nl/Disclaimer.htm
From: lustre-discuss <lustre-discuss-bounces at lists.lustre.org> on behalf of Haarst, Jan van via lustre-discuss <lustre-discuss at lists.lustre.org>
Date: Thursday, 22 August 2024 at 14:36
To: lustre-discuss at lists.lustre.org <lustre-discuss at lists.lustre.org>
Subject: [lustre-discuss] How to activate an OST on a client ?
Hi,
The subject line probably doesn't quite cover the issue. What we see is this:
We have a client behind a router (linking tcp to Omnipath) that shows one OST as inactive (everything is on Lustre 2.15.5).
Other clients that go through the same router do not currently have this issue.
One other client did have the same issue earlier, although it showed a different OST as inactive.
After a reboot, all was well again on that machine.
The clients can lctl ping the OSSs.
So although we have a workaround (reboot the client), it would be nice to:
1. Fix the issue without a reboot.
2. Fix the underlying issue.
It might be unrelated, but we also see another routing issue every now and then:
The router stops routing requests toward a certain OSS, and this can be fixed by deleting the peer_nid of that OSS from the router.
I am probably missing informative logs, but I'm more than happy to try to generate them if somebody can point me to how.
We are a bit stumped right now.
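For reference, a minimal sketch of how the inactive OST could be spotted on a client from a script. It assumes that `lctl dl` prints one device per line with the state (`UP` or `IN`) in the second column, the device type in the third, and the device name in the fourth; the function name `inactive_oscs` is hypothetical and only for illustration.

```python
import subprocess


def inactive_oscs(device_list: str) -> list[str]:
    """Return the names of OSC devices whose state column is not UP."""
    names = []
    for line in device_list.splitlines():
        fields = line.split()
        # Assumed column layout: index, state, type, name, uuid, refcount.
        if len(fields) >= 4 and fields[2] == "osc" and fields[1] != "UP":
            names.append(fields[3])
    return names


if __name__ == "__main__":
    # Needs to run on a Lustre client where lctl is available.
    result = subprocess.run(
        ["lctl", "dl"], capture_output=True, text=True, check=True
    )
    for name in inactive_oscs(result.stdout):
        print(name)
```

Run on the affected client, this would only print the OSC devices that are not reported as UP; it does not change any state.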
With kind regards,
--
Jan van Haarst
HPC Administrator
For Anunna/HPC questions, please use https://support.wur.nl (with HPC as service)
Present: Monday, Tuesday, Thursday & Friday