[lustre-discuss] unmount FS when endpoint is gone

Degremont, Aurelien degremoa at amazon.fr
Tue Dec 21 02:42:00 PST 2021


Hello Florin,

As the filesystem servers do not exist anymore as you deleted it previously, the client could not reach them to complete the unmount process.

Try unmounting them using '-f' flag, ie: 'umount -f <filesystem path>'


You should also reach out to AWS support and check that with them.

Aurélien



Le 21/12/2021 00:54, « lustre-discuss au nom de Florin Andrei » <lustre-discuss-bounces at lists.lustre.org au nom de florin at andrei.myip.org> a écrit :

    CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you can confirm the sender and know the content is safe.



    We've created a few Lustre FS endpoints in AWS. They were mounted on a
    system. The Lustre endpoints got terminated soon after that, and others
    were created instead.

    Now the old Lustre filesystems appear to be mounted on that node, and
    there's automation trying to unmount them, resulting in a very large
    number of umount processes just hanging. In dmesg I see this message
    repeated many, many times:

    Lustre: 919:0:(client.c:2116:ptlrpc_expire_one_request()) @@@ Request
    sent has failed due to network error:

    What is the recommended procedure to unmount those FSs? Just running
    umount manually also hangs indefinitely. I would prefer to not reboot
    that node.

    --
    Florin Andrei
    https://florin.myip.org/
    _______________________________________________
    lustre-discuss mailing list
    lustre-discuss at lists.lustre.org
    http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org




More information about the lustre-discuss mailing list