Hello Florin,

As the filesystem servers do not exist anymore as you deleted it previously, 
the client could not reach them to complete the unmount process.

Try unmounting them using '-f' flag, ie: 'umount -f <filesystem path>'


You should also reach out to AWS support and check that with them.

Aurélien



Le 21/12/2021 00:54, « lustre-discuss au nom de Florin Andrei » 
<lustre-discuss-boun...@lists.lustre.org au nom de flo...@andrei.myip.org> a 
écrit :

    CAUTION: This email originated from outside of the organization. Do not 
click links or open attachments unless you can confirm the sender and know the 
content is safe.



    We've created a few Lustre FS endpoints in AWS. They were mounted on a
    system. The Lustre endpoints got terminated soon after that, and others
    were created instead.

    Now the old Lustre filesystems appear to be mounted on that node, and
    there's automation trying to unmount them, resulting in a very large
    number of umount processes just hanging. In dmesg I see this message
    repeated many, many times:

    Lustre: 919:0:(client.c:2116:ptlrpc_expire_one_request()) @@@ Request
    sent has failed due to network error:

    What is the recommended procedure to unmount those FSs? Just running
    umount manually also hangs indefinitely. I would prefer to not reboot
    that node.

    --
    Florin Andrei
    https://florin.myip.org/
    _______________________________________________
    lustre-discuss mailing list
    lustre-discuss@lists.lustre.org
    http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


_______________________________________________
lustre-discuss mailing list
lustre-discuss@lists.lustre.org
http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org

Reply via email to