[lustre-discuss] Full OST

Alastair Basden a.g.basden at durham.ac.uk
Fri Sep 3 13:51:55 PDT 2021


Hi,

lctl get_param mdt.*.exports.*.open_files  returns:
mdt.snap8-MDT0000.exports.172.18.180.21 at o2ib.open_files=
[0x20000b90e:0x10aa:0x0]
mdt.snap8-MDT0000.exports.172.18.180.22 at o2ib.open_files=
[0x20000b90e:0x21b3:0x0]
mdt.snap8-MDT0000.exports.172.18.181.19 at o2ib.open_files=
[0x20000b90e:0x21b3:0x0]
[0x20000b90e:0x21b4:0x0]
[0x20000b90c:0x1574:0x0]
[0x20000b90c:0x1575:0x0]
[0x20000b90c:0x1576:0x0]

Doesn't seem to be many open, so I don't think it's a problem of open 
files.

Not sure which bit of this I need to use with lfs fid2path either...

Cheers,
Alastair.


On Fri, 3 Sep 2021, Andreas Dilger wrote:

> [EXTERNAL EMAIL]
> You can also check "mdt.*.exports.*.open_files" on the MDTs for a list of FIDs open on each client, and use "lfs fid2path" to resolve them to a pathname.
>
> On Sep 3, 2021, at 02:09, Degremont, Aurelien via lustre-discuss <lustre-discuss at lists.lustre.org<mailto:lustre-discuss at lists.lustre.org>> wrote:
>
> Hi
>
> It could be a bug, but most of the time, this is due to an open-unlinked file, typically a log file which is still in use and some processes keep writing to it until it fills the OSTs it is using.
>
> Look for such files on your clients (use lsof).
>
> Aurélien
>
>
> Le 03/09/2021 09:50, « lustre-discuss au nom de Alastair Basden » <lustre-discuss-bounces at lists.lustre.org<mailto:lustre-discuss-bounces at lists.lustre.org> au nom de a.g.basden at durham.ac.uk<mailto:a.g.basden at durham.ac.uk>> a écrit :
>
>   CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you can confirm the sender and know the content is safe.
>
>
>
>   Hi,
>
>   We have a file system where each OST is a single SSD.
>
>   One of those is reporting as 100% full (lfs df -h /snap8):
>   snap8-OST004d_UUID          5.8T        2.0T        3.5T  37% /snap8[OST:77]
>   snap8-OST004e_UUID          5.8T        5.5T        7.5G 100% /snap8[OST:78]
>   snap8-OST004f_UUID          5.8T        2.0T        3.4T  38% /snap8[OST:79]
>
>   However, I can't find any files on it:
>   lfs find --ost snap8-OST004e /snap8/
>   returns nothing.
>
>   I guess that it has filled up, and that there is some bug or other that is
>   now preventing proper behaviour - but I could be wrong.
>
>   Does anyone have any suggestions?
>
>   Essentially, I'd like to find some of the files and delete or migrate
>   some, and thus return it to useful production.
>
>   Cheers,
>   Alastair.
>   _______________________________________________
>   lustre-discuss mailing list
>   lustre-discuss at lists.lustre.org<mailto:lustre-discuss at lists.lustre.org>
>   http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
>
> _______________________________________________
> lustre-discuss mailing list
> lustre-discuss at lists.lustre.org<mailto:lustre-discuss at lists.lustre.org>
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org
>
> Cheers, Andreas
> --
> Andreas Dilger
> Lustre Principal Architect
> Whamcloud
>
>
>
>
>
>
>
>


More information about the lustre-discuss mailing list