[lustre-discuss] File locking errors.

Prentice Bisbal pbisbal at pppl.gov
Thu Feb 15 12:01:47 PST 2018


Version 2.8.0 on the clients. Not sure what's on the servers, since I 
don't manage them, and I can't find the guy who does at the moment.

Prentice

On 02/15/2018 12:11 PM, Jones, Peter A wrote:
> Prentice
>
> Do you know which version of Lustre is in use?
>
> Peter
>
>
>
>
> On 2018-02-15, 9:02 AM, "lustre-discuss on behalf of Prentice Bisbal" <lustre-discuss-bounces at lists.lustre.org on behalf of pbisbal at pppl.gov> wrote:
>
>> Hi.
>>
>> I'm an experience HPC system admin, but I know almost nothing about
>> Lustre administration. The system admin who administered our small
>> Lustre filesystem recently retired, and no one has filled that gap yet.
>> A user recently reported they are now getting file-locking errors from a
>> program they've run repeatedly on Lustre in the past. When the run the
>> same program on an NFS filesystem, the error goes away. I've
>> cut-and-pasted the error messages below.
>>
>> Since I have real experience as a Lustre admin, I turned to google, and
>> it looks like it might be that the file-locking daemon died (if Lustre
>> has a separate file-lock daemon), or somehow file-locking was recently
>> disabled. If that is possible, how do I check this, and restart or
>> re-enable if necessary?  I skimmed the user manual, and could not find
>> anything on either of these issues.
>>
>> Any and all help will be greatly appreciated.
>>
>> Some of the error messages:
>>
>> HDF5-DIAG: Error detected in HDF5 (1.10.0-patch1) MPI-process 9:
>>    #000: H5F.c line 579 in H5Fopen(): unable to open file
>>      major: File accessibilty
>>      minor: Unable to open file
>>    #001: H5Fint.c line 1168 in H5F_open(): unable to lock the file or
>> initialize file structure
>>      major: File accessibilty
>>      minor: Unable to open file
>>    #002: H5FD.c line 1821 in H5FD_lock(): driver lock request failed
>>      major: Virtual File Layer
>>      minor: Can't update object
>>    #003: H5FDsec2.c line 939 in H5FD_sec2_lock(): unable to flock file,
>> errno = 38, error message = 'Function not implemented'
>>      major: File accessibilty
>>      minor: Bad file ID accessed
>> Error: couldn't open file HDF5-DIAG: Error detected in HDF5
>> (1.10.0-patch1) MPI-process 13:
>>    #000: H5F.c line 579 in H5Fopen(): unable to open file
>>      major: File accessibilty
>>      minor: Unable to open file
>>    #001: H5Fint.c line 1168 in H5F_open(): unable to lock the file or
>> initialize file structure
>>      major: File accessibilty
>>      minor: Unable to open file
>>    #002: H5FD.c line 1821 in H5FD_lock(): driver lock request failed
>>      major: Virtual File Layer
>>      minor: Can't update object
>>    #003: H5FDsec2.c line 939 in H5FD_sec2_lock(): unable to flock file,
>> errno = 38, error message = 'Function not implemented'
>>      major: File accessibilty
>>      minor: Bad file ID accessed
>>
>> -- 
>> Prentice
>>
>> _______________________________________________
>> lustre-discuss mailing list
>> lustre-discuss at lists.lustre.org
>> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org



More information about the lustre-discuss mailing list