[lustre-discuss] File locking errors.

Prentice Bisbal pbisbal at pppl.gov
Thu Feb 15 09:02:33 PST 2018


Hi.

I'm an experience HPC system admin, but I know almost nothing about 
Lustre administration. The system admin who administered our small 
Lustre filesystem recently retired, and no one has filled that gap yet. 
A user recently reported they are now getting file-locking errors from a 
program they've run repeatedly on Lustre in the past. When the run the 
same program on an NFS filesystem, the error goes away. I've 
cut-and-pasted the error messages below.

Since I have real experience as a Lustre admin, I turned to google, and 
it looks like it might be that the file-locking daemon died (if Lustre 
has a separate file-lock daemon), or somehow file-locking was recently 
disabled. If that is possible, how do I check this, and restart or 
re-enable if necessary?  I skimmed the user manual, and could not find 
anything on either of these issues.

Any and all help will be greatly appreciated.

Some of the error messages:

HDF5-DIAG: Error detected in HDF5 (1.10.0-patch1) MPI-process 9:
   #000: H5F.c line 579 in H5Fopen(): unable to open file
     major: File accessibilty
     minor: Unable to open file
   #001: H5Fint.c line 1168 in H5F_open(): unable to lock the file or 
initialize file structure
     major: File accessibilty
     minor: Unable to open file
   #002: H5FD.c line 1821 in H5FD_lock(): driver lock request failed
     major: Virtual File Layer
     minor: Can't update object
   #003: H5FDsec2.c line 939 in H5FD_sec2_lock(): unable to flock file, 
errno = 38, error message = 'Function not implemented'
     major: File accessibilty
     minor: Bad file ID accessed
Error: couldn't open file HDF5-DIAG: Error detected in HDF5 
(1.10.0-patch1) MPI-process 13:
   #000: H5F.c line 579 in H5Fopen(): unable to open file
     major: File accessibilty
     minor: Unable to open file
   #001: H5Fint.c line 1168 in H5F_open(): unable to lock the file or 
initialize file structure
     major: File accessibilty
     minor: Unable to open file
   #002: H5FD.c line 1821 in H5FD_lock(): driver lock request failed
     major: Virtual File Layer
     minor: Can't update object
   #003: H5FDsec2.c line 939 in H5FD_sec2_lock(): unable to flock file, 
errno = 38, error message = 'Function not implemented'
     major: File accessibilty
     minor: Bad file ID accessed

-- 
Prentice



More information about the lustre-discuss mailing list