[lustre-discuss] File locking errors.
Prentice Bisbal
pbisbal at pppl.gov
Thu Feb 15 09:02:33 PST 2018
Hi.
I'm an experienced HPC system admin, but I know almost nothing about
Lustre administration. The system admin who administered our small
Lustre filesystem recently retired, and no one has filled that gap yet.
A user recently reported they are now getting file-locking errors from a
program they've run repeatedly on Lustre in the past. When they run the
same program on an NFS filesystem, the error goes away. I've
cut-and-pasted the error messages below.
Since I have no real experience as a Lustre admin, I turned to Google, and
it looks like it might be that the file-locking daemon died (if Lustre
has a separate file-lock daemon), or somehow file-locking was recently
disabled. If that is possible, how do I check this, and restart or
re-enable if necessary? I skimmed the user manual, and could not find
anything on either of these issues.
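One way to check whether file locking is enabled on a client is to look at the mount options of the Lustre filesystem. The sketch below is only an illustration: it parses /proc/mounts-style text and reports whether each Lustre mount carries the `flock` or `localflock` option. The function name is made up for this example, and it assumes that a mount showing neither option is running without flock support, which should be confirmed against the manual for the installed Lustre version.

```python
def lustre_flock_modes(mounts_text):
    """Return {mountpoint: mode} for each Lustre entry in /proc/mounts-style text.

    Hypothetical helper for illustration only.
    """
    modes = {}
    for line in mounts_text.splitlines():
        fields = line.split()
        # Expected fields: device, mountpoint, fstype, options, dump, pass
        if len(fields) < 4 or fields[2] != "lustre":
            continue
        opts = set(fields[3].split(","))
        if "localflock" in opts:
            mode = "localflock"
        elif "flock" in opts:
            mode = "flock"
        else:
            # Assumption: no flock-related option listed means flock is disabled.
            mode = "noflock (assumed)"
        modes[fields[1]] = mode
    return modes
```

On a client this could be fed the contents of /proc/mounts, for example `lustre_flock_modes(open("/proc/mounts").read())`. If the result suggests flock is off, remounting the client with `-o flock` is the usual thing to try, but whether that applies here depends on how the filesystem was mounted before.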
Any and all help will be greatly appreciated.
Some of the error messages:
HDF5-DIAG: Error detected in HDF5 (1.10.0-patch1) MPI-process 9:
#000: H5F.c line 579 in H5Fopen(): unable to open file
major: File accessibilty
minor: Unable to open file
#001: H5Fint.c line 1168 in H5F_open(): unable to lock the file or
initialize file structure
major: File accessibilty
minor: Unable to open file
#002: H5FD.c line 1821 in H5FD_lock(): driver lock request failed
major: Virtual File Layer
minor: Can't update object
#003: H5FDsec2.c line 939 in H5FD_sec2_lock(): unable to flock file,
errno = 38, error message = 'Function not implemented'
major: File accessibilty
minor: Bad file ID accessed
Error: couldn't open file HDF5-DIAG: Error detected in HDF5
(1.10.0-patch1) MPI-process 13:
#000: H5F.c line 579 in H5Fopen(): unable to open file
major: File accessibilty
minor: Unable to open file
#001: H5Fint.c line 1168 in H5F_open(): unable to lock the file or
initialize file structure
major: File accessibilty
minor: Unable to open file
#002: H5FD.c line 1821 in H5FD_lock(): driver lock request failed
major: Virtual File Layer
minor: Can't update object
#003: H5FDsec2.c line 939 in H5FD_sec2_lock(): unable to flock file,
errno = 38, error message = 'Function not implemented'
major: File accessibilty
minor: Bad file ID accessed
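The failing call in the trace above is flock(), and errno 38 on Linux is ENOSYS. A small probe like the following might help reproduce the failure outside of HDF5 and MPI. It is a minimal sketch, not a diagnostic tool: the function names are made up for this example, and it simply creates a scratch file in the given directory, tries a non-blocking exclusive flock on it, and reports the errno if the call fails.

```python
import errno
import fcntl
import os
import tempfile


def probe_flock(directory):
    """Try flock() on a scratch file in `directory`.

    Returns None if the lock was taken and released, otherwise the errno.
    Hypothetical helper for illustration only.
    """
    fd, path = tempfile.mkstemp(dir=directory, prefix="flock_probe_")
    try:
        try:
            fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
        except OSError as exc:
            return exc.errno
        fcntl.flock(fd, fcntl.LOCK_UN)
        return None
    finally:
        # Always clean up the scratch file, whether or not the lock succeeded.
        os.close(fd)
        os.unlink(path)


def explain(code):
    """Turn the probe result into a short human-readable message."""
    if code is None:
        return "flock works"
    name = errno.errorcode.get(code, str(code))
    return f"flock failed: {name} ({os.strerror(code)})"
```

Running `print(explain(probe_flock("/path/on/lustre")))` against a directory on the Lustre mount (the path here is a placeholder) and then against the NFS directory would show whether the two filesystems really differ in flock support. A result equal to `errno.ENOSYS` on Lustre would match the HDF5 error above.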
--
Prentice