[Lustre-discuss] ost_write operation failed with -28

Brock Palen brockp at umich.edu
Tue Oct 9 14:57:27 PDT 2007


I had a client die with the following errors in dmesg:

Lustre: Client nobackup-client has started
standard.exe[4680]: segfault at 0000002ac4e88058 rip 00000033df807b7c rsp 0000000040bf7e08 error 4
LustreError: 11-0: an error occurred while communicating with 141.212.30.181@tcp. The ost_write operation failed with -28
LustreError: 11-0: an error occurred while communicating with 141.212.30.181@tcp. The ost_write operation failed with -28
LustreError: 11-0: an error occurred while communicating with 141.212.30.181@tcp. The ost_write operation failed with -28
LustreError: Skipped 3 previous similar messages
LustreError: 11-0: an error occurred while communicating with 141.212.30.181@tcp. The ost_write operation failed with -28
LustreError: Skipped 5 previous similar messages


After Abaqus segfaults, Lustre logs these errors, but the mount is
still usable. However, Abaqus (standard.exe) has never died like this
before on this input on this hardware (we are new Lustre users). The
input is s4b, a standard test case provided by Abaqus.

I found no reference to -28 on Google.
Any help would be great.
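
For reference, kernel-side components such as Lustre generally report
failures as negated errno values, so -28 can be looked up in the
system errno table. A minimal sketch, assuming a Linux host with
Python available; the helper name decode_lustre_rc is made up for
illustration and is not part of any Lustre tool:

```python
import errno
import os


def decode_lustre_rc(rc):
    """Map a negated errno (as printed in the log) to (name, message).

    Hypothetical helper: it only consults the local errno table and
    knows nothing about Lustre itself.
    """
    code = abs(rc)
    name = errno.errorcode.get(code, "UNKNOWN")
    return name, os.strerror(code)


# -28 is the value from "The ost_write operation failed with -28".
name, message = decode_lustre_rc(-28)
print("-28 -> %s: %s" % (name, message))
```

On Linux, errno 28 is ENOSPC ("No space left on device"), which would
suggest that an OST ran out of space during the write. If so, checking
per-OST usage from a client (for example with lfs df) may be a
reasonable next step.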

Brock Palen
Center for Advanced Computing
brockp at umich.edu
(734)936-1985




