[Lustre-discuss] OSS load in the roof

Brian J. Murrell Brian.Murrell at Sun.COM
Fri Jun 27 10:07:32 PDT 2008


On Fri, 2008-06-27 at 12:44 -0400, Brock Palen wrote:
> 
> All of them are stuck in un-interruptible sleep.
> Has anyone seen this happen before?  Is this caused by a pending disk  
> failure?

Well, they are certainly stuck because of some blocking I/O.  That could
be disk failure, indeed.

> mptscsi: ioc1: attempting task abort! (sc=0000010038904c40)
> scsi1 : destination target 0, lun 0
>          command = Read (10) 00 75 94 40 00 00 10 00 00
> mptscsi: ioc1: task abort: SUCCESS (sc=0000010038904c40)

That does not look like a picture of happiness, indeed, no.  You have
SCSI commands aborting.

> Lustre: 6698:0:(lustre_fsfilt.h:306:fsfilt_setattr()) nobackup- 
> OST0001: slow setattr 100s
> Lustre: 6698:0:(watchdog.c:312:lcw_update_time()) Expired watchdog  
> for pid 6698 disabled after 103.1261s

Those are just fallout from the above disk situation.

b.

-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
URL: <http://lists.lustre.org/pipermail/lustre-discuss-lustre.org/attachments/20080627/a4c33f34/attachment.pgp>


More information about the lustre-discuss mailing list