[Lustre-discuss] OSS load in the roof

Brock Palen brockp at umich.edu
Fri Jun 27 13:17:00 PDT 2008


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

On Jun 27, 2008, at 1:07 PM, Brian J. Murrell wrote:
> On Fri, 2008-06-27 at 12:44 -0400, Brock Palen wrote:
>>
>> All of them are stuck in un-interruptible sleep.
>> Has anyone seen this happen before?  Is this caused by a pending disk
>> failure?
>
> Well, they are certainly stuck because of some blocking I/O.  That could
> be disk failure, indeed.
>
>> mptscsi: ioc1: attempting task abort! (sc=0000010038904c40)
>> scsi1 : destination target 0, lun 0
>>          command = Read (10) 00 75 94 40 00 00 10 00 00
>> mptscsi: ioc1: task abort: SUCCESS (sc=0000010038904c40)
>
> That does not look like a picture of happiness, indeed, no.  You have
> SCSI commands aborting.

While the array was reporting no problems, one of the disks was really
lagging behind the others. We have swapped it out.  Thanks for the
feedback, everyone.
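
For anyone who wants to see which region of the disk the aborted read in the mptscsi message was touching, here is a minimal sketch that decodes the command bytes. It assumes the kernel printed the opcode as the name "Read (10)" followed by the remaining nine CDB bytes, and it assumes 512-byte logical blocks, which the log does not confirm. The function name decode_read10 is made up for illustration.

```python
def decode_read10(cdb_tail_hex: str, block_size: int = 512) -> dict:
    """Decode the nine bytes that follow the opcode of a SCSI READ(10) CDB.

    block_size is an assumption (512-byte logical blocks); adjust it if the
    device reports something else.
    """
    tail = bytes.fromhex(cdb_tail_hex)
    if len(tail) != 9:
        raise ValueError("expected 9 bytes after the READ(10) opcode")

    # READ(10) layout after the opcode byte:
    #   byte 1     flags
    #   bytes 2-5  logical block address (big-endian)
    #   byte 6     group number
    #   bytes 7-8  transfer length in blocks (big-endian)
    #   byte 9     control
    lba = int.from_bytes(tail[1:5], "big")
    blocks = int.from_bytes(tail[6:8], "big")

    return {
        "lba": lba,
        "blocks": blocks,
        "offset_bytes": lba * block_size,
        "length_bytes": blocks * block_size,
    }


if __name__ == "__main__":
    # Bytes copied from the "command = Read (10) ..." line quoted above.
    info = decode_read10("00 75 94 40 00 00 10 00 00")
    for key, value in info.items():
        print(f"{key}: {value}")
```

If the same logical block address range keeps showing up in repeated task aborts, that would point at one slow region or one slow member disk rather than at the controller, but that is only a rule of thumb.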

>
>> Lustre: 6698:0:(lustre_fsfilt.h:306:fsfilt_setattr()) nobackup-OST0001: slow setattr 100s
>> Lustre: 6698:0:(watchdog.c:312:lcw_update_time()) Expired watchdog for pid 6698 disabled after 103.1261s
>
> Those are just fallout from the above disk situation.
>
> b.

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (Darwin)

iD8DBQFIZUq/MFCQB4Bvz5QRAvacAJ9jkhi+2KgfbJ7bUI/KfHJ0Hnq1wQCeNgHO
d6+tzscwCqwYtuHXmzT2kFI=
=5p1N
-----END PGP SIGNATURE-----