[Lustre-discuss] OSS not healty

Frank Mietke frank.mietke at informatik.tu-chemnitz.de
Thu Mar 13 06:55:25 PDT 2008


Brian,

On Thu, Mar 13, 2008 at 01:44:45PM +0100, Brian J. Murrell wrote:
> On Thu, 2008-03-13 at 12:34 +0100, Frank Mietke wrote:
> 
> > okay I've found the following in /var/log/messages before the bulk of above
> > messages come. It seems that something with the RAID went wrong.
> 
> I don't see anything RAID specific however...

you're right, my mistake. 


> 
> > Mar 13 06:17:31 chic2e24 kernel: [3068633.701448] attempt to access beyond end of device
> > Mar 13 06:17:31 chic2e24 kernel: [3068633.701454] sda: rw=1, want=11287722456, limit=7796867072
> 
> This is pretty self-explanatory.  Something tried to read beyond the end
> of the disk.  Something has a misunderstanding of how big the disk is.

That's it why I'm asking. 

> Is it possible that the disk format process was misled about the disk
> size during initialization?
> 
> Andreas, does mkfs do any bounds checking to verify the sanity of the
> mkfs request?  I.e. does it make sure that if/when you specify a number
> of blocks for a filesystem that that many block are available?
> 
> Frank, is it at all possible that the size of the device had somehow
> gotten smaller since you first initialized it?

I think, no, because all the other OSTs show the same size. Is there a way to
request the assumptions of disk size from the MGS/MDS?

Frank




-- 
Dipl.-Inf. Frank Mietke     |     Fakultätsrechen- und Informationszentrum
Tel.: 0371 - 531 - 35538    |     Fak. für Informatik
Fax:  0371 - 531 8 35538    |     TU-Chemnitz
Key-ID: 60F59599            |     frank.mietke at informatik.tu-chemnitz.de



More information about the lustre-discuss mailing list