[Lustre-discuss] csum errors

Stuart Midgley sdm900 at gmail.com
Wed Aug 27 20:57:13 PDT 2008


We recently upgraded from 1.4.10.1 to 1.6.5.1 (clients and servers)  
and now we are seeing errors like


Aug 27 07:49:54 oss025 kernel: LustreError: 3738:0:(ost_handler.c: 
1163:ost_brw_write()) client csum 2dbc1696, server csum 9d081697
Aug 27 07:49:54 oss025 kernel: LustreError: 168-f: p1-OST0018: BAD  
WRITE CHECKSUM: changed in transit before arrival at OST from  
12345-172.16.4.93 at tcp inum 24522277/426969871 object 12021/0 extent  
[10485760-11534335]
Aug 27 07:49:55 oss025 kernel: LustreError: 3738:0:(ost_handler.c: 
1225:ost_brw_write()) client csum 2dbc1696, original server csum  
9d081697, server csum now 9d081697


always from the same cluster node...  Should we be worried?  I suspect  
this means we shouldn't turn check summing off?  I assume these are  
rejected and resent from the client?


-- 
Dr Stuart Midgley
sdm900 at gmail.com






More information about the lustre-discuss mailing list