[Lustre-discuss] Problem after upgrade to 1.6.5

Enrico Morelli morelli at cerm.unifi.it
Tue Jul 8 05:41:48 PDT 2008


On Mon, 7 Jul 2008 12:27:10 +0200
Enrico Morelli <morelli at cerm.unifi.it> wrote:

> Dear all,
> 
> I've a problem after the upgrade from 1.6.4.1 to 1.6.5.
> 
> I've four OSTs to create a /fastfs lustre filesystem. In each OST I
> have the following fstab:
> 
> /dev/vg/fastfs_ost   /fastfs_ost   lustre defaults,_netdev        0 0
> lustre-server at tcp0:/fastfs   /fastfs   lustre _netdev,defaults    0 0
> 
> On the lustre-server I have:
> /dev/data_se/fastfs_mdt    /fastfs_mdt  lustre  defaults,_netdev  0 0
> 
> On one OST (192.168.100.101) I have the following error:
> Lustre: Client fastfs-client has started
> Lustre: Request x686 sent from fastfs-OST0000-osc-c5b6cc00 to NID 0 at lo
> 5s ago has timed out (limit 5s). 
> Lustre: Skipped 62 previous similar messages
> 
> Infact on the other I obtain:
> Lustre: Client fastfs-client has started
> Lustre: Request x463 sent from fastfs-OST0000-osc-f7d1f800 to NID
> 192.168.100.101 at tcp 5s ago has timed out (limit 5s). 
> Lustre: Skipped 32 previous similar messages
> 
> 
> But on the lustre-server I have two OST that seems to be dead and one
> in timeout:
> Lustre: Client fastfs-client has started
> Lustre: fastfs-MDT0000: haven't heard from client
> d7fd9368-3f2b-7625-9c48-3de83b5c4cd3 (at 192.168.100.103 at tcp) in 231
> seconds. I think it's dead, and I am evicting it. 
> Lustre: fastfs-MDT0000: haven't heard from client
> 42c0e2c4-0844-8b8b-69b2-9c16ff0ba043 (at 192.168.100.100 at tcp) in 229
> seconds. I think it's dead, and I am evicting it.
> Lustre: Request x2950836 sent from fastfs-OST0000-osc to NID
> 192.168.100.101 at tcp 50s ago has timed out (limit 50s). 
> Lustre: Skipped 65 previous similar messages
> 
> On all machine I've installed the following rpms:
> lustre-ldiskfs-3.0.4-2.6.9_67.0.7.EL_lustre.1.6.5smp
> kernel-lustre-smp-2.6.9-67.0.7.EL_lustre.1.6.5
> lustre-1.6.5-2.6.9_67.0.7.EL_lustre.1.6.5smp
> lustre-modules-1.6.5-2.6.9_67.0.7.EL_lustre.1.6.5smp
> 
> On each node I have the following active modules:
> 
> lustre                644716  2 
> lov                   414696  3 lustre
> mdc                   144900  3 lustre
> lquota                212116  3 
> osc                   224680  6 lustre
> ksocklnd              138984  1 
> ptlrpc                970676  6 mgc,lustre,lov,mdc,lquota,osc
> obdclass              677464  9 mgc,lustre,lov,mdc,lquota,osc,ptlrpc
> lnet                  267292  4 lustre,ksocklnd,ptlrpc,obdclass
> lvfs                   90360  8
> mgc,lustre,lov,mdc,lquota,osc,ptlrpc,obdclass libcfs
> 132044  11
> mgc,lustre,lov,mdc,lquota,osc,ksocklnd,ptlrpc,obdclass,lnet,lvfs
> 
> With 1.6.4.1 all works fine, where I can check to solve the problem?
> 
> Thanks


The problems was solved itself. Tomorrow I found the /fastfs lustre
filesystem mounted everywhere without problems.

-- 
-------------------------------------------------------------------
       (o_
(o_    //\  Coltivate Linux che tanto Windows si pianta da solo.
(/)_   V_/_
+------------------------------------------------------------------+
|     ENRICO MORELLI         |  email: morelli at CERM.UNIFI.IT       |
| *     *       *       *    |  phone: +39 055 4574269             |
|  University of Florence    |  fax  : +39 055 4574253             |
|  CERM - via Sacconi, 6 -  50019 Sesto Fiorentino (FI) - ITALY    |
+------------------------------------------------------------------+



More information about the lustre-discuss mailing list