[Lustre-discuss] Oops: Lustre mount of MDS causes kernel panic in mds_free_client

Charles A. Taylor taylor at hpc.ufl.edu
Fri Sep 4 06:10:12 PDT 2009


You may want to try "The Dilger Procedure".   See

 http://wiki.hpc.ufl.edu/index.php/Lustre

This has saved us a number of times.

Charlie Taylor
UF HPC center

On Fri, 2009-09-04 at 08:45 -0400, Dave Johnson wrote:
> Our lustre filesystem is unable to run because the MDS host
> crashes immediately while mounting the metadata file system.
> It is accessing an invalid address (deadbeef) in the routine
> mds_free_client.  The Lustre version is 1.6.0.1.  Copying the
> crash log from the console by hand (lost the password to the
> management processors so we can't do serial console anymore):
> 
> mount.lustre  Cannot handle kernel paging request mds_client_free+612
> Trace:
> mds_destroy_export
> obdclass:class_export_destroy
> obdclass:obd_zombie_impexp_call
> obdclass:class_detach
> obdclass:class_process_config
> obdclass:class_manual_cleanup
> obdclass:lustre_fill_super
> 
> I found messages in the mailing list about removing CATALOGS and OBJECTS/*
> and mounting using -o abort_recov.  I tried these things, in addition to
> removing PENDING/* (all empty files).  This last crash trace was done
> (accidentally) without the -o abort_recov mount option, but the outcome
> did not improve on the earlier attempts.  
> 
> Any help in this would be greatly appreciated.
> 
> Thanks,
> 
>  -- ddj
> 
> Dave Johnson
> Brown University CCV
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss




More information about the lustre-discuss mailing list