[lustre-discuss] Rebuild server

Peter Bortas bortas at gmail.com
Wed Mar 16 07:06:05 PDT 2016


Hi Jon,

Just for extra reinsurance from the real world: NSC reinstalls the
same system image on all MDSs and OSSs every time we reboot our
servers. So there is no special magic in the OS that needs to be
preserved with the exception of the /etc/ldev.conf file. In our case
we have fixed that difference between servers by having an init script
that knows how to recreate it.

Regards,
-- 
Peter Bortas
National Supercomputer Centre
Sweden


On Fri, Mar 11, 2016 at 12:41 PM, Jon Tegner <tegner at foi.se> wrote:
> Thanks! Much appreciated!
>
> Was quite stressed when I noticed the server was down (data is backed up,
> but still). Our servers are managed/provisioned by kickstart and saltstack -
> so it should be easy to bring up new ones with the same configuration.
>
> Thanks again,
>
> /jon
>
> On 03/11/2016 07:05 AM, Cowe, Malcolm J wrote:
>>
>> So, in summary: rebuild the root disks (maybe use a provisioning system
>> like kickstart for repeatability), restore the network config, restore LNet
>> config, maybe restore the HA software, restore the identity management (e.g.
>> LDAP, passwd, group) then mount the storage as before.
>
>
> _______________________________________________
> lustre-discuss mailing list
> lustre-discuss at lists.lustre.org
> http://lists.lustre.org/listinfo.cgi/lustre-discuss-lustre.org


More information about the lustre-discuss mailing list