[lustre-discuss] how does lustre handle node failure

Laura Hild lsh at jlab.org
Tue Jul 18 13:25:00 PDT 2023


I'm not familiar with using FLR to tolerate OSS failures.  My site does the HA pairs with shared storage method.  It's sort of described in the manual

  https://doc.lustre.org/lustre_manual.xhtml#configuringfailover

but in more, Pacemaker-specific detail at

  https://wiki.lustre.org/Creating_a_Framework_for_High_Availability_with_Pacemaker

and

  https://wiki.lustre.org/Creating_Pacemaker_Resources_for_Lustre_Storage_Services



More information about the lustre-discuss mailing list