[lustre-discuss] how does lustre handle node failure
Laura Hild
lsh at jlab.org
Tue Jul 18 13:25:00 PDT 2023
I'm not familiar with using FLR to tolerate OSS failures. My site does the HA pairs with shared storage method. It's sort of described in the manual
https://doc.lustre.org/lustre_manual.xhtml#configuringfailover
but in more, Pacemaker-specific detail at
https://wiki.lustre.org/Creating_a_Framework_for_High_Availability_with_Pacemaker
and
https://wiki.lustre.org/Creating_Pacemaker_Resources_for_Lustre_Storage_Services
More information about the lustre-discuss
mailing list