I would recommend the heartbeat with pacemaker setup for the fail-over control. The configuration may seem complex at the beginning but after enough reading (and there is many good sources) it is quite easy to setup. I have recently set up a Lustre system with 3 OSSs and two MDSs (DRBD with LVM between them) working as a single HA cluster and it was easy enough. Pacemaker allows single point of administration of lustre system (starting and stopping the filesystem) and there is a neat GUI for those who want to show something to their managers :)<br>

<br>Best regards,<br><br>Wojciech<br><br><div class="gmail_quote">On 10 August 2010 20:47, Bernd Schubert <span dir="ltr"><<a href="mailto:bs_lists@aakef.fastmail.fm">bs_lists@aakef.fastmail.fm</a>></span> wrote:<br>

<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;"><div class="im"><br>

On Tuesday, August 10, 2010, David Noriega wrote:<br>

> So your script resets the server so there is no fail-over(ie the other<br>

> server takes over resources from that server?) or there is failover<br>

> but you then manually return resources back to the server that was<br>

> reset?<br>

<br>

</div>Our ddn ipmi stonith script (external/ipmi_ddn in heartbeat/pacemaker stonith<br>

terms) only makes absolutely sure the node was really reset. If something<br>

fails, an error code is reported to pacemaker and then pacemaker (*) will not<br>

initiate resource fail-over in order to prevent split-brain.<br>

As Lustre devices use MMP (multiple-mount protection) that is not strictly<br>

required, in principal. But if something goes wrong. e.g. MMP was accidentally<br>

not enabled, a double mount could come up and that would cause serious<br>

filesystem and data corruption...<br>

<br>

<br>

Cheers,<br>

Bernd<br>

<br>

PS: (*) hearbeat-v1 (and v2/v3 if not in xml/crm mode) also *should* accept<br>

stonith error codes, but in general, I have seen it more than once that<br>

hearbeat-v1 run into split-brain and started resources on both cluster nodes.<br>

That is something where pacemaker does a much better job.<br>

<font color="#888888"><br>

--<br>

Bernd Schubert<br>

DataDirect Networks<br>

</font><div><div></div><div class="h5">_______________________________________________<br>

Lustre-discuss mailing list<br>

<a href="mailto:Lustre-discuss@lists.lustre.org">Lustre-discuss@lists.lustre.org</a><br>

<a href="http://lists.lustre.org/mailman/listinfo/lustre-discuss" target="_blank">http://lists.lustre.org/mailman/listinfo/lustre-discuss</a><br>

</div></div></blockquote></div><br><br clear="all"><br>-- <br>Wojciech Turek<br><br>Senior System Architect<br><br>High Performance Computing Service<br>University of Cambridge<br>Email: <a href="mailto:wjt27@cam.ac.uk" target="_blank">wjt27@cam.ac.uk</a><br>

Tel: (+)44 1223 763517 <br>

<div style="visibility: hidden; display: inline;" id="avg_ls_inline_popup"></div><style type="text/css">#avg_ls_inline_popup {  position:absolute;  z-index:9999;  padding: 0px 0px;  margin-left: 0px;  margin-top: 0px;  width: 240px;  overflow: hidden;  word-wrap: break-word;  color: black;  font-size: 10px;  text-align: left;  line-height: 13px;}</style>