[Lustre-discuss] How do you monitor your lustre?

Erik Froese erik.froese at gmail.com
Thu Sep 30 06:57:03 PDT 2010


We use:

LMT as well as Ganglia + collectl.
Nagios for system health, hardware health, and cluster health (crm_mon -s).
Splunk for monitoring and reviewing log messages.

Erik

On Thu, Sep 30, 2010 at 7:36 AM, Temple  Jason <jtemple at cscs.ch> wrote:
> We use ganglia with collectl.  These versions are the only ones I could find to work in this way:
>
> Sep 30 13:35 [root at wn125:~]# rpm -qa |grep collectl
> collectl-3.4.2-5
> Sep 30 13:35 [root at wn125:~]# rpm -qa |grep ganglia
> ganglia-gmond-3.1.7-1
>
> We are quite happy with it.
>
> Thanks,
>
> Jason
>
> -----Original Message-----
> From: lustre-discuss-bounces at lists.lustre.org [mailto:lustre-discuss-bounces at lists.lustre.org] On Behalf Of Andreas Davour
> Sent: giovedì, 30. settembre 2010 11:47
> To: lustre-discuss at lists.lustre.org
> Subject: [Lustre-discuss] How do you monitor your lustre?
>
>
> I ask because the lmt project seem to be quite moribund. Anyone else out there
> doing something?
>
> /andreas
> --
> Systems Engineer
> PDC Center for High Performance Computing
> CSC School of Computer Science and Communication
> KTH Royal Institute of Technology
> SE-100 44 Stockholm, Sweden
> Phone: 087906658
> "A satellite, an earring, and a dust bunny are what made America great!"
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss
> _______________________________________________
> Lustre-discuss mailing list
> Lustre-discuss at lists.lustre.org
> http://lists.lustre.org/mailman/listinfo/lustre-discuss
>



More information about the lustre-discuss mailing list