[Pacemaker] SNMP/SMTP alerts on move or STONITH?

Simpson, John R john_simpson at reyrey.com
Tue May 25 10:11:36 EDT 2010


Dr. Schwartzkopff,

    Thank you for your reply.  I am using the -S / --snmptraps option
with ClusterMon (crm_mon) and I receive traps for start, stop and 
monitor.  I do not receive traps for move or STONITH, which seem more
critical from a data center ops standpoint.

    Zabbix looks interesting -- we currently use Zenoss.

John

John Simpson 
Senior Software Engineer, I. T. Engineering and Operations


> -----Original Message-----
> From: Michael Schwartzkopff [mailto:misch at multinet.de]
> Sent: Tuesday, May 25, 2010 3:37 AM
> To: The Pacemaker cluster resource manager
> Subject: Re: [Pacemaker] SNMP/SMTP alerts on move or STONITH?
> 
> Am Montag, 24. Mai 2010 16:05:15 schrieb Simpson, John R:
> > Greetings all,
> >
> >      First, my compliments to the Pacemaker and Corosync developers.
> I've
> > been trying out Pacemaker for the past few months, and (especially from
> the
> > command line) I've found building and managing Pacemaker-based clusters
> > more intuitive and flexible than RHCS.
> >
> >      Is there any way to generate SNMP traps and/or email notifications
> > when a resource is moved or a node is STONITH'd?
> 
> Yes. crm_mon -S i think, read man crm_mon.
> 
> >      Using the Pacemaker resource agent ClusterMon to run crm_mon I
> receive
> > the start, stop, and monitor notifications I expect, but there are no
> > specific notifications when a resource is moved or a node is killed.
> I'd
> > like to send up a giant red flag when one of these major events occurs,
> > rather than having to derive it from start/stop/monitor alerts (i.e. all
> > the resources usually hosted on node01 suddenly started and were
> monitored
> > on node02 - node01 must have been stonith'd).  I'm using the
> external/ssh
> > stonith agent for lab tests, if that is a factor.
> 
> ClusterMon uses crm_mon.
> 
> >      I'm using the following ClusterMon configuration and Pacemaker /
> > Corosync / SNMP versions:
> >
> > primitive Monitor-Cluster ocf:pacemaker:ClusterMon \
> >         params htmlfile="/var/www/html/rlb-cluster-monitor.html" \
> >         params pidfile="/var/run/rlb-cluster-monitor.pid" \
> >         params extra_options="--mail-host=outbound.msg.reyrey.net:25
> > --mail-from=john_simpson at reyrey.com --mail-to=john_simpson at reyrey.com
> > --snmp-traps=10.205.1.18" \ op start interval="0" timeout="90s" \
> >         op stop interval="0" timeout="100s"
> 
> There are developements for a openais SNMP agent on the way. See the mail
> of
> sato yuki of May, 5th. You also can try heartbeat's SNMP subagent,
> although it
> is still in some testing for openais.
> 
> I am checking the failcounter from my NMS. So in any case a node dies I
> get at
> least one trap and an error on the failcounters. Pretty straight forward
> on
> the NMS. And by the way: I am using Zabbix.
> 
> Greetings,
> 
> --
> Dr. Michael Schwartzkopff
> MultiNET Services GmbH
> Addresse: Bretonischer Ring 7; 85630 Grasbrunn; Germany
> Tel: +49 - 89 - 45 69 11 0
> Fax: +49 - 89 - 45 69 11 21
> mob: +49 - 174 - 343 28 75
> 
> mail: misch at multinet.de
> web: www.multinet.de
> 
> Sitz der Gesellschaft: 85630 Grasbrunn
> Registergericht: Amtsgericht München HRB 114375
> Geschäftsführer: Günter Jurgeneit, Hubert Martens
> 
> ---
> 
> PGP Fingerprint: F919 3919 FF12 ED5A 2801 DEA6 AA77 57A4 EDD8 979B
> Skype: misch42
> 
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf




More information about the Pacemaker mailing list