[Pacemaker] Best way to notify stonith action
Digimer
lists at alteeve.ca
Mon Jul 8 14:01:24 UTC 2013
On 08/07/13 03:48, Andreas Mock wrote:
> Hi all,
>
> I'm just wondering what the best way is to
> let an admin know that the cluster (rest of
> a cluster) has stonithed some other nodes?
>
> What is the recommended way?
> (The fact that the machine rebooted or is
> halted is not the problem. I want to know
> that stonithing was done)
>
> Best regards
> Andreas Mock
Personally, I have a little monitoring script I wrote that watches the
cluster resources, local hardware (via the IPMI BMC), UPSes and
what-not. It loop every 30 seconds and sends an email if/when anything
of note changes. A node being fenced certainly raises a flag and emails
go out.
My script is principally for cman + rgmanager, but it should be easy to
craft your own, too. I just read in the current state of things, compare
against the values in the last scan, decide whether to send an email or
not, copy the just-read values over to the last-scan values and delete
the "new" values and go back to sleep for 30 seconds.
hth
--
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without
access to education?
More information about the Pacemaker
mailing list