[Pacemaker] Best way to check if PM is alive
Evgeniy Ivanov
lolkaantimat at gmail.com
Thu Dec 9 11:14:27 UTC 2010
Hi,
What is a best way to check if PM is still alive?
We tried following approach: there is a softdog timer (max value is
300s + extra 60s to give PM another chance) initially started and
checked by third party. Clone named HA_alive fails in monitor (except
first time), monitor interval is 200s. HA_alive:start should reset
that softdog timer. It looks like sometimes PM doesn't restart failed
resource for that 360s with no reason: system is almost IDLE.
Another approach we used was based on "crmadmin -S this_node" && start
timer if any problems && try to compare "crm resource status" at
different time to see that something happens on system (PM works and
bad result of crmadmin -S caused by high load of PM). It doesn't work
fine either.
--
Evgeniy Ivanov
More information about the Pacemaker
mailing list