[Pacemaker] Slave does not get become Master after unplugging power cable at master

Dominik Klein dk at in-telegence.net
Wed Aug 19 03:49:48 UTC 2009


hj lee wrote:
> Thank very much for the reply.
> 
> I tested it both stonith-enabled and no-quorum-policy. As Dejan pointed,
> this is related to stonith-enabled. With stonith-enabled true (which is
> default),
> if I kill the master node, the slave stays as a slave, it seems expecting
> something from stonith. With stonith-enabled false, the slave was promoted
> to master.

Right. Because the surviving node did not care there might be something
running on the dead node. That's usually not a good idea.

> The no-quorum-policy controls the behavior when the quorum is not enough.
> With no-quorum-policy stop, when one of two nodes dies, the resources in the
> other node get stopped. With no-quorum-poolicy ignore, when one of two nodes
> dies, nothing happens. If master die, the slave stays as a slave. If slave
> dies, the master stays as a master.
> 
> By the way during the test, no-quorum-policy works only in openais stack.
> With heartbeat stack, there is no difference between no-quorum-policy stop
> and ignore. It seems the heartbeat stack always behavior as no-quorum-policy
> ignore.
> 
> Here are my new questions:
> How does stonith detect the node was actually down? 

stonithd just uses the plugin you configured and tells it to kill the
node. If that is not successful, the node's state is consequently still
considered "unclean".

As Dejan already pointed out: Since your stonith plugin is ssh and the
other node is _powered off_, ssh cannot connect and therefore is not
successful. ssh is just a (bad) test plugin for stonith. It does only
work if the node is reachable and is not for production use.

"real" stonith devices need to be usable even if the node is already
powered off (like in your case). Ipmi usually is just a little better,
since - afaik - most ipmi devices won't respond if the node has no
power. The devices are not OS-dependant like ssh, but still
node-hardware-dependant.

However, there are some drawbacks on all the stonith devices I have
personally come across.

Regards
Dominik

> Doesn't it get this info
> from the cluster stack? Does it have its own mechanism like pinging the
> other nodes? Can stonith:external/ipmi detect the node is down? Which type
> of stonith has this ability?




More information about the Pacemaker mailing list