[Pacemaker] why does pacemaker execute fence action immediately when the target node becomes UNCLEAN?

Digimer lists at alteeve.ca
Wed Jan 2 11:49:13 EST 2013


On 01/02/2013 07:17 AM, Lars Marowsky-Bree wrote:
> On 2012-12-20T15:36:45, bin chen <free2coder at gmail.com> wrote:
> 
>> I have defined a fence resource ,and cloned it.But when a node becomes
>> UNCLEAN(I disconneted its network),the fence action will be executed
>> immediately.Is there a method to avoid it(for example,a network tolerance
>> time for network flash time )?For if the network is not stable,
>>  I don`t want cluster nodes be fenced again and again.:)
> 
> Increase the membership timeout of the underlying layer.
> 
> And make the network more stable. (Bonding, etc.)
> 
> 
> 
> Regards,
>     Lars

To build on Lars' comments;

"underlying layer" == corosync, which is tweaked in the corosync.conf
file. As for bonding, only use mode=1. The other modes don't
fail/recover fast enough.

cheers

-- 
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without
access to education?




More information about the Pacemaker mailing list