[Pacemaker] migration-threshold and failure-timeout

Tue Sep 21 09:28:09 EDT 2010

On Tue, Sep 21, 2010 at 9:14 AM, Dan Frincu <dfrincu at streamwide.ro> wrote:
> Hi,
>
> This =>
> http://www.clusterlabs.org/doc/en-US/Pacemaker/1.0/html/Pacemaker_Explained/s-failure-migration.html
> explains it pretty well. Notice the INFINITY score and what sets it.
>
> However I don't know of any automatic method to clear the failcount.
>
> Regards,
> Dan

In Pacemaker 1.0 nothing clears the failcount automatically; as far as I
know, that is a Pacemaker 1.1 feature.

But,

crm configure rsc_defaults failure-timeout="10min"

will make the cluster "forget" a previous failure after 10 minutes.
Expired failures are only noticed when the cluster re-evaluates its state,
so if you want to decrease this parameter further, you might also need to
decrease

crm configure property cluster-recheck-interval="10min"
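
For the per-resource case Pavlos describes (N failures, then fail over, then
forget), the same two meta attributes can be set on a single resource instead
of in rsc_defaults. A minimal sketch, assuming your crm shell provides
"resource meta"; the resource name my_rsc, the threshold of 3, and the
set_failure_policy/clear_failures helpers are placeholders of mine, not
anything from this thread. By default the script only prints the commands;
set DRY_RUN=0 to actually run them.

```shell
#!/bin/sh
# Sketch only: "my_rsc" and the values below are illustrative placeholders.
# DRY_RUN=1 (the default) prints each command instead of executing it.
DRY_RUN="${DRY_RUN:-1}"

run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Move the resource away after <threshold> failures on a node, and let
# failures older than <timeout> expire.
set_failure_policy() {
    rsc="$1"; threshold="$2"; timeout="$3"
    run crm resource meta "$rsc" set migration-threshold "$threshold"
    run crm resource meta "$rsc" set failure-timeout "$timeout"
}

# Manual fallback: clear the recorded failures for a resource right away.
clear_failures() {
    run crm resource cleanup "$1"
}

set_failure_policy my_rsc 3 10min
```

Note that failure-timeout measures time since the last failure, so this is
not a strict "N failures within Z" sliding window.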

Cheers,
Vadym

>
> Pavlos Parissis wrote:
>
> Hi,
>
> I am trying to figure a way to do the following
> if the monitor of x resource fails N time in a period of Z then fail over to
> the other node and clear fail-count.
>
> Regards,
> Pavlos
>
> ________________________________
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs:
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
>
>
> --
> Dan FRINCU
> Systems Engineer
> CCNA, RHCE
> Streamwide Romania
>