[Pacemaker] corosync [TOTEM ] Process pause detected for 577 ms

Jan Friesse jfriesse at redhat.com
Fri Apr 25 03:27:09 EDT 2014


Emanuel,

emmanuel segura napsal(a):
> Hello List,
>
> I have this two lines in my cluster logs, somebody can help to know what
> this means.
>
> ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
>
> corosync [TOTEM ] Process pause detected for 577 ms, flushing membership
> messages.
> corosync [TOTEM ] Process pause detected for 538 ms, flushing membership
> messages.

Corosync internally checks gap between member join messages. If such gap 
is > token/2, it means, that corosync was not scheduled to run by kernel 
for too long, and it should discard membership messages.

Original intend was to detect paused process. If pause is detected, it's 
better to discard old membership messages and initiate new query then 
sending outdated view.

So there are various reasons why this is triggered, but today it's 
usually VM with overloaded host machine.


> corosync [TOTEM ] A processor failed, forming new configuration.
>
> ::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
>
> I know the "corosync [TOTEM ] A processor failed, forming new
> configuration" message is when the toten package is definitely lost.
>
> Thanks
>

Regards,
   Honza

>
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
>





More information about the Pacemaker mailing list