[Pacemaker] Cluster split brain on vmware VSphere

Torresani, Roberto roberto.torresani at unitn.it
Fri Jun 11 09:22:09 EDT 2010


 

> -----Original Message-----
> From: Dejan Muhamedagic [mailto:dejanmm at fastmail.fm] 
> Sent: Wednesday, June 09, 2010 2:23 PM
> To: The Pacemaker cluster resource manager
> Subject: Re: [Pacemaker] Cluster split brain on vmware VSphere
> 
> Hi,
> 
> On Wed, Jun 09, 2010 at 12:11:09PM +0200, Torresani, Roberto wrote:
> > Well... it seem to be SOLVED!!!
> > Thank you Dejan.
> > In the next few days I will load the cluster and then see 
> how it behaves.
> > 
> > I simply raise the token value to 10000 msec, leave all the 
> others to the defaults.
> 
> You should also raise the consensus value to 12000. corosync
> would even refuse to start in this case.


Yes, I simply leave corosync to determine the value as 1.2*token=12000

Thank you again
Best regards
Roberto



> 
> Thanks,
> 
> Dejan
> 
> > 
> > Thank you again.
> > Regards,
> > Roberto
> > 
> >  
> > 
> > > -----Original Message-----
> > > From: Dejan Muhamedagic [mailto:dejanmm at fastmail.fm] 
> > > Sent: Tuesday, June 08, 2010 6:42 PM
> > > To: The Pacemaker cluster resource manager
> > > Subject: Re: [Pacemaker] Cluster split brain on vmware VSphere
> > > 
> > > Hi,
> > > 
> > > On Mon, Jun 07, 2010 at 02:57:57PM +0200, Torresani, 
> Roberto wrote:
> > > > Sorry for have choosen the wrong ml... 
> > > 
> > > That's no problem. There's just better chance of getting help on
> > > the other list.
> > > 
> > > > Here the corosync.conf used by one cluster, the other one is
> > > > just the same provided by the epel repository packages.
> > > > 
> > > > I will try to raise the token value to 10000 as you suggest. Is
> > > > there a theoretical or a best practice to set this value ?
> > > 
> > > No, but 5000 should be OK for most. Ultimately, it depends on
> > > your network. I forgot what was exactly the case here, but it
> > > seems like you had some heavy processing (backup?) which used
> > > most of resources. That may be really hard to predict. You can
> > > use sar or similar to monitor the load.
> > > 
> > > Thanks,
> > > 
> > > Dejan
> > > 
> > > > I will keep you informed as it goes, and open a thread on the
> > > > corosync ml if necessary.
> > > > 
> > > > Thank you.
> > > > 
> > > > 
> > > > # Please read the corosync.conf.5 manual page
> > > > compatibility: whitetank
> > > > 
> > > > totem {
> > > >         version: 2
> > > >         secauth: off
> > > >         threads: 0
> > > >         token:          1000
> > > >         hold: 180
> > > >         token_retransmits_before_loss_const: 20
> > > >         join:           60
> > > >         consensus:      4800
> > > >         vsftype:        none
> > > >         max_messages:   20
> > > >         interface {
> > > >                 ringnumber: 0
> > > >                 bindnetaddr: 192.168.206.0
> > > >                 mcastaddr: 226.94.1.1
> > > >                 mcastport: 5405
> > > >         }
> > > > }
> > > > 
> > > > logging {
> > > >         fileline: off
> > > >         to_stderr: yes
> > > >         to_logfile: yes
> > > >         to_syslog: yes
> > > >         logfile: /tmp/corosync.log
> > > >         debug: off
> > > >         timestamp: on
> > > >         logger_subsys {
> > > >                 subsys: AMF
> > > >                 debug: off
> > > >         }
> > > > }
> > > > 
> > > > amf {
> > > >         mode: disabled
> > > > }
> > > > 
> > > > aisexec {
> > > >     user:  root
> > > >     group: root
> > > > }
> > > > 
> > > > service {
> > > >     name: pacemaker
> > > >     ver: 0
> > > > }
> > > > _______________________________________________
> > > > Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> > > > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> > > > 
> > > > Project Home: http://www.clusterlabs.org
> > > > Getting started: 
> > > http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> > > > Bugs: 
> > > http://developerbugs.linux-foundation.org/enter_bug.cgi?produc
> > t=Pacemaker
> > > 
> > > _______________________________________________
> > > Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> > > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> > > 
> > > Project Home: http://www.clusterlabs.org
> > > Getting started: 
> > > http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> > > Bugs: 
> > > http://developerbugs.linux-foundation.org/enter_bug.cgi?produc
> > t=Pacemaker
> > > 
> > _______________________________________________
> > Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> > http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> > 
> > Project Home: http://www.clusterlabs.org
> > Getting started: 
> http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> > Bugs: 
> http://developerbugs.linux-foundation.org/enter_bug.cgi?produc
t=Pacemaker
> 
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> 
> Project Home: http://www.clusterlabs.org
> Getting started: 
> http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: 
> http://developerbugs.linux-foundation.org/enter_bug.cgi?produc
t=Pacemaker
> 



More information about the Pacemaker mailing list