[Pacemaker] never ending election
David Riccitelli
david at interact.it
Sun Aug 3 09:18:34 UTC 2008
Hello there,
Can somebody help me with this problem?
I have two identical nodes, node #1 and node #2. Both run CentOS 5
with the current versions of heartbeat (2.1.3) and pacemaker (0.6.5).
Each node has two network ports bonded together (mode 1,
active-backup). Bonding is configured and working fine.
The nodes have one resource configured. Every failover test I run
succeeds, except this one:
1. node #1 has the resource, node #2 is waiting,
2. I remove both network cables from node #1,
3. node #2 no longer sees node #1 and believes it is dead,
4. node #2 brings up the resource,
5. I reconnect node #1 to the network; I expect the nodes to see each
other again and one of the two to release the resource,
6. node #1 and node #2 do see each other and start counting election
votes, but the election never finishes and the resource stays active
on both nodes at the same time.
Logs (same on both nodes; this pattern repeats forever, until
heartbeat is manually stopped on one of the nodes):
Aug 1 12:10:56 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Updated voted hash for rmefp-srv02x to vote
Aug 1 12:10:56 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election ignore: our vote (rmefp-srv02x)
Aug 1 12:10:56 rmefp-srv02x crmd: [20793]: info: do_election_check: Still waiting on 1 non-votes (2 total)
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election check: vote from rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election won over rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_check: Still waiting on 2 non-votes (2 total)
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election check: vote from rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election won over rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_check: Still waiting on 2 non-votes (2 total)
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election check: vote from rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election won over rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_check: Still waiting on 2 non-votes (2 total)
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election check: vote from rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election won over rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_check: Still waiting on 2 non-votes (2 total)
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election check: vote from rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_count_vote: Election won over rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_check: Still waiting on 2 non-votes (2 total)
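To make it easier to compare the logs from both nodes, here is a minimal Python sketch that counts the election messages shown above. It relies only on the message wording in this excerpt; the function names `summarize_election` and `looks_stuck` and the `min_rounds` threshold are my own assumptions, not anything provided by heartbeat or pacemaker.

```python
import re
from collections import Counter

# Three rounds copied from the excerpt above, wrapped as in the mail.
SAMPLE = """\
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info:
do_election_count_vote: Election won over rmefp-srv01x
Aug 1 12:10:57 rmefp-srv02x crmd: [20793]: info: do_election_check:
Still waiting on 2 non-votes (2 total)
""" * 3


def summarize_election(log_text):
    """Count 'Election won' and 'Still waiting' messages in crmd log text."""
    # Collapse all whitespace so entries wrapped across lines still match.
    flat = " ".join(log_text.split())
    wins = Counter(re.findall(r"Election won over (\S+)", flat))
    waits = [
        (int(missing), int(total))
        for missing, total in re.findall(
            r"Still waiting on (\d+) non-votes \((\d+) total\)", flat
        )
    ]
    return {"wins": wins, "waits": waits}


def looks_stuck(summary, min_rounds=3):
    """Heuristic (an assumption, not a pacemaker rule): the node keeps
    reporting wins, yet the last checks are still missing every vote."""
    rounds = sum(summary["wins"].values())
    tail = summary["waits"][-min_rounds:]
    return (
        rounds >= min_rounds
        and len(tail) == min_rounds
        and all(missing == total for missing, total in tail)
    )


if __name__ == "__main__":
    summary = summarize_election(SAMPLE)
    print(summary["wins"], looks_stuck(summary))
```

Running the same functions over the full log of each node should show whether both sides report the same repeating pattern.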
What's wrong?
Best regards,
David Riccitelli
________________________________________________________________________
David Riccitelli
e-mail: david at interact.it
skype: ziodave
phone: +39.0658318336
roma - tel.+39.0658318301 fax.+39.0658318303 P.I. 04856801008
________________________________________________________________________