[Pacemaker] unknown third node added to a 2 node cluster?

Brian J. Murrell (brian) brian at interlinx.bc.ca
Wed Oct 22 12:27:28 UTC 2014


On Mon, 2014-10-13 at 12:51 +1100, Andrew Beekhof wrote:
> 
> Even the same address can be a problem. That brief window where things were getting renewed can screw up corosync.

But as I proved, there was no renewal at all during the period of this
entire pacemaker run, so the use of DHCP here is a red-herring and does
not explain the observed behaviour.

> Never ever use dhcp for a cluster node. Ever. Really, never.

Fair enough.  But since this was not the cause of this problem, it's
still unexplained.  Is it a bug in pacemaker that it doesn't handle this
mysterious third node appearance/disappearance and it fouls up the
cluster?

> Yes. That is what nodeid's are calculated from.
> Different nodeid == different address

So your theory is that corosync on one of the nodes momentarily decided
to change which interface it was binding to and ...

> localhost is the most common one

... binded to localhost?  If so, I guess I should take this to the
corosync list.

b.







More information about the Pacemaker mailing list