[Pacemaker] Reason for cluster resource migration
Andrew Martin
amartin at xes-inc.com
Tue Dec 4 21:29:19 UTC 2012
Hello,
I am running a 3-node Pacemaker cluster (2 "real" nodes and 1 quorum node in standby) on Ubuntu 12.04 server (amd64) with Pacemaker 1.1.8 and Corosync 2.1.0. My cluster configuration is:
http://pastebin.com/6TPkWtbt
Recently, pengine died on storage0 (where the resources were running) which also happened to be the DC at the time. Consequently, Pacemaker went into recovery mode and released its role as DC, at which point storage1 took over the DC role and migrated the resources away from storage0 and onto storage1. Looking through the logs, it seems like storage0 came back into the cluster before the migration of the resources began:
Dec 03 08:31:20 [3165] storage1 crmd: info: peer_update_callback: Client storage0/peer now has status [online] (DC=true)
...
Dec 03 08:31:20 [3164] storage1 pengine: notice: LogActions: Start rscXXX (storage1)
Thus, why did the migration occur, rather than aborting and having the resources simply remain running on storage0? Here are the logs from each of the nodes:
storage0: http://pastebin.com/ZqqnH9uf
storage1: http://pastebin.com/rvSLVcZs
Thanks,
Andrew
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20121204/1400fcf4/attachment-0003.html>
More information about the Pacemaker
mailing list