[Pacemaker] Problem setting up DRBD v8.4 with Pacemaker v1.1.6
Andreas Kurz
andreas at hastexo.com
Wed Jul 4 20:52:37 UTC 2012
On 07/04/2012 09:16 PM, Irfan Ali wrote:
> Hi all,
>
> We are trying to set up an HA pair on RHEL 6.2 using DRBD (v
> 8.4.1-2), Pacemaker (v 1.1.6-3) and Corosync (v 1.4.1-4). We could
> make DRBD work independently, syncing the two machines of the pair. But
> our problem begins when we try to connect DRBD with Pacemaker. Even
> though Pacemaker was able to detect the resources corresponding to
> DRBD running on both machines, it does not promote either of
> them to master. I have pasted the output from 'crm status' below; it
> shows that the resource 'ms-drbd' is running as slave on both
> machines of our HA pair.
Please don't paste snippets; only full configurations are useful.
More comments inline.
>
> We fiddled a lot with the configuration for both Pacemaker and DRBD,
> but couldn't find anything to fix this problem. Any help / suggestions
> from you guys will be highly appreciated.
>
> ===========================================================================================================
>
> [root at c713 linbit]# crm status
> ============
> Last updated: Wed Jul 4 08:42:37 2012
> Last change: Wed Jul 4 07:34:53 2012 via crm_resource on c710.siemlab.com
> Stack: openais
> Current DC: c710.siemlab.com - partition with quorum
> Version: 1.1.6-3.el6-a02c0f19a00c1eb2527ad38f146ebc0834814558
> 2 Nodes configured, 2 expected votes
> 14 Resources configured.
> ============
>
> Online: [ c713.siemlab.com c710.siemlab.com ]
>
> Clone Set: connectivity [ping]
>     Started: [ c710.siemlab.com c713.siemlab.com ]
What is the DRBD state in /proc/drbd? Is everything OK there?
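As a rough sketch of what to look for: the snippet below parses /proc/drbd-style text and reports connection state, roles and disk states per device. The sample text is made up for illustration in the DRBD 8.4 layout (it is not output from your nodes), and the helper names `parse_proc_drbd` and `is_healthy` are hypothetical. It assumes that a healthy, idle resource shows cs:Connected and ds:UpToDate/UpToDate.

```python
import re

# Illustrative /proc/drbd text in the DRBD 8.4 layout.
# This is an assumed sample, NOT output from the nodes in this thread.
SAMPLE = """\
version: 8.4.1 (api:1/proto:86-100)
 1: cs:Connected ro:Secondary/Secondary ds:UpToDate/UpToDate C r-----
    ns:0 nr:0 dw:0 dr:0 al:0 bm:0 lo:0 pe:0 ua:0 ap:0 ep:1 wo:b oos:0
"""

# Matches the per-device status line: "<minor>: cs:... ro:.../... ds:.../..."
STATUS_LINE = re.compile(
    r"^\s*(?P<minor>\d+):\s+cs:(?P<cs>\S+)\s+ro:(?P<ro>\S+)\s+ds:(?P<ds>\S+)"
)


def parse_proc_drbd(text):
    """Return one dict per DRBD device found in /proc/drbd-style text."""
    devices = []
    for line in text.splitlines():
        match = STATUS_LINE.match(line)
        if match is None:
            continue  # version line, counters line, etc.
        local_role, _, peer_role = match.group("ro").partition("/")
        local_disk, _, peer_disk = match.group("ds").partition("/")
        devices.append({
            "minor": int(match.group("minor")),
            "cs": match.group("cs"),
            "local_role": local_role,
            "peer_role": peer_role,
            "local_disk": local_disk,
            "peer_disk": peer_disk,
        })
    return devices


def is_healthy(dev):
    """Assumed health rule: connected and both disks UpToDate."""
    return (
        dev["cs"] == "Connected"
        and dev["local_disk"] == "UpToDate"
        and dev["peer_disk"] == "UpToDate"
    )


if __name__ == "__main__":
    for dev in parse_proc_drbd(SAMPLE):
        state = "ok" if is_healthy(dev) else "needs attention"
        print(f"drbd{dev['minor']}: cs={dev['cs']} "
              f"ro={dev['local_role']}/{dev['peer_role']} "
              f"ds={dev['local_disk']}/{dev['peer_disk']} -> {state}")
```

On a real node you would pass the contents of /proc/drbd instead of `SAMPLE`.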
> Clone Set: powerstatus [cpps0]
>     Started: [ c710.siemlab.com c713.siemlab.com ]
> Master/Slave Set: ms-drbd [drbd0]
>     Slaves: [ c710.siemlab.com c713.siemlab.com ]
> c713.siemlab.com-stonith (stonith:fence_ipmilan): Started c710.siemlab.com
> c710.siemlab.com-stonith (stonith:fence_ipmilan): Started c713.siemlab.com
>
> ===========================================================================================================
>
> We are using the following CIB configuration related to DRBD:
>
> <master id="ms-drbd">
>   <meta_attributes id="ma-ms-drbd">
>     <nvpair id="ma-ms-drbd-1" name="clone-max" value="2"/>
>     <nvpair id="ma-ms-drbd-2" name="clone-node-max" value="1"/>
>     <nvpair id="ma-ms-drbd-3" name="notify" value="yes"/>
>     <nvpair id="ma-ms-drbd-4" name="globally-unique" value="false"/>
>     <nvpair id="ma-ms-drbd-5" name="master-max" value="1"/>
>     <nvpair id="ma-ms-drbd-6" name="master-node-max" value="1"/>
>     <nvpair id="ma-ms-drbd-7" name="target-role" value="Master"/>
>     <nvpair id="ma-ms-drbd-8" name="ordered" value="true"/>
Remove that "ordered" attribute.
>     <nvpair id="ma-ms-drbd-crm-feature-set" name="crm-feature-set" value="3.0.5"/>
>     <nvpair id="ma-ms-drbd-is-managed" name="is-managed" value="true"/>
>   </meta_attributes>
>   <primitive class="ocf" id="drbd0" provider="linbit" type="drbd">
>     <instance_attributes id="ia-drbd">
>       <nvpair id="ia-drbd-1" name="drbd_resource" value="var_nsm"/>
>     </instance_attributes>
>     <operations>
>       <op id="op-drbd-1" interval="2min" name="monitor" role="Master" timeout="5min"/>
>       <op id="op-drbd-2" interval="0" name="start" role="Master" timeout="4min"/>
>       <op id="op-drbd-3" interval="0" name="stop" role="Master" timeout="4min"/>
You don't need roles for start/stop operations.
>       <op id="op-drbd-4" interval="3min" name="monitor" role="Slave" timeout="6min"/>
>       <op id="op-drbd-5" interval="0" name="start" role="Slave" timeout="4min"/>
>       <op id="op-drbd-6" interval="0" name="stop" role="Slave" timeout="4min"/>
>     </operations>
>   </primitive>
> </master>
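To summarize the two inline remarks on this resource definition, here is a minimal sketch that scans a trimmed copy of the quoted <master> element and lists the "ordered" meta attribute plus every start/stop operation that carries a role. The function name `lint_master` is hypothetical and the fragment omits some of the quoted nvpairs for brevity. The role-specific monitor operations are left alone, since nothing was said against them.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the quoted <master> definition (ids and attributes unchanged,
# several meta attributes and the instance_attributes omitted for brevity).
CIB_FRAGMENT = """\
<master id="ms-drbd">
  <meta_attributes id="ma-ms-drbd">
    <nvpair id="ma-ms-drbd-3" name="notify" value="yes"/>
    <nvpair id="ma-ms-drbd-8" name="ordered" value="true"/>
  </meta_attributes>
  <primitive class="ocf" id="drbd0" provider="linbit" type="drbd">
    <operations>
      <op id="op-drbd-1" interval="2min" name="monitor" role="Master" timeout="5min"/>
      <op id="op-drbd-2" interval="0" name="start" role="Master" timeout="4min"/>
      <op id="op-drbd-3" interval="0" name="stop" role="Master" timeout="4min"/>
      <op id="op-drbd-4" interval="3min" name="monitor" role="Slave" timeout="6min"/>
      <op id="op-drbd-5" interval="0" name="start" role="Slave" timeout="4min"/>
      <op id="op-drbd-6" interval="0" name="stop" role="Slave" timeout="4min"/>
    </operations>
  </primitive>
</master>
"""


def lint_master(xml_text):
    """Return a list of findings for the two issues raised in this reply."""
    root = ET.fromstring(xml_text)
    findings = []

    # 1) The "ordered" meta attribute on the master/slave set.
    for nvpair in root.findall("./meta_attributes/nvpair"):
        if nvpair.get("name") == "ordered":
            findings.append(f'{nvpair.get("id")}: remove the "ordered" meta attribute')

    # 2) start/stop operations that specify a role.
    for op in root.iter("op"):
        if op.get("name") in ("start", "stop") and op.get("role"):
            findings.append(
                f'{op.get("id")}: {op.get("name")} does not need role="{op.get("role")}"'
            )
    return findings


if __name__ == "__main__":
    for finding in lint_master(CIB_FRAGMENT):
        print(finding)
```

The script only reports; it does not modify the CIB. Apply the actual changes with your usual tooling.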
>
> ===========================================================================================================
>
> The following is the content of drbd.conf:
>
> global {
>     usage-count no;
> }
>
> common {
>     net {
>         protocol C;
>     }
>     startup {
>         wfc-timeout 120;
>         degr-wfc-timeout 120;
>     }
> }
>
> resource var_nsm {
>     disk {
>         on-io-error detach;
>         no-disk-barrier;
>         no-disk-flushes;
>         fencing resource-only;
>     }
>     handlers {
>         fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
>         after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
>     }
>     net {
>         #rate 333M;
>         after-sb-1pri discard-secondary;
... you know that this setting can discard valid data?
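To spell out the concern: with "after-sb-1pri discard-secondary", a split brain detected while one node is primary is resolved automatically by throwing away the secondary's changes, even if those changes were valid writes. The sketch below lists every after-sb-* option in a drbd.conf text that is set to something other than "disconnect", which I am assuming here is the default (no automatic resolution). The function name `auto_recovery_policies` is hypothetical and the sample is a shortened copy of your net section.

```python
import re

# Shortened copy of the quoted net section (buffer settings omitted).
NET_SECTION = """\
net {
    #rate 333M;
    after-sb-1pri discard-secondary;
    max-buffers 8000;
}
"""

# Matches uncommented lines such as "after-sb-1pri discard-secondary;".
POLICY = re.compile(r"^\s*(after-sb-[012]pri)\s+([\w-]+)\s*;", re.MULTILINE)


def auto_recovery_policies(conf_text):
    """Return {option: policy} for after-sb-* options not set to 'disconnect'.

    Assumption: 'disconnect' means DRBD does not resolve the split brain
    on its own, so anything else deserves a second look.
    """
    return {
        option: policy
        for option, policy in POLICY.findall(conf_text)
        if policy != "disconnect"
    }


if __name__ == "__main__":
    for option, policy in auto_recovery_policies(NET_SECTION).items():
        print(f"{option} = {policy}: automatic split-brain recovery may discard data")
```

Whether an automatic policy is acceptable depends on how much data loss you can tolerate after a split brain.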
Regards,
Andreas
--
Need help with Pacemaker?
http://www.hastexo.com/now
>         max-buffers 8000;
>         max-epoch-size 8000;
>         sndbuf-size 0;
>     }
>     on c713.siemlab.com {
>         device /dev/drbd1;
>         disk /dev/sdb3;
>         address 192.168.2.2:7791;
>         meta-disk internal;
>     }
>     on c710.siemlab.com {
>         device /dev/drbd1;
>         disk /dev/sdb3;
>         address 192.168.2.4:7791;
>         meta-disk internal;
>     }
> }
>