[Pacemaker] testing a exmaple, but it does not quite work

Andrew Beekhof andrew at beekhof.net
Thu Mar 10 02:25:07 EST 2011


Looks like you don't have the openais parts installed/running.
You need corosync _and_ openais if you want cluster filesystems.

On Wed, Mar 9, 2011 at 8:42 PM, Lasse Lindgren <st0kk at hotmail.com> wrote:
> Hello
> I Have been trying now for a couple of day's to get
> https://wiki.edubuntu.org/ClusterStack/LucidTesting
> this working on a opensuse 11.3
> I end up with a bunch off errors
>  expected-quorum-votes="2" should be "1", but i cant change it...
> node Ctsfs1
> node Ctsfs2
> primitive resDLM ocf:pacemaker:controld \
>         op monitor interval="120s"
> primitive resDRBD ocf:linbit:drbd \
>         params drbd_resource="export" \
>         operations $id="resDRBD-operations" \
>         op monitor interval="20" role="Master" timeout="20" \
>         op monitor interval="30" role="Slave" timeout="20"
> primitive resFS ocf:heartbeat:Filesystem \
>         params device="/dev/drbd/by-res/export" directory="/export"
> fstype="ocfs2" \
>         op monitor interval="120s"
> primitive resO2CB ocf:pacemaker:o2cb \
>         op monitor interval="120s"
> ms msDRBD resDRBD \
>         meta resource-stickines="100" notify="true" master-max="2"
> interleave="true"
> clone cloneDLM resDLM \
>         meta globally-unique="false" interleave="true"
> clone cloneFS resFS \
>         meta interleave="true" ordered="true" target-role="Started"
> clone cloneO2CB resO2CB \
>         meta globally-unique="false" interleave="true" target-role="Started"
> colocation colDLMDRBD inf: cloneDLM msDRBD:Master
> colocation colFSO2CB inf: cloneFS cloneO2CB
> colocation colO2CBDLM inf: cloneO2CB cloneDLM
> order ordDLMO2CB 0: cloneDLM cloneO2CB
> order ordDRBDDLM 0: msDRBD:promote cloneDLM
> order ordO2CBFS 0: cloneO2CB cloneFS
> property $id="cib-bootstrap-options" \
>         no-quorum-policy="ignore" \
>         stonith-enabled="false" \
>         expected-quorum-votes="2" \
>         dc-version="1.0.9-89bd754939df5150de7cd76835f98fe90851b677" \
>         cluster-infrastructure="openais" \
>         last-lrm-refresh="1299673052"
> Ctsfs2:~ # cat /etc/udev/rules.d/51-ocfs2.rules
> KERNEL=="ocfs2_control", NAME="misc/ocfs2_control", MODE="0660"
> Ctsfs2:~ # cat /etc/udev/rules.d/51-dlm.rules
> KERNEL=="dlm-control", NAME="misc/dlm-control", MODE="0666"
> KERNEL=="dlm-monitor", NAME="misc/dlm-monitor", MODE="0666"
> KERNEL=="dlm_default", NAME="misc/dlm_default", MODE="0666"
> KERNEL=="dlm_*", NAME="misc/%k", MODE="0660"
> ============
> Last updated: Wed Mar  9 20:34:48 2011
> Stack: openais
> Current DC: Ctsfs2 - partition with quorum
> Version: 1.0.9-89bd754939df5150de7cd76835f98fe90851b677
> 2 Nodes configured, 2 expected votes
> 4 Resources configured.
> ============
> Online: [ Ctsfs1 Ctsfs2 ]
>  Master/Slave Set: msDRBD
>      Masters: [ Ctsfs2 Ctsfs1 ]
>  Clone Set: cloneDLM
>      Started: [ Ctsfs2 Ctsfs1 ]
> Failed actions:
>     resO2CB:1_start_0 (node=Ctsfs2, call=21, rc=1, status=complete): unknown
> error
>     resO2CB:0_start_0 (node=Ctsfs1, call=14, rc=1, status=complete): unknown
> error
> crm(live)resource# status
>  Master/Slave Set: msDRBD
>      Masters: [ Ctsfs2 Ctsfs1 ]
>  Clone Set: cloneDLM
>      Started: [ Ctsfs2 Ctsfs1 ]
>  Clone Set: cloneFS
>      Stopped: [ resFS:0 resFS:1 ]
>  Clone Set: cloneO2CB
>      Stopped: [ resO2CB:0 resO2CB:1 ]
> Ctsfs2:~ # echo "" > /var/log/messages
> Ctsfs2:~ # service corosync restart
> Signaling Corosync Cluster Engine (corosync) to terminate: [  OK  ]
> Waiting for corosync services to unload:...[  OK  ]
> Starting Corosync Cluster Engine (corosync): [  OK  ]
> Ctsfs2:~ # tail -f /var/log/messages
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld[8478]: Added Pacemaker as client 1
> with fd 8
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld[8478]: Initializing CKPT service (try
> 1)
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld[8478]: Unable to connect to CKPT:
> Object does not exist
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld[8478]: Unable to connect to CKPT:
> Object does not exist
> Mar  9 20:36:31 Ctsfs2 corosync[8058]:  [pcmk  ] info: pcmk_notify: Enabling
> node notifications for child 8478 (0x69a580)
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld: [8478]: info:
> init_ais_connection_once: Creating connection to our AIS plugin
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld: [8478]: info:
> init_ais_connection_once: AIS connection established
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld: [8478]: info: get_ais_nodeid: Server
> details: id=33663168 uname=Ctsfs2 cname=pcmk
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld: [8478]: info: crm_new_peer: Node
> Ctsfs2 now has id: 33663168
> Mar  9 20:36:31 Ctsfs2 ocfs2_controld: [8478]: info: crm_new_peer: Node
> 33663168 is now known as Ctsfs2
> Mar  9 20:36:42 Ctsfs2 o2cb[8416]: [8515]: ERROR: ocfs2_controld.pcmk did
> not come up
> Mar  9 20:36:42 Ctsfs2 crmd: [8070]: info: process_lrm_event: LRM operation
> resO2CB:0_start_0 (call=15, rc=1, cib-update=22, confirmed=true) unknown
> error
> Mar  9 20:36:42 Ctsfs2 attrd: [8068]: info: attrd_ais_dispatch: Update
> relayed from Ctsfs1
> Mar  9 20:36:42 Ctsfs2 attrd: [8068]: info: attrd_trigger_update: Sending
> flush op to all hosts for: fail-count-resO2CB:0 (INFINITY)
> Mar  9 20:36:42 Ctsfs2 attrd: [8068]: info: attrd_perform_update: Sent
> update 27: fail-count-resO2CB:0=INFINITY
> Mar  9 20:36:42 Ctsfs2 attrd: [8068]: info: attrd_ais_dispatch: Update
> relayed from Ctsfs1
> Mar  9 20:36:42 Ctsfs2 attrd: [8068]: info: attrd_trigger_update: Sending
> flush op to all hosts for: last-failure-resO2CB:0 (1299699433)
> Mar  9 20:36:42 Ctsfs2 attrd: [8068]: info: attrd_perform_update: Sent
> update 30: last-failure-resO2CB:0=1299699433
> Mar  9 20:36:42 Ctsfs2 crmd: [8070]: info: do_lrm_rsc_op: Performing
> key=77:5:0:42a6919a-89d9-4d38-9a5a-eaf7afd824df op=resDRBD:0_notify_0 )
> Mar  9 20:36:42 Ctsfs2 lrmd: [8067]: info: rsc:resDRBD:0:16: notify
> Mar  9 20:36:42 Ctsfs2 crmd: [8070]: info: do_lrm_rsc_op: Performing
> key=2:5:0:42a6919a-89d9-4d38-9a5a-eaf7afd824df op=resO2CB:0_stop_0 )
> Mar  9 20:36:42 Ctsfs2 lrmd: [8067]: info: rsc:resO2CB:0:17: stop
> Mar  9 20:36:42 Ctsfs2 crmd: [8070]: info: process_lrm_event: LRM operation
> resDRBD:0_notify_0 (call=16, rc=0, cib-update=23, confirmed=true) ok
> Mar  9 20:36:42 Ctsfs2 crmd: [8070]: info: process_lrm_event: LRM operation
> resO2CB:0_stop_0 (call=17, rc=0, cib-update=24, confirmed=true) ok
> Mar  9 20:36:42 Ctsfs2 cluster-dlm[8320]: log_config: dlm:controld conf 2 1
> 0 memb 16885952 33663168 join 16885952 left#012
> Mar  9 20:36:42 Ctsfs2 kernel: [ 1445.277467] block drbd0: peer( Secondary
> -> Primary )
> Mar  9 20:36:42 Ctsfs2 lrmd: [8067]: info: rsc:resDRBD:0:18: notify
> Mar  9 20:36:42 Ctsfs2 crmd: [8070]: info: do_lrm_rsc_op: Performing
> key=78:5:0:42a6919a-89d9-4d38-9a5a-eaf7afd824df op=resDRBD:0_notify_0 )
> Mar  9 20:36:42 Ctsfs2 lrmd: [8067]: info: RA output:
> (resDRBD:0:notify:stdout)
> Mar  9 20:36:42 Ctsfs2 crmd: [8070]: info: process_lrm_event: LRM operation
> resDRBD:0_notify_0 (call=18, rc=0, cib-update=25, confirmed=t
>
> /Greetings
> Lasse
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs:
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
>
>




More information about the Pacemaker mailing list