[Pacemaker] pacemaker 1.1.10 + RHEL 7 beta issues

Digimer lists at alteeve.ca
Wed Jan 1 23:38:47 UTC 2014


I didn't show here, but 'pidof httpd' confirmed that apache was not 
runnong on either node until the errant constraint was cleared.

On 01/01/14 06:30 PM, Digimer wrote:
> I have a problem where putting a node into standby causes my "WebSite"
> resource to stop and refuse to start. This is the reproducer I show
> here, but I have seen this happen with DRBD earlier (before I realized
> what was wrong, so I didn't document it), so I don't think this is an
> apache RA issue.
>
> [root at an-c03n01 ~]# rpm -q pacemaker httpd pcs corosync
> pacemaker-1.1.10-19.el7.x86_64
> httpd-2.4.6-7.el7.x86_64
> pcs-0.9.99-2.el7.x86_64
> corosync-2.3.2-2.el7.x86_64
>
> [root at an-c03n01 ~]# cat /etc/redhat-release
> Red Hat Enterprise Linux Everything release 7.0 Beta (Maipo)
>
> [root at an-c03n01 ~]# uname -a
> Linux an-c03n01.alteeve.ca 3.10.0-54.0.1.el7.x86_64 #1 SMP Tue Nov 26
> 16:51:22 EST 2013 x86_64 x86_64 x86_64 GNU/Linux
>
> DRBD installed from source:
>
> [root at an-c03n01 ~]# cat /proc/drbd
> version: 8.4.4 (api:1/proto:86-101)
> GIT-hash: 74402fecf24da8e5438171ee8c19e28627e1c98a build by
> root at an-c03n01.alteeve.ca, 2013-12-23 20:36:27
>
>
> Starting with everything working:
>
> =====
> [root at an-c03n01 ~]# pcs status
> Cluster name: an-cluster-03
> Last updated: Wed Jan  1 18:13:19 2014
> Last change: Wed Jan  1 18:03:08 2014 via cibadmin on an-c03n01.alteeve.ca
> Stack: corosync
> Current DC: an-c03n01.alteeve.ca (1) - partition with quorum
> Version: 1.1.10-19.el7-368c726
> 2 Nodes configured
> 7 Resources configured
>
>
> Online: [ an-c03n01.alteeve.ca an-c03n02.alteeve.ca ]
>
> Full list of resources:
>
>   fence_n01_virsh    (stonith:fence_virsh):    Started an-c03n02.alteeve.ca
>   fence_n02_virsh    (stonith:fence_virsh):    Started an-c03n02.alteeve.ca
>   ClusterIP    (ocf::heartbeat:IPaddr2):    Started an-c03n01.alteeve.ca
>   Master/Slave Set: WebDataClone [WebData]
>       Masters: [ an-c03n01.alteeve.ca ]
>       Slaves: [ an-c03n02.alteeve.ca ]
>   WebFS    (ocf::heartbeat:Filesystem):    Started an-c03n01.alteeve.ca
>   WebSite    (ocf::heartbeat:apache):    Started an-c03n01.alteeve.ca
>
> PCSD Status:
> an-c03n01.alteeve.ca:
>    an-c03n01.alteeve.ca: Online
> an-c03n02.alteeve.ca:
>    an-c03n02.alteeve.ca: Online
>
> Daemon Status:
>    corosync: active/disabled
>    pacemaker: active/disabled
>    pcsd: active/enabled
> =====
>
> =====
> [root at an-c03n01 ~]# pcs config show
> Cluster Name: an-cluster-03
> Corosync Nodes:
>   an-c03n01.alteeve.ca an-c03n02.alteeve.ca
> Pacemaker Nodes:
>   an-c03n01.alteeve.ca an-c03n02.alteeve.ca
>
> Resources:
>   Resource: ClusterIP (class=ocf provider=heartbeat type=IPaddr2)
>    Attributes: ip=192.168.122.10 cidr_netmask=32
>    Operations: monitor interval=30s (ClusterIP-monitor-interval-30s)
>   Master: WebDataClone
>    Meta Attrs: master-max=1 master-node-max=1 clone-max=2
> clone-node-max=1 notify=true
>    Resource: WebData (class=ocf provider=linbit type=drbd)
>     Attributes: drbd_resource=r0
>     Operations: monitor interval=60s (WebData-monitor-60s)
>   Resource: WebFS (class=ocf provider=heartbeat type=Filesystem)
>    Attributes: device=/dev/drbd0 directory=/var/www/html fstype=ext4
>    Operations: monitor interval=60s (WebFS-monitor-interval-60s)
>   Resource: WebSite (class=ocf provider=heartbeat type=apache)
>    Attributes: configfile=/etc/httpd/conf/httpd.conf
> statusurl=http://127.0.0.1/server-status
>    Operations: monitor interval=60s (WebSite-monitor-interval-60s)
>
> Stonith Devices:
>   Resource: fence_n01_virsh (class=stonith type=fence_virsh)
>    Attributes: pcmk_host_list=an-c03n01.alteeve.ca ipaddr=lemass
> login=root passwd_script=/root/lemass.pw delay=15 port=an-c03n01
>    Operations: monitor interval=60s (fence_n01_virsh-monitor-interval-60s)
>   Resource: fence_n02_virsh (class=stonith type=fence_virsh)
>    Attributes: pcmk_host_list=an-c03n02.alteeve.ca ipaddr=lemass
> login=root passwd_script=/root/lemass.pw port=an-c03n02
>    Operations: monitor interval=60s (fence_n02_virsh-monitor-interval-60s)
> Fencing Levels:
>
> Location Constraints:
>    Resource: ClusterIP
>      Enabled on: an-c03n01.alteeve.ca (score:INFINITY) (role: Started)
> (id:cli-prefer-ClusterIP)
>    Resource: WebDataClone
>      Enabled on: an-c03n01.alteeve.ca (score:INFINITY) (role: Started)
> (id:cli-prefer-WebDataClone)
>    Resource: WebSite
>      Enabled on: an-c03n01.alteeve.ca (score:INFINITY) (role: Started)
> (id:cli-prefer-WebSite)
> Ordering Constraints:
>    promote WebDataClone then start WebFS (Mandatory)
> (id:order-WebDataClone-WebFS-mandatory)
>    start WebFS then start WebSite (Mandatory)
> (id:order-WebFS-WebSite-mandatory)
> Colocation Constraints:
>    WebFS with WebDataClone (INFINITY) (with-rsc-role:Master)
> (id:colocation-WebFS-WebDataClone-INFINITY)
>    WebSite with ClusterIP (INFINITY)
> (id:colocation-WebSite-ClusterIP-INFINITY)
>    WebSite with WebFS (INFINITY) (id:colocation-WebSite-WebFS-INFINITY)
>
> Cluster Properties:
>   cluster-infrastructure: corosync
>   dc-version: 1.1.10-19.el7-368c726
>   last-lrm-refresh: 1388615115
>   no-quorum-policy: ignore
>   stonith-enabled: true
> =====
>
> Put an-c03n01 into standby:
>
> =====
> pcs cluster standby an-c03n01.alteeve.ca
>
> ==[an-c03n01 logs]===
> Jan  1 18:16:31 an-c03n01 cib[1762]: notice: cib:diff: Diff: --- 0.189.16
> Jan  1 18:16:31 an-c03n01 cib[1762]: notice: cib:diff: Diff: +++ 0.190.1
> 87f59f171d277d9cb1b51cbda41fbf24
> Jan  1 18:16:31 an-c03n01 cib[1762]: notice: cib:diff: -- <cib
> admin_epoch="0" epoch="189" num_updates="16"/>
> Jan  1 18:16:31 an-c03n01 cib[1762]: notice: cib:diff: ++
> <instance_attributes id="nodes-1">
> Jan  1 18:16:31 an-c03n01 cib[1762]: notice: cib:diff: ++ <nvpair
> id="nodes-1-standby" name="standby" value="on"/>
> Jan  1 18:16:31 an-c03n01 cib[1762]: notice: cib:diff: ++
> </instance_attributes>
> Jan  1 18:16:31 an-c03n01 crmd[1767]: notice: do_state_transition: State
> transition S_IDLE -> S_POLICY_ENGINE [ input=I_PE_CALC
> cause=C_FSA_INTERNAL origin=abort_transition_graph ]
> Jan  1 18:16:31 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:16:31 an-c03n01 pengine[1766]: notice: LogActions: Move
> ClusterIP    (Started an-c03n01.alteeve.ca -> an-c03n02.alteeve.ca)
> Jan  1 18:16:31 an-c03n01 pengine[1766]: notice: LogActions: Demote
> WebData:0    (Master -> Stopped an-c03n01.alteeve.ca)
> Jan  1 18:16:31 an-c03n01 pengine[1766]: notice: LogActions: Promote
> WebData:1    (Slave -> Master an-c03n02.alteeve.ca)
> Jan  1 18:16:31 an-c03n01 pengine[1766]: notice: LogActions: Move
> WebFS    (Started an-c03n01.alteeve.ca -> an-c03n02.alteeve.ca)
> Jan  1 18:16:31 an-c03n01 pengine[1766]: notice: LogActions: Move
> WebSite    (Started an-c03n01.alteeve.ca -> an-c03n02.alteeve.ca)
> Jan  1 18:16:31 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 37: /var/lib/pacemaker/pengine/pe-input-176.bz2
> Jan  1 18:16:31 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 15: stop ClusterIP_stop_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:16:31 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 6: cancel WebData_cancel_60000 on an-c03n02.alteeve.ca
> Jan  1 18:16:31 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 50: stop WebSite_stop_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:16:31 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 64: notify WebData_pre_notify_demote_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:16:31 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 66: notify WebData_pre_notify_demote_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:31 an-c03n01 apache(WebSite)[14743]: INFO: Attempting
> graceful stop of apache PID 10641
> Jan  1 18:16:31 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=97, rc=0, cib-update=0, confirmed=true) ok
> Jan  1 18:16:31 an-c03n01 IPaddr2(ClusterIP)[14742]: INFO: IP status =
> ok, IP_CIP=
> Jan  1 18:16:31 an-c03n01 avahi-daemon[298]: Withdrawing address record
> for 192.168.122.10 on eth0.
> Jan  1 18:16:31 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation ClusterIP_stop_0 (call=94, rc=0, cib-update=283,
> confirmed=true) ok
> Jan  1 18:16:33 an-c03n01 apache(WebSite)[14743]: INFO: apache stopped.
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebSite_stop_0 (call=96, rc=0, cib-update=284, confirmed=true) ok
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: run_graph: Transition 37
> (Complete=6, Pending=0, Fired=0, Skipped=22, Incomplete=13,
> Source=/var/lib/pacemaker/pengine/pe-input-176.bz2): Stopped
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Start
> ClusterIP    (an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Demote
> WebData:0    (Master -> Stopped an-c03n01.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Promote
> WebData:1    (Slave -> Master an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Move
> WebFS    (Started an-c03n01.alteeve.ca -> an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebSite    (an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 38: /var/lib/pacemaker/pengine/pe-input-177.bz2
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 12: start ClusterIP_start_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 43: stop WebFS_stop_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 59: notify WebData_pre_notify_demote_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 61: notify WebData_pre_notify_demote_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=100, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:16:33 an-c03n01 Filesystem(WebFS)[14862]: INFO: Running stop
> for /dev/drbd0 on /var/www/html
> Jan  1 18:16:33 an-c03n01 Filesystem(WebFS)[14862]: INFO: Trying to
> unmount /var/www/html
> Jan  1 18:16:33 an-c03n01 Filesystem(WebFS)[14862]: INFO: unmounted
> /var/www/html successfully
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebFS_stop_0 (call=99, rc=0, cib-update=286, confirmed=true) ok
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 14: demote WebData_demote_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 13: monitor ClusterIP_monitor_30000 on an-c03n02.alteeve.ca
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.547981] block drbd0: role(
> Primary -> Secondary )
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.548034] block drbd0: bitmap
> WRITE of 0 pages took 0 jiffies
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.548037] block drbd0: 0 KB (0
> bits) marked out-of-sync by on disk bit-map.
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_demote_0 (call=101, rc=0, cib-update=287,
> confirmed=true) ok
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 60: notify WebData_post_notify_demote_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 62: notify WebData_post_notify_demote_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=102, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 54: notify WebData_pre_notify_stop_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 55: notify WebData_pre_notify_stop_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=103, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 15: stop WebData_stop_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.638480] drbd r0: peer(
> Secondary -> Unknown ) conn( Connected -> Disconnecting ) pdsk( UpToDate
> -> DUnknown )
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.638490] drbd r0: asender
> terminated
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.638492] drbd r0: Terminating
> drbd_a_r0
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.638732] drbd r0: Connection closed
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.638741] drbd r0: conn(
> Disconnecting -> StandAlone )
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.638742] drbd r0: receiver
> terminated
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.638743] drbd r0: Terminating
> drbd_r_r0
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.638776] block drbd0: disk(
> UpToDate -> Failed )
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.651161] block drbd0: bitmap
> WRITE of 0 pages took 0 jiffies
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.651164] block drbd0: 0 KB (0
> bits) marked out-of-sync by on disk bit-map.
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.651167] block drbd0: disk(
> Failed -> Diskless )
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.651239] block drbd0:
> drbd_bm_resize called with capacity == 0
> Jan  1 18:16:33 an-c03n01 kernel: [ 2911.651279] drbd r0: Terminating
> drbd_w_r0
> Jan  1 18:16:33 an-c03n01 attrd[1765]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: master-WebData (<null>)
> Jan  1 18:16:33 an-c03n01 attrd[1765]: notice: attrd_perform_update:
> Sent delete 154: node=1, attr=master-WebData, id=<n/a>, set=(null),
> section=status
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_stop_0 (call=104, rc=0, cib-update=288,
> confirmed=true) ok
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 56: notify WebData_post_notify_stop_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: run_graph: Transition 38
> (Complete=24, Pending=0, Fired=0, Skipped=10, Incomplete=4,
> Source=/var/lib/pacemaker/pengine/pe-input-177.bz2): Stopped
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Promote
> WebData:0    (Slave -> Master an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebFS    (an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebSite    (an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 39: /var/lib/pacemaker/pengine/pe-input-178.bz2
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Promote
> WebData:0    (Slave -> Master an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebFS    (an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebSite    (an-c03n02.alteeve.ca)
> Jan  1 18:16:33 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 40: /var/lib/pacemaker/pengine/pe-input-178.bz2
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 54: notify WebData_pre_notify_promote_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:33 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 16: promote WebData_promote_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: handle_request: Current
> ping state: S_TRANSITION_ENGINE
> Jan  1 18:16:34 an-c03n01 cib[1762]: notice: cib:diff: Diff: --- 0.190.11
> Jan  1 18:16:34 an-c03n01 cib[1762]: notice: cib:diff: Diff: +++ 0.191.1
> ba2e8658b93b742e3ef335fa93b50c26
> Jan  1 18:16:34 an-c03n01 cib[1762]: notice: cib:diff: -- <cib
> admin_epoch="0" epoch="190" num_updates="11"/>
> Jan  1 18:16:34 an-c03n01 cib[1762]: notice: cib:diff: ++ <rsc_location
> rsc="WebDataClone" id="drbd-fence-by-handler-r0-WebDataClone">
> Jan  1 18:16:34 an-c03n01 cib[1762]: notice: cib:diff: ++         <rule
> role="Master" score="-INFINITY"
> id="drbd-fence-by-handler-r0-rule-WebDataClone">
> Jan  1 18:16:34 an-c03n01 cib[1762]: notice: cib:diff: ++ <expression
> attribute="#uname" operation="ne" value="an-c03n02.alteeve.ca"
> id="drbd-fence-by-handler-r0-expr-WebDataClone"/>
> Jan  1 18:16:34 an-c03n01 cib[1762]: notice: cib:diff: ++         </rule>
> Jan  1 18:16:34 an-c03n01 cib[1762]: notice: cib:diff: ++ </rsc_location>
> Jan  1 18:16:34 an-c03n01 stonith-ng[1763]: notice: unpack_config: On
> loss of CCM Quorum: Ignore
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 55: notify WebData_post_notify_promote_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:34 an-c03n01 stonith-ng[1763]: notice:
> stonith_device_register: Added 'fence_n01_virsh' to the device list (2
> active devices)
> Jan  1 18:16:34 an-c03n01 stonith-ng[1763]: notice:
> stonith_device_register: Added 'fence_n02_virsh' to the device list (2
> active devices)
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: run_graph: Transition 40
> (Complete=9, Pending=0, Fired=0, Skipped=4, Incomplete=0,
> Source=/var/lib/pacemaker/pengine/pe-input-178.bz2): Stopped
> Jan  1 18:16:34 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:16:34 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebFS    (an-c03n02.alteeve.ca)
> Jan  1 18:16:34 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebSite    (an-c03n02.alteeve.ca)
> Jan  1 18:16:34 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 41: /var/lib/pacemaker/pengine/pe-input-179.bz2
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 42: start WebFS_start_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 43: monitor WebFS_monitor_60000 on an-c03n02.alteeve.ca
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 44: start WebSite_start_0 on an-c03n02.alteeve.ca
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 45: monitor WebSite_monitor_60000 on an-c03n02.alteeve.ca
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: run_graph: Transition 41
> (Complete=4, Pending=0, Fired=0, Skipped=0, Incomplete=0,
> Source=/var/lib/pacemaker/pengine/pe-input-179.bz2): Complete
> Jan  1 18:16:34 an-c03n01 crmd[1767]: notice: do_state_transition: State
> transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS
> cause=C_FSA_INTERNAL origin=notify_crmd ]
>
> ==[an-c03n02 logs]===
> Jan  1 18:16:31 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=90, rc=0, cib-update=0, confirmed=true) ok
> Jan  1 18:16:33 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=92, rc=0, cib-update=0, confirmed=true) ok
> Jan  1 18:16:33 an-c03n02 IPaddr2(ClusterIP)[32025]: INFO: Adding inet
> address 192.168.122.10/32 to device eth0
> Jan  1 18:16:33 an-c03n02 avahi-daemon[295]: Registering new address
> record for 192.168.122.10 on eth0.IPv4.
> Jan  1 18:16:33 an-c03n02 IPaddr2(ClusterIP)[32025]: INFO: Bringing
> device eth0 up
> Jan  1 18:16:33 an-c03n02 IPaddr2(ClusterIP)[32025]: INFO:
> /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p
> /var/run/resource-agents/send_arp-192.168.122.10 eth0 192.168.122.10
> auto not_used not_used
> Jan  1 18:16:33 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation ClusterIP_start_0 (call=91, rc=0, cib-update=62,
> confirmed=true) ok
> Jan  1 18:16:33 an-c03n02 kernel: [19473.555292] block drbd0: peer(
> Primary -> Secondary )
> Jan  1 18:16:33 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation ClusterIP_monitor_30000 (call=93, rc=0, cib-update=63,
> confirmed=false) ok
> Jan  1 18:16:33 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=94, rc=0, cib-update=0, confirmed=true) ok
> Jan  1 18:16:33 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=95, rc=0, cib-update=0, confirmed=true) ok
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645249] drbd r0: peer(
> Secondary -> Unknown ) conn( Connected -> TearDown ) pdsk( UpToDate ->
> DUnknown )
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645267] drbd r0: asender
> terminated
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645269] drbd r0: Terminating
> drbd_a_r0
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645531] drbd r0: Connection closed
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645538] drbd r0: conn( TearDown
> -> Unconnected )
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645539] drbd r0: receiver
> terminated
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645540] drbd r0: Restarting
> receiver thread
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645541] drbd r0: receiver
> (re)started
> Jan  1 18:16:33 an-c03n02 kernel: [19473.645546] drbd r0: conn(
> Unconnected -> WFConnection )
> Jan  1 18:16:33 an-c03n02 attrd[16386]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: master-WebData (1000)
> Jan  1 18:16:33 an-c03n02 attrd[16386]: notice: attrd_perform_update:
> Sent update 152: master-WebData=1000
> Jan  1 18:16:33 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=96, rc=0, cib-update=0, confirmed=true) ok
> Jan  1 18:16:33 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=97, rc=0, cib-update=0, confirmed=true) ok
> Jan  1 18:16:34 an-c03n02 kernel: [19473.791812] drbd r0: helper
> command: /sbin/drbdadm fence-peer r0
> Jan  1 18:16:34 an-c03n02 crm-fence-peer.sh[32287]: invoked for r0
> Jan  1 18:16:34 an-c03n02 cibadmin[32319]: notice: crm_log_args:
> Invoked: cibadmin -C -o constraints -X <rsc_location rsc="WebDataClone"
> id="drbd-fence-by-handler-r0-WebDataClone">
>    <rule role="Master" score="-INFINITY"
> id="drbd-fence-by-handler-r0-rule-WebDataClone">
>      <expression attribute="#uname" operation="ne"
> value="an-c03n02.alteeve.ca"
> id="drbd-fence-by-handler-r0-expr-WebDataClone"/>
>    </rule>
> </rsc_location>
> Jan  1 18:16:34 an-c03n02 kernel: [19473.836661] drbd r0: helper
> command: /sbin/drbdadm fence-peer r0 exit code 4 (0x400)
> Jan  1 18:16:34 an-c03n02 kernel: [19473.836664] drbd r0: fence-peer
> helper returned 4 (peer was fenced)
> Jan  1 18:16:34 an-c03n02 kernel: [19473.836676] drbd r0: pdsk( DUnknown
> -> Outdated )
> Jan  1 18:16:34 an-c03n02 kernel: [19473.836681] block drbd0: role(
> Secondary -> Primary )
> Jan  1 18:16:34 an-c03n02 stonith-ng[16384]: notice: unpack_config: On
> loss of CCM Quorum: Ignore
> Jan  1 18:16:34 an-c03n02 crm-fence-peer.sh[32287]: INFO peer is
> reachable, my disk is UpToDate: placed constraint
> 'drbd-fence-by-handler-r0-WebDataClone'
> Jan  1 18:16:34 an-c03n02 kernel: [19473.837205] block drbd0: new
> current UUID
> 3E5C4A9426BEF115:6A994DBF33FF45AE:2A36647E5537B8FC:2A35647E5537B8FC
> Jan  1 18:16:34 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_promote_0 (call=98, rc=0, cib-update=64,
> confirmed=true) ok
> Jan  1 18:16:34 an-c03n02 stonith-ng[16384]: notice:
> stonith_device_register: Device 'fence_n01_virsh' already existed in
> device list (2 active devices)
> Jan  1 18:16:34 an-c03n02 attrd[16386]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: master-WebData (10000)
> Jan  1 18:16:34 an-c03n02 attrd[16386]: notice: attrd_perform_update:
> Sent update 154: master-WebData=10000
> Jan  1 18:16:34 an-c03n02 stonith-ng[16384]: notice:
> stonith_device_register: Device 'fence_n02_virsh' already existed in
> device list (2 active devices)
> Jan  1 18:16:34 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=99, rc=0, cib-update=0, confirmed=true) ok
> Jan  1 18:16:34 an-c03n02 Filesystem(WebFS)[32362]: INFO: Running start
> for /dev/drbd0 on /var/www/html
> Jan  1 18:16:34 an-c03n02 kernel: [19473.948256] EXT4-fs (drbd0):
> mounted filesystem with ordered data mode. Opts: (null)
> Jan  1 18:16:34 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebFS_start_0 (call=100, rc=0, cib-update=65, confirmed=true) ok
> Jan  1 18:16:34 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebFS_monitor_60000 (call=101, rc=0, cib-update=66,
> confirmed=false) ok
> Jan  1 18:16:34 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:16:34 an-c03n02 apache(WebSite)[32427]: INFO: AH00558: httpd:
> Could not reliably determine the server's fully qualified domain name,
> using an-c03n02.alteeve.ca. Set the 'ServerName' directive globally to
> suppress this message
> Jan  1 18:16:34 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebSite_start_0 (call=102, rc=0, cib-update=67,
> confirmed=true) ok
> Jan  1 18:16:34 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebSite_monitor_60000 (call=103, rc=0, cib-update=68,
> confirmed=false) ok
> Jan  1 18:16:43 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:16:53 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:17:03 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:17:13 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:17:23 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:17:33 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:17:43 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:17:53 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> Jan  1 18:18:03 an-c03n02 systemd: pacemaker.service: Got notification
> message from PID 32497, but reception only permitted for PID 16382
> =====
>
> Not sure why this happened:
> =====
> Jan  1 18:16:34 an-c03n02 cibadmin[32319]: notice: crm_log_args:
> Invoked: cibadmin -C -o constraints -X <rsc_location rsc="WebDataClone"
> id="drbd-fence-by-handler-r0-WebDataClone">
>    <rule role="Master" score="-INFINITY"
> id="drbd-fence-by-handler-r0-rule-WebDataClone">
>      <expression attribute="#uname" operation="ne"
> value="an-c03n02.alteeve.ca"
> id="drbd-fence-by-handler-r0-expr-WebDataClone"/>
>    </rule>
> </rsc_location>
> =====
>
> That seems to be the problem, but let me continue.
>
> Everything fails over properly:
>
> =====
> [root at an-c03n01 ~]# pcs status
> Cluster name: an-cluster-03
> Last updated: Wed Jan  1 18:19:01 2014
> Last change: Wed Jan  1 18:16:34 2014 via cibadmin on an-c03n02.alteeve.ca
> Stack: corosync
> Current DC: an-c03n01.alteeve.ca (1) - partition with quorum
> Version: 1.1.10-19.el7-368c726
> 2 Nodes configured
> 7 Resources configured
>
>
> Node an-c03n01.alteeve.ca (1): standby
> Online: [ an-c03n02.alteeve.ca ]
>
> Full list of resources:
>
>   fence_n01_virsh    (stonith:fence_virsh):    Started an-c03n02.alteeve.ca
>   fence_n02_virsh    (stonith:fence_virsh):    Started an-c03n02.alteeve.ca
>   ClusterIP    (ocf::heartbeat:IPaddr2):    Started an-c03n02.alteeve.ca
>   Master/Slave Set: WebDataClone [WebData]
>       Masters: [ an-c03n02.alteeve.ca ]
>       Stopped: [ an-c03n01.alteeve.ca ]
>   WebFS    (ocf::heartbeat:Filesystem):    Started an-c03n02.alteeve.ca
>   WebSite    (ocf::heartbeat:apache):    Started an-c03n02.alteeve.ca
>
> PCSD Status:
> an-c03n01.alteeve.ca:
>    an-c03n01.alteeve.ca: Online
> an-c03n02.alteeve.ca:
>    an-c03n02.alteeve.ca: Online
>
> Daemon Status:
>    corosync: active/disabled
>    pacemaker: active/disabled
>    pcsd: active/enabled
> =====
>
> Current config:
> =====
> [root at an-c03n01 ~]# pcs config show
> Cluster Name: an-cluster-03
> Corosync Nodes:
>   an-c03n01.alteeve.ca an-c03n02.alteeve.ca
> Pacemaker Nodes:
>   an-c03n01.alteeve.ca an-c03n02.alteeve.ca
>
> Resources:
>   Resource: ClusterIP (class=ocf provider=heartbeat type=IPaddr2)
>    Attributes: ip=192.168.122.10 cidr_netmask=32
>    Operations: monitor interval=30s (ClusterIP-monitor-interval-30s)
>   Master: WebDataClone
>    Meta Attrs: master-max=1 master-node-max=1 clone-max=2
> clone-node-max=1 notify=true
>    Resource: WebData (class=ocf provider=linbit type=drbd)
>     Attributes: drbd_resource=r0
>     Operations: monitor interval=60s (WebData-monitor-60s)
>   Resource: WebFS (class=ocf provider=heartbeat type=Filesystem)
>    Attributes: device=/dev/drbd0 directory=/var/www/html fstype=ext4
>    Operations: monitor interval=60s (WebFS-monitor-interval-60s)
>   Resource: WebSite (class=ocf provider=heartbeat type=apache)
>    Attributes: configfile=/etc/httpd/conf/httpd.conf
> statusurl=http://127.0.0.1/server-status
>    Operations: monitor interval=60s (WebSite-monitor-interval-60s)
>
> Stonith Devices:
>   Resource: fence_n01_virsh (class=stonith type=fence_virsh)
>    Attributes: pcmk_host_list=an-c03n01.alteeve.ca ipaddr=lemass
> login=root passwd_script=/root/lemass.pw delay=15 port=an-c03n01
>    Operations: monitor interval=60s (fence_n01_virsh-monitor-interval-60s)
>   Resource: fence_n02_virsh (class=stonith type=fence_virsh)
>    Attributes: pcmk_host_list=an-c03n02.alteeve.ca ipaddr=lemass
> login=root passwd_script=/root/lemass.pw port=an-c03n02
>    Operations: monitor interval=60s (fence_n02_virsh-monitor-interval-60s)
> Fencing Levels:
>
> Location Constraints:
>    Resource: ClusterIP
>      Enabled on: an-c03n01.alteeve.ca (score:INFINITY) (role: Started)
> (id:cli-prefer-ClusterIP)
>    Resource: WebDataClone
>      Enabled on: an-c03n01.alteeve.ca (score:INFINITY) (role: Started)
> (id:cli-prefer-WebDataClone)
>      Constraint: drbd-fence-by-handler-r0-WebDataClone
>        Rule: score=-INFINITY role=Master
> (id:drbd-fence-by-handler-r0-rule-WebDataClone)
>          Expression: #uname ne an-c03n02.alteeve.ca
> (id:drbd-fence-by-handler-r0-expr-WebDataClone)
>    Resource: WebSite
>      Enabled on: an-c03n01.alteeve.ca (score:INFINITY) (role: Started)
> (id:cli-prefer-WebSite)
> Ordering Constraints:
>    promote WebDataClone then start WebFS (Mandatory)
> (id:order-WebDataClone-WebFS-mandatory)
>    start WebFS then start WebSite (Mandatory)
> (id:order-WebFS-WebSite-mandatory)
> Colocation Constraints:
>    WebFS with WebDataClone (INFINITY) (with-rsc-role:Master)
> (id:colocation-WebFS-WebDataClone-INFINITY)
>    WebSite with ClusterIP (INFINITY)
> (id:colocation-WebSite-ClusterIP-INFINITY)
>    WebSite with WebFS (INFINITY) (id:colocation-WebSite-WebFS-INFINITY)
>
> Cluster Properties:
>   cluster-infrastructure: corosync
>   dc-version: 1.1.10-19.el7-368c726
>   last-lrm-refresh: 1388615115
>   no-quorum-policy: ignore
>   stonith-enabled: true
> Node Attributes:
>   an-c03n01.alteeve.ca: standby=on
> =====
>
> Put an-c03n01 into unstandby:
>
> =====
> [root at an-c03n01 ~]# pcs cluster unstandby an-c03n01.alteeve.ca
>
> ==[an-c03n01 logs]===
> Jan  1 18:22:32 an-c03n01 cib[1762]: notice: cib:diff: Diff: --- 0.191.7
> Jan  1 18:22:32 an-c03n01 cib[1762]: notice: cib:diff: Diff: +++ 0.192.1
> 9eac78791868c0b3878e59927a934eba
> Jan  1 18:22:32 an-c03n01 cib[1762]: notice: cib:diff: -- <nvpair
> id="nodes-1-standby" name="standby" value="on"/>
> Jan  1 18:22:32 an-c03n01 cib[1762]: notice: cib:diff: ++ <cib
> admin_epoch="0" cib-last-written="Wed Jan  1 18:22:32 2014"
> crm_feature_set="3.0.7" epoch="192" have-quorum="1" num_updates="1"
> update-client="crm_attribute" update-origin="an-c03n01.alteeve.ca"
> validate-with="pacemaker-1.2" dc-uuid="1"/>
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: do_state_transition: State
> transition S_IDLE -> S_POLICY_ENGINE [ input=I_PE_CALC
> cause=C_FSA_INTERNAL origin=abort_transition_graph ]
> Jan  1 18:22:32 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:22:32 an-c03n01 pengine[1766]: notice: LogActions: Move
> ClusterIP    (Started an-c03n02.alteeve.ca -> an-c03n01.alteeve.ca)
> Jan  1 18:22:32 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebData:1    (an-c03n01.alteeve.ca)
> Jan  1 18:22:32 an-c03n01 pengine[1766]: notice: LogActions: Stop
> WebSite    (an-c03n02.alteeve.ca)
> Jan  1 18:22:32 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 42: /var/lib/pacemaker/pengine/pe-input-180.bz2
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 14: stop ClusterIP_stop_0 on an-c03n02.alteeve.ca
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 49: stop WebSite_stop_0 on an-c03n02.alteeve.ca
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 53: notify WebData_pre_notify_start_0 on an-c03n02.alteeve.ca
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 21: start WebData:1_start_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 15: start ClusterIP_start_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:22:32 an-c03n01 IPaddr2(ClusterIP)[15640]: INFO: Adding inet
> address 192.168.122.10/32 to device eth0
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.282453] drbd r0: Starting
> worker thread (from drbdsetup [15705])
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.283364] block drbd0: disk(
> Diskless -> Attaching )
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.283429] drbd r0: Method to
> ensure write ordering: drain
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.283431] block drbd0: max BIO
> size = 1048576
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.283433] block drbd0: Adjusting
> my ra_pages to backing device's (32 -> 1024)
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.283435] block drbd0:
> drbd_bm_resize called with capacity == 102392704
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.283580] block drbd0: resync
> bitmap: bits=12799088 words=199986 pages=391
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.283582] block drbd0: size = 49
> GB (51196352 KB)
> Jan  1 18:22:32 an-c03n01 avahi-daemon[298]: Registering new address
> record for 192.168.122.10 on eth0.IPv4.
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.285244] block drbd0: bitmap
> READ of 391 pages took 2 jiffies
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.285466] block drbd0: recounting
> of set bits took additional 0 jiffies
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.285468] block drbd0: 0 KB (0
> bits) marked out-of-sync by on disk bit-map.
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.285473] block drbd0: disk(
> Attaching -> Consistent )
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.285475] block drbd0: attached
> to UUIDs
> 6A994DBF33FF45AE:0000000000000000:2A36647E5537B8FC:2A35647E5537B8FC
> Jan  1 18:22:32 an-c03n01 IPaddr2(ClusterIP)[15640]: INFO: Bringing
> device eth0 up
> Jan  1 18:22:32 an-c03n01 IPaddr2(ClusterIP)[15640]: INFO:
> /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p
> /var/run/resource-agents/send_arp-192.168.122.10 eth0 192.168.122.10
> auto not_used not_used
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.292172] drbd r0: conn(
> StandAlone -> Unconnected )
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.292185] drbd r0: Starting
> receiver thread (from drbd_w_r0 [15707])
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.292199] drbd r0: receiver
> (re)started
> Jan  1 18:22:32 an-c03n01 kernel: [ 3270.292205] drbd r0: conn(
> Unconnected -> WFConnection )
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation ClusterIP_start_0 (call=106, rc=0, cib-update=293,
> confirmed=true) ok
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 16: monitor ClusterIP_monitor_30000 on an-c03n01.alteeve.ca (local)
> Jan  1 18:22:32 an-c03n01 attrd[1765]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: master-WebData (5)
> Jan  1 18:22:32 an-c03n01 attrd[1765]: notice: attrd_perform_update:
> Sent update 160: master-WebData=5
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_start_0 (call=105, rc=0, cib-update=294,
> confirmed=true) ok
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 54: notify WebData_post_notify_start_0 on an-c03n02.alteeve.ca
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 55: notify WebData:1_post_notify_start_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation ClusterIP_monitor_30000 (call=107, rc=0, cib-update=295,
> confirmed=false) ok
> Jan  1 18:22:32 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=108, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.793761] drbd r0: Handshake
> successful: Agreed network protocol version 101
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.793765] drbd r0: Agreed to
> support TRIM on protocol level
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.793809] drbd r0: conn(
> WFConnection -> WFReportParams )
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.793812] drbd r0: Starting
> asender thread (from drbd_r_r0 [15727])
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.799253] block drbd0:
> drbd_sync_handshake:
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.799258] block drbd0: self
> 6A994DBF33FF45AE:0000000000000000:2A36647E5537B8FC:2A35647E5537B8FC
> bits:0 flags:0
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.799261] block drbd0: peer
> 3E5C4A9426BEF115:6A994DBF33FF45AE:2A36647E5537B8FC:2A35647E5537B8FC
> bits:1 flags:0
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.799263] block drbd0:
> uuid_compare()=-1 by rule 50
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.799270] block drbd0: peer(
> Unknown -> Primary ) conn( WFReportParams -> WFBitMapT ) disk(
> Consistent -> Outdated ) pdsk( DUnknown -> UpToDate )
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.803153] block drbd0: receive
> bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23;
> compression: 100.0%
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.803403] block drbd0: send
> bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23;
> compression: 100.0%
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.803409] block drbd0: conn(
> WFBitMapT -> WFSyncUUID )
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.805842] block drbd0: updated
> sync uuid
> 6A9A4DBF33FF45AE:0000000000000000:2A36647E5537B8FC:2A35647E5537B8FC
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.806189] block drbd0: helper
> command: /sbin/drbdadm before-resync-target minor-0
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.807379] block drbd0: helper
> command: /sbin/drbdadm before-resync-target minor-0 exit code 0 (0x0)
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.807392] block drbd0: conn(
> WFSyncUUID -> SyncTarget ) disk( Outdated -> Inconsistent )
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.807398] block drbd0: Began
> resync as SyncTarget (will sync 4 KB [1 bits set]).
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.809259] block drbd0: Resync
> done (total 1 sec; paused 0 sec; 4 K/sec)
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.809264] block drbd0: updated
> UUIDs 3E5C4A9426BEF114:0000000000000000:6A9A4DBF33FF45AE:6A994DBF33FF45AE
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.809268] block drbd0: conn(
> SyncTarget -> Connected ) disk( Inconsistent -> UpToDate )
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.809382] block drbd0: helper
> command: /sbin/drbdadm after-resync-target minor-0
> Jan  1 18:22:33 an-c03n01 kernel: [ 3270.810777] block drbd0: helper
> command: /sbin/drbdadm after-resync-target minor-0 exit code 0 (0x0)
> Jan  1 18:22:34 an-c03n01 crmd[1767]: notice: run_graph: Transition 42
> (Complete=14, Pending=0, Fired=0, Skipped=2, Incomplete=0,
> Source=/var/lib/pacemaker/pengine/pe-input-180.bz2): Stopped
> Jan  1 18:22:34 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:22:34 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 43: /var/lib/pacemaker/pengine/pe-input-181.bz2
> Jan  1 18:22:34 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 17: monitor WebData_monitor_60000 on an-c03n01.alteeve.ca (local)
> Jan  1 18:22:34 an-c03n01 attrd[1765]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: master-WebData (10000)
> Jan  1 18:22:34 an-c03n01 attrd[1765]: notice: attrd_perform_update:
> Sent update 162: master-WebData=10000
> Jan  1 18:22:34 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_monitor_60000 (call=109, rc=0, cib-update=297,
> confirmed=false) ok
> Jan  1 18:22:34 an-c03n01 crmd[1767]: notice: run_graph: Transition 43
> (Complete=1, Pending=0, Fired=0, Skipped=0, Incomplete=0,
> Source=/var/lib/pacemaker/pengine/pe-input-181.bz2): Complete
> Jan  1 18:22:34 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:22:34 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 44: /var/lib/pacemaker/pengine/pe-input-182.bz2
> Jan  1 18:22:34 an-c03n01 crmd[1767]: notice: run_graph: Transition 44
> (Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0,
> Source=/var/lib/pacemaker/pengine/pe-input-182.bz2): Complete
> Jan  1 18:22:34 an-c03n01 crmd[1767]: notice: do_state_transition: State
> transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS
> cause=C_FSA_INTERNAL origin=notify_crmd ]
>
> ==[an-c03n02 logs]===
> Jan  1 18:22:32 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=108, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:22:32 an-c03n02 apache(WebSite)[2631]: INFO: Attempting
> graceful stop of apache PID 32497
> Jan  1 18:22:32 an-c03n02 IPaddr2(ClusterIP)[2630]: INFO: IP status =
> ok, IP_CIP=
> Jan  1 18:22:32 an-c03n02 avahi-daemon[295]: Withdrawing address record
> for 192.168.122.10 on eth0.
> Jan  1 18:22:32 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation ClusterIP_stop_0 (call=105, rc=0, cib-update=69,
> confirmed=true) ok
> Jan  1 18:22:32 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=109, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:22:33 an-c03n02 kernel: [19832.801173] drbd r0: Handshake
> successful: Agreed network protocol version 101
> Jan  1 18:22:33 an-c03n02 kernel: [19832.801177] drbd r0: Agreed to
> support TRIM on protocol level
> Jan  1 18:22:33 an-c03n02 kernel: [19832.801209] drbd r0: conn(
> WFConnection -> WFReportParams )
> Jan  1 18:22:33 an-c03n02 kernel: [19832.801211] drbd r0: Starting
> asender thread (from drbd_r_r0 [19965])
> Jan  1 18:22:33 an-c03n02 kernel: [19832.810255] block drbd0:
> drbd_sync_handshake:
> Jan  1 18:22:33 an-c03n02 kernel: [19832.810260] block drbd0: self
> 3E5C4A9426BEF115:6A994DBF33FF45AE:2A36647E5537B8FC:2A35647E5537B8FC
> bits:1 flags:0
> Jan  1 18:22:33 an-c03n02 kernel: [19832.810263] block drbd0: peer
> 6A994DBF33FF45AE:0000000000000000:2A36647E5537B8FC:2A35647E5537B8FC
> bits:0 flags:0
> Jan  1 18:22:33 an-c03n02 kernel: [19832.810265] block drbd0:
> uuid_compare()=1 by rule 70
> Jan  1 18:22:33 an-c03n02 kernel: [19832.810270] block drbd0: peer(
> Unknown -> Secondary ) conn( WFReportParams -> WFBitMapS ) pdsk(
> Outdated -> Consistent )
> Jan  1 18:22:33 an-c03n02 kernel: [19832.810573] block drbd0: send
> bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23;
> compression: 100.0%
> Jan  1 18:22:33 an-c03n02 kernel: [19832.811274] block drbd0: receive
> bitmap stats [Bytes(packets)]: plain 0(0), RLE 23(1), total 23;
> compression: 100.0%
> Jan  1 18:22:33 an-c03n02 kernel: [19832.811279] block drbd0: helper
> command: /sbin/drbdadm before-resync-source minor-0
> Jan  1 18:22:33 an-c03n02 kernel: [19832.812937] block drbd0: helper
> command: /sbin/drbdadm before-resync-source minor-0 exit code 0 (0x0)
> Jan  1 18:22:33 an-c03n02 kernel: [19832.812951] block drbd0: conn(
> WFBitMapS -> SyncSource ) pdsk( Consistent -> Inconsistent )
> Jan  1 18:22:33 an-c03n02 kernel: [19832.812959] block drbd0: Began
> resync as SyncSource (will sync 4 KB [1 bits set]).
> Jan  1 18:22:33 an-c03n02 kernel: [19832.812996] block drbd0: updated
> sync UUID
> 3E5C4A9426BEF115:6A9A4DBF33FF45AE:6A994DBF33FF45AE:2A36647E5537B8FC
> Jan  1 18:22:33 an-c03n02 kernel: [19832.816963] block drbd0: Resync
> done (total 1 sec; paused 0 sec; 4 K/sec)
> Jan  1 18:22:33 an-c03n02 kernel: [19832.816978] block drbd0: updated
> UUIDs 3E5C4A9426BEF115:0000000000000000:6A9A4DBF33FF45AE:6A994DBF33FF45AE
> Jan  1 18:22:33 an-c03n02 kernel: [19832.816985] block drbd0: conn(
> SyncSource -> Connected ) pdsk( Inconsistent -> UpToDate )
> Jan  1 18:22:34 an-c03n02 apache(WebSite)[2631]: INFO: apache stopped.
> Jan  1 18:22:34 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebSite_stop_0 (call=107, rc=0, cib-update=70, confirmed=true) ok
> =====
>
> WebSite stopped and won't start now.
> =====
> [root at an-c03n01 ~]# pcs status
> Cluster name: an-cluster-03
> Last updated: Wed Jan  1 18:23:44 2014
> Last change: Wed Jan  1 18:22:32 2014 via crm_attribute on
> an-c03n01.alteeve.ca
> Stack: corosync
> Current DC: an-c03n01.alteeve.ca (1) - partition with quorum
> Version: 1.1.10-19.el7-368c726
> 2 Nodes configured
> 7 Resources configured
>
>
> Online: [ an-c03n01.alteeve.ca an-c03n02.alteeve.ca ]
>
> Full list of resources:
>
>   fence_n01_virsh    (stonith:fence_virsh):    Started an-c03n02.alteeve.ca
>   fence_n02_virsh    (stonith:fence_virsh):    Started an-c03n02.alteeve.ca
>   ClusterIP    (ocf::heartbeat:IPaddr2):    Started an-c03n01.alteeve.ca
>   Master/Slave Set: WebDataClone [WebData]
>       Masters: [ an-c03n02.alteeve.ca ]
>       Slaves: [ an-c03n01.alteeve.ca ]
>   WebFS    (ocf::heartbeat:Filesystem):    Started an-c03n02.alteeve.ca
>   WebSite    (ocf::heartbeat:apache):    Stopped
>
> PCSD Status:
> an-c03n01.alteeve.ca:
>    an-c03n01.alteeve.ca: Online
> an-c03n02.alteeve.ca:
>    an-c03n02.alteeve.ca: Online
>
> Daemon Status:
>    corosync: active/disabled
>    pacemaker: active/disabled
>    pcsd: active/enabled
> =====
>
> Here is what I see when I try to start it manually:
>
> =====
> [root at an-c03n01 ~]# pcs resource debug-start WebSite --full
> Operation start for WebSite (ocf:heartbeat:apache) returned 0
>   >  stderr: + 18:24:17: 46: . /usr/lib/ocf/lib/heartbeat/apache-conf.sh
>   >  stderr: + 18:24:17: 47: . /usr/lib/ocf/lib/heartbeat/http-mon.sh
>   >  stderr: ++ 18:24:17: 18: bind_address=127.0.0.1
>   >  stderr: ++ 18:24:17: 19: curl_ipv6_opts=
>   >  stderr: ++ 18:24:17: 20: ocf_is_true ''
>   >  stderr: ++ 18:24:17: ocf_is_true:101: case "$1" in
>   >  stderr: ++ 18:24:17: ocf_is_true:103: false
>   >  stderr: ++ 18:24:17: 20: grep -qs ::
>   >  stderr: ++ 18:24:17: 20: echo ''
>   >  stderr: ++ 18:24:17: 24: WGETOPTS='-O- -q -L --no-proxy
> --bind-address=127.0.0.1'
>   >  stderr: ++ 18:24:17: 25: CURLOPTS='-o - -Ss -L --interface lo '
>   >  stderr: + 18:24:17: 48: HA_VARRUNDIR=/var/run/
>   >  stderr: + 18:24:17: 56: IBMHTTPD=/opt/IBMHTTPServer/bin/httpd
>   >  stderr: + 18:24:17: 57: HTTPDLIST='/sbin/httpd2 /usr/sbin/httpd2
> /usr/sbin/apache2 /sbin/httpd /usr/sbin/httpd /usr/sbin/apache
> /opt/IBMHTTPServer/bin/httpd'
>   >  stderr: + 18:24:17: 58: MPM=/usr/share/apache2/find_mpm
>   >  stderr: + 18:24:17: 59: '[' -x /usr/share/apache2/find_mpm ']'
>   >  stderr: + 18:24:17: 63: LOCALHOST=http://localhost
>   >  stderr: + 18:24:17: 64: HTTPDOPTS=-DSTATUS
>   >  stderr: + 18:24:17: 65:
> DEFAULT_IBMCONFIG=/opt/IBMHTTPServer/conf/httpd.conf
>   >  stderr: + 18:24:17: 66: DEFAULT_SUSECONFIG=/etc/apache2/httpd.conf
>   >  stderr: + 18:24:17: 67: DEFAULT_RHELCONFIG=/etc/httpd/conf/httpd.conf
>   >  stderr: ++ 18:24:17: 79: basename
> /usr/lib/ocf/resource.d/heartbeat/apache
>   >  stderr: + 18:24:17: 79: CMD=apache
>   >  stderr: + 18:24:17: 654: OCF_REQUIRED_PARAMS=
>   >  stderr: + 18:24:17: 655: OCF_REQUIRED_BINARIES=
>   >  stderr: + 18:24:17: 656: ocf_rarun start
>   >  stderr: + 18:24:17: ocf_rarun:137: mk_action_func
>   >  stderr: ++ 18:24:17: mk_action_func:50: echo apache_start
>   >  stderr: ++ 18:24:17: mk_action_func:50: tr - _
>   >  stderr: + 18:24:17: mk_action_func:50: ACTION_FUNC=apache_start
>   >  stderr: + 18:24:17: ocf_rarun:138: validate_args
>   >  stderr: + 18:24:17: validate_args:53: is_function apache_start
>   >  stderr: ++ 18:24:17: is_function:41: command -v apache_start
>   >  stderr: + 18:24:17: is_function:41: test zapache_start = zapache_start
>   >  stderr: + 18:24:17: ocf_rarun:139: simple_actions
>   >  stderr: + 18:24:17: simple_actions:60: case $__OCF_ACTION in
>   >  stderr: + 18:24:17: ocf_rarun:140: check_required_params
>   >  stderr: + 18:24:17: check_required_params:74: local v
>   >  stderr: + 18:24:17: ocf_rarun:141: run_function apache_getconfig
>   >  stderr: + 18:24:17: run_function:44: is_function apache_getconfig
>   >  stderr: ++ 18:24:17: is_function:41: command -v apache_getconfig
>   >  stderr: + 18:24:17: is_function:41: test zapache_getconfig =
> zapache_getconfig
>   >  stderr: + 18:24:17: run_function:44: apache_getconfig
>   >  stderr: + 18:24:17: apache_getconfig:628: HTTPD=
>   >  stderr: + 18:24:17: apache_getconfig:629: PORT=
>   >  stderr: + 18:24:17: apache_getconfig:630:
> STATUSURL=http://127.0.0.1/server-status
>   >  stderr: + 18:24:17: apache_getconfig:631:
> CONFIGFILE=/etc/httpd/conf/httpd.conf
>   >  stderr: + 18:24:17: apache_getconfig:632: OPTIONS=
>   >  stderr: + 18:24:17: apache_getconfig:633: CLIENT=
>   >  stderr: + 18:24:17: apache_getconfig:634: TESTREGEX='</ *html *>'
>   >  stderr: + 18:24:17: apache_getconfig:635: TESTURL=
>   >  stderr: + 18:24:17: apache_getconfig:636: TESTREGEX10=
>   >  stderr: + 18:24:17: apache_getconfig:637: TESTCONFFILE=
>   >  stderr: + 18:24:17: apache_getconfig:638: TESTNAME=
>   >  stderr: + 18:24:17: apache_getconfig:639: : /etc/apache2/envvars
>   >  stderr: + 18:24:17: apache_getconfig:640: source_envfiles
> /etc/apache2/envvars
>   >  stderr: + 18:24:17: source_envfiles:16: for f in '"$@"'
>   >  stderr: + 18:24:17: source_envfiles:17: '[' -f /etc/apache2/envvars
> -a -r /etc/apache2/envvars ']'
>   >  stderr: + 18:24:17: apache_getconfig:642: '[' X = X -o '!' -f '' -o
> '!' -x '' ']'
>   >  stderr: + 18:24:17: apache_getconfig:643: find_httpd_prog
>   >  stderr: + 18:24:17: find_httpd_prog:603: case $0 in
>   >  stderr: + 18:24:17: find_httpd_prog:608: HTTPD=
>   >  stderr: + 18:24:17: find_httpd_prog:609: for h in '$HTTPDLIST'
>   >  stderr: + 18:24:17: find_httpd_prog:611: '[' -f /sbin/httpd2 -a -x
> /sbin/httpd2 ']'
>   >  stderr: + 18:24:17: find_httpd_prog:609: for h in '$HTTPDLIST'
>   >  stderr: + 18:24:17: find_httpd_prog:611: '[' -f /usr/sbin/httpd2 -a
> -x /usr/sbin/httpd2 ']'
>   >  stderr: + 18:24:17: find_httpd_prog:609: for h in '$HTTPDLIST'
>   >  stderr: + 18:24:17: find_httpd_prog:611: '[' -f /usr/sbin/apache2
> -a -x /usr/sbin/apache2 ']'
>   >  stderr: + 18:24:17: find_httpd_prog:609: for h in '$HTTPDLIST'
>   >  stderr: + 18:24:17: find_httpd_prog:611: '[' -f /sbin/httpd -a -x
> /sbin/httpd ']'
>   >  stderr: + 18:24:17: find_httpd_prog:612: HTTPD=/sbin/httpd
>   >  stderr: + 18:24:17: find_httpd_prog:613: break
>   >  stderr: + 18:24:17: find_httpd_prog:618: '[' X '!=' X -a
> X/sbin/httpd '!=' X ']'
>   >  stderr: ++ 18:24:17: find_httpd_prog:621: detect_default_config
>   >  stderr: ++ 18:24:17: detect_default_config:387: '[' -f
> /etc/apache2/httpd.conf ']'
>   >  stderr: ++ 18:24:17: detect_default_config:390: echo
> /etc/httpd/conf/httpd.conf
>   >  stderr: + 18:24:17: find_httpd_prog:621:
> DefaultConfig=/etc/httpd/conf/httpd.conf
>   >  stderr: + 18:24:17: apache_getconfig:646:
> CONFIGFILE=/etc/httpd/conf/httpd.conf
>   >  stderr: ++ 18:24:17: apache_getconfig:647: basename /sbin/httpd
>   >  stderr: + 18:24:17: apache_getconfig:647: httpd_basename=httpd
>   >  stderr: + 18:24:17: apache_getconfig:648: case $httpd_basename in
>   >  stderr: + 18:24:17: apache_getconfig:651: GetParams
> /etc/httpd/conf/httpd.conf
>   >  stderr: + 18:24:17: GetParams:134:
> ConfigFile=/etc/httpd/conf/httpd.conf
>   >  stderr: + 18:24:17: GetParams:135: '[' '!' -f
> /etc/httpd/conf/httpd.conf ']'
>   >  stderr: + 18:24:17: GetParams:138: get_apache_params
> /etc/httpd/conf/httpd.conf ServerRoot PidFile Port Listen
>   >  stderr: + 18:24:17: get_apache_params:79:
> configfile=/etc/httpd/conf/httpd.conf
>   >  stderr: + 18:24:17: get_apache_params:80: shift 1
>   >  stderr: ++ 18:24:17: get_apache_params:81: echo ServerRoot PidFile
> Port Listen
>   >  stderr: ++ 18:24:17: get_apache_params:81: sed 's/ /,/g'
>   >  stderr: + 18:24:17: get_apache_params:81:
> vars=ServerRoot,PidFile,Port,Listen
>   >  stderr: ++ 18:24:17: get_apache_params:99: apachecat
> /etc/httpd/conf/httpd.conf
>   >  stderr: ++ 18:24:17: get_apache_params:99: awk -v
> vars=ServerRoot,PidFile,Port,Listen '
>   >  stderr:     BEGIN{
>   >  stderr:         split(vars,v,",");
>   >  stderr:         for( i in v )
>   >  stderr:             vl[i]=tolower(v[i]);
>   >  stderr:     }
>   >  stderr:     {
>   >  stderr:         for( i in v )
>   >  stderr:             if( tolower($1)==vl[i] ) {
>   >  stderr:             print v[i]"="$2
>   >  stderr:             delete vl[i]
>   >  stderr:             break
>   >  stderr:         }
>   >  stderr:     }
>   >  stderr:     '
>   >  stderr: ++ 18:24:17: apachecat:71: sed
> 's/#.*//;s/[[:blank:]]*$//;s/^[[:blank:]]*//'
>   >  stderr: ++ 18:24:17: apachecat:70: awk '
>   >  stderr:     function procline() {
>   >  stderr:         split($0,a);
>   >  stderr:         if( a[1]~/^[Ii]nclude$/ ) {
>   >  stderr:             procinclude(a[2]);
>   >  stderr:         } else {
>   >  stderr:             if( a[1]=="ServerRoot" ) {
>   >  stderr:                 rootdir=a[2];
>   >  stderr:                 gsub("\"","",rootdir);
>   >  stderr:             }
>   >  stderr:             print;
>   >  stderr:         }
>   >  stderr:     }
>   >  stderr:     function printfile(infile, a) {
>   >  stderr:         while( (getline<infile) > 0 ) {
>   >  stderr:             procline();
>   >  stderr:         }
>   >  stderr:         close(infile);
>   >  stderr:     }
>   >  stderr:     function allfiles(dir, cmd,f) {
>   >  stderr:         cmd="find -L "dir" -type f";
>   >  stderr:         while( ( cmd | getline f ) > 0 ) {
>   >  stderr:             printfile(f);
>   >  stderr:         }
>   >  stderr:         close(cmd);
>   >  stderr:     }
>   >  stderr:     function listfiles(pattern, cmd,f) {
>   >  stderr:         cmd="ls "pattern" 2>/dev/null";
>   >  stderr:         while( ( cmd | getline f ) > 0 ) {
>   >  stderr:             printfile(f);
>   >  stderr:         }
>   >  stderr:         close(cmd);
>   >  stderr:     }
>   >  stderr:     function procinclude(spec) {
>   >  stderr:         if( rootdir!="" && spec!~/^\// ) {
>   >  stderr:             spec=rootdir"/"spec;
>   >  stderr:         }
>   >  stderr:         if( isdir(spec) ) {
>   >  stderr:             allfiles(spec); # read all files in a directory
> (and subdirs)
>   >  stderr:         } else {
>   >  stderr:             listfiles(spec); # there could be jokers
>   >  stderr:         }
>   >  stderr:     }
>   >  stderr:     function isdir(s) {
>   >  stderr:         return !system("test -d \""s"\"");
>   >  stderr:     }
>   >  stderr:     { procline(); }
>   >  stderr:     ' /etc/httpd/conf/httpd.conf
>   >  stderr: ++ 18:24:17: apachecat:72: grep -v '^$'
>   >  stderr: + 18:24:17: get_apache_params:98: eval
> 'ServerRoot="/etc/httpd"' Listen=80
>   >  stderr: ++ 18:24:17: get_apache_params:98: ServerRoot=/etc/httpd
>   >  stderr: ++ 18:24:17: get_apache_params:98: Listen=80
>   >  stderr: + 18:24:17: GetParams:139: case $PidFile in
>   >  stderr: + 18:24:17: GetParams:145: PidFile=/var/run//httpd.pid
>   >  stderr: + 18:24:17: GetParams:149: PIDFILE_DIRECTIVE=true
>   >  stderr: + 18:24:17: GetParams:153: for p in '"$PORT"' '"$Port"' 80
>   >  stderr: + 18:24:17: GetParams:154: CheckPort ''
>   >  stderr: + 18:24:17: CheckPort:116: ocf_is_decimal ''
>   >  stderr: + 18:24:17: ocf_is_decimal:92: case "$1" in
>   >  stderr: + 18:24:17: ocf_is_decimal:94: false
>   >  stderr: + 18:24:17: GetParams:153: for p in '"$PORT"' '"$Port"' 80
>   >  stderr: + 18:24:17: GetParams:154: CheckPort ''
>   >  stderr: + 18:24:17: CheckPort:116: ocf_is_decimal ''
>   >  stderr: + 18:24:17: ocf_is_decimal:92: case "$1" in
>   >  stderr: + 18:24:17: ocf_is_decimal:94: false
>   >  stderr: + 18:24:17: GetParams:153: for p in '"$PORT"' '"$Port"' 80
>   >  stderr: + 18:24:17: GetParams:154: CheckPort 80
>   >  stderr: + 18:24:17: CheckPort:116: ocf_is_decimal 80
>   >  stderr: + 18:24:17: ocf_is_decimal:92: case "$1" in
>   >  stderr: + 18:24:17: ocf_is_decimal:96: true
>   >  stderr: + 18:24:17: CheckPort:116: '[' 80 -gt 0 ']'
>   >  stderr: + 18:24:17: GetParams:155: PORT=80
>   >  stderr: + 18:24:17: GetParams:156: break
>   >  stderr: + 18:24:17: GetParams:160: echo 80
>   >  stderr: + 18:24:17: GetParams:160: grep :
>   >  stderr: + 18:24:17: GetParams:161: Listen=localhost:80
>   >  stderr: + 18:24:17: GetParams:183: '['
> Xhttp://127.0.0.1/server-status = X ']'
>   >  stderr: + 18:24:17: GetParams:189: test /var/run//httpd.pid
>   >  stderr: + 18:24:17: GetParams:192: return 0
>   >  stderr: + 18:24:17: ocf_rarun:142: validate_env
>   >  stderr: + 18:24:17: validate_env:122: check_required_binaries
>   >  stderr: + 18:24:17: check_required_binaries:113: local v
>   >  stderr: + 18:24:17: validate_env:123: is_function apache_validate_all
>   >  stderr: ++ 18:24:17: is_function:41: command -v apache_validate_all
>   >  stderr: + 18:24:17: is_function:41: test zapache_validate_all =
> zapache_validate_all
>   >  stderr: + 18:24:17: validate_env:125: local rc
>   >  stderr: + 18:24:17: validate_env:126: LSB_STATUS_STOPPED=3
>   >  stderr: + 18:24:17: validate_env:127: apache_validate_all
>   >  stderr: + 18:24:17: apache_validate_all:573: CheckPort 80
>   >  stderr: + 18:24:17: CheckPort:116: ocf_is_decimal 80
>   >  stderr: + 18:24:17: ocf_is_decimal:92: case "$1" in
>   >  stderr: + 18:24:17: ocf_is_decimal:96: true
>   >  stderr: + 18:24:17: CheckPort:116: '[' 80 -gt 0 ']'
>   >  stderr: + 18:24:17: apache_validate_all:575: : OK
>   >  stderr: + 18:24:17: apache_validate_all:581: case $STATUSURL in
>   >  stderr: + 18:24:17: apache_validate_all:588: '[' '!' -x /sbin/httpd
> ']'
>   >  stderr: + 18:24:17: apache_validate_all:593: '[' '!' -f
> /etc/httpd/conf/httpd.conf ']'
>   >  stderr: + 18:24:17: apache_validate_all:599: return 0
>   >  stderr: + 18:24:17: validate_env:128: rc=0
>   >  stderr: + 18:24:17: validate_env:129: '[' 0 -ne 0 ']'
>   >  stderr: + 18:24:17: ocf_rarun:143: ocf_is_probe
>   >  stderr: + 18:24:17: ocf_is_probe:438: '[' start = monitor -a 0 = 0 ']'
>   >  stderr: + 18:24:17: ocf_rarun:144: shift 1
>   >  stderr: + 18:24:17: ocf_rarun:145: apache_start
>   >  stderr: + 18:24:17: apache_start:164: silent_status
>   >  stderr: + 18:24:17: silent_status:125: '[' -f /var/run//httpd.pid ']'
>   >  stderr: + 18:24:17: silent_status:128: : No pid file
>   >  stderr: + 18:24:17: silent_status:129: false
>   >  stderr: + 18:24:17: apache_start:170: validate_default_config
>   >  stderr: + 18:24:17: validate_default_config:135: '[' -e
> /etc/SuSE-release ']'
>   >  stderr: + 18:24:17: validate_default_config:138: return 0
>   >  stderr: + 18:24:17: apache_start:172: '[' -d /var/run/apache2 ']'
>   >  stderr: + 18:24:17: apache_start:174: '[' -z true ']'
>   >  stderr: + 18:24:17: apache_start:177: ocf_run /sbin/httpd -DSTATUS
> -f /etc/httpd/conf/httpd.conf -c 'PidFile /var/run//httpd.pid'
>   >  stderr: + 18:24:17: ocf_run:361: local rc
>   >  stderr: + 18:24:17: ocf_run:362: local output
>   >  stderr: + 18:24:17: ocf_run:363: local verbose=1
>   >  stderr: + 18:24:17: ocf_run:364: local loglevel=err
>   >  stderr: + 18:24:17: ocf_run:365: local var
>   >  stderr: + 18:24:17: ocf_run:367: for var in 1 2
>   >  stderr: + 18:24:17: ocf_run:369: case "$1" in
>   >  stderr: + 18:24:17: ocf_run:367: for var in 1 2
>   >  stderr: + 18:24:17: ocf_run:369: case "$1" in
>   >  stderr: ++ 18:24:17: ocf_run:381: /sbin/httpd -DSTATUS -f
> /etc/httpd/conf/httpd.conf -c 'PidFile /var/run//httpd.pid'
>   >  stderr: + 18:24:17: ocf_run:381: output=
>   >  stderr: + 18:24:17: ocf_run:382: rc=0
>   >  stderr: ++ 18:24:17: ocf_run:383: echo
>   >  stderr: + 18:24:17: ocf_run:383: output=
>   >  stderr: + 18:24:17: ocf_run:384: '[' 0 -eq 0 ']'
>   >  stderr: + 18:24:17: ocf_run:385: '[' 1 -a '!' -z '' ']'
>   >  stderr: + 18:24:17: ocf_run:388: return 0
>   >  stderr: + 18:24:17: apache_start:180: tries=0
>   >  stderr: + 18:24:17: apache_start:181: :
>   >  stderr: + 18:24:17: apache_start:183: apache_monitor
>   >  stderr: + 18:24:17: apache_monitor:367: silent_status
>   >  stderr: + 18:24:17: silent_status:125: '[' -f /var/run//httpd.pid ']'
>   >  stderr: ++ 18:24:17: silent_status:126: cat /var/run//httpd.pid
>   >  stderr: + 18:24:17: silent_status:126: ProcessRunning 16404
>   >  stderr: + 18:24:17: ProcessRunning:114: ApachePID=16404
>   >  stderr: + 18:24:17: ProcessRunning:116: '[' -d /proc -a -d /proc/1 ']'
>   >  stderr: + 18:24:17: ProcessRunning:117: '[' -d /proc/16404 ']'
>   >  stderr: + 18:24:17: apache_monitor:368: '[' 0 -ne 0 ']'
>   >  stderr: ++ 18:24:17: apache_monitor:373: findhttpclient
>   >  stderr: ++ 18:24:17: findhttpclient:66: '[' x '!=' x ']'
>   >  stderr: ++ 18:24:17: findhttpclient:68: which wget
>   >  stderr: ++ 18:24:17: findhttpclient:69: echo wget
>   >  stderr: + 18:24:17: apache_monitor:373: ourhttpclient=wget
>   >  stderr: + 18:24:17: apache_monitor:374: '[' -z wget ']'
>   >  stderr: + 18:24:17: apache_monitor:379: case `ocf_check_level 10` in
>   >  stderr: ++ 18:24:17: apache_monitor:379: ocf_check_level 10
>   >  stderr: ++ 18:24:17: ocf_check_level:643: local lvl prev
>   >  stderr: ++ 18:24:17: ocf_check_level:644: lvl=0
>   >  stderr: ++ 18:24:17: ocf_check_level:645: prev=0
>   >  stderr: ++ 18:24:17: ocf_check_level:646: ocf_is_decimal 0
>   >  stderr: ++ 18:24:17: ocf_is_decimal:92: case "$1" in
>   >  stderr: ++ 18:24:17: ocf_is_decimal:96: true
>   >  stderr: ++ 18:24:17: ocf_check_level:648: for lvl in '"$@"'
>   >  stderr: ++ 18:24:17: ocf_check_level:649: '[' 10 -eq 0 ']'
>   >  stderr: ++ 18:24:17: ocf_check_level:651: '[' 10 -gt 0 ']'
>   >  stderr: ++ 18:24:17: ocf_check_level:652: lvl=0
>   >  stderr: ++ 18:24:17: ocf_check_level:653: break
>   >  stderr: ++ 18:24:17: ocf_check_level:658: echo 0
>   >  stderr: + 18:24:17: apache_monitor:380: apache_monitor_basic
>   >  stderr: + 18:24:17: apache_monitor_basic:351: wget_func
> http://127.0.0.1/server-status
>   >  stderr: + 18:24:17: apache_monitor_basic:351: grep -Ei '</ *html *>'
>   >  stderr: + 18:24:17: wget_func:49: auth=
>   >  stderr: + 18:24:17: wget_func:50: cl_opts='-O- -q -L --no-proxy
> --bind-address=127.0.0.1 '
>   >  stderr: + 18:24:17: wget_func:51: '[' x '!=' x ']'
>   >  stderr: + 18:24:17: wget_func:53: wget -O- -q -L --no-proxy
> --bind-address=127.0.0.1 http://127.0.0.1/server-status
>   >  stderr: + 18:24:17: apache_monitor_basic:353: return 0
>   >  stderr: + 18:24:17: apache_start:184: ec=0
>   >  stderr: + 18:24:17: apache_start:185: '[' 0 -eq 7 ']'
>   >  stderr: + 18:24:17: apache_start:191: break
>   >  stderr: + 18:24:17: apache_start:195: '[' 0 -ne 0 ']'
>   >  stderr: + 18:24:17: apache_start:198: return 0
> =====
>
> Only log entry is on an-c03n01 (where I ran the command):
>
> =====
> Jan  1 18:24:17 an-c03n01 crm_resource[16170]: notice: crm_log_args:
> Invoked: /usr/sbin/crm_resource -r WebSite --force-start -V
> =====
>
> When I delete the constraint that was added when the node went into
> standby, WebSite starts.
>
> =====
> [root at an-c03n01 ~]# pcs config show | grep uname -B 1
>        Rule: score=-INFINITY role=Master
> (id:drbd-fence-by-handler-r0-rule-WebDataClone)
>          Expression: #uname ne an-c03n02.alteeve.ca
> (id:drbd-fence-by-handler-r0-expr-WebDataClone)
> [root at an-c03n01 ~]# pcs constraint delete
> drbd-fence-by-handler-r0-rule-WebDataClone
> =====
>
> ==[an-c03n01 logs]===
> Jan  1 18:26:44 an-c03n01 cibadmin[17037]: notice: crm_log_args:
> Invoked: /usr/sbin/cibadmin -c -R --xml-text <constraints>
>    <rsc_location id="cli-prefer-WebDataClone"
> node="an-c03n01.alteeve.ca" role="Started" rsc="WebDataClone"
> score="INFINITY"/>
>    <rsc_location id="cli-prefer-ClusterIP" node="an-c03n01.alteeve.ca"
> role="Started" rsc="ClusterIP" score="INFINITY"/>
>    <rsc_colocation id="colocation-WebFS-WebDataClone-INFINITY"
> rsc="WebFS" score="INFINITY" with-rsc="WebDataClone"
> with-rsc-role="Master"/>
>    <rsc_order first="WebDataClone" first-ac
> Jan  1 18:26:44 an-c03n01 cib[1762]: notice: cib:diff: Diff: --- 0.192.9
> Jan  1 18:26:44 an-c03n01 cib[1762]: notice: cib:diff: Diff: +++ 0.193.1
> 695289e37618e5e9a184d2bc8c90d421
> Jan  1 18:26:44 an-c03n01 cib[1762]: notice: cib:diff: -- <rsc_location
> rsc="WebDataClone" id="drbd-fence-by-handler-r0-WebDataClone">
> Jan  1 18:26:44 an-c03n01 cib[1762]: notice: cib:diff: --         <rule
> role="Master" score="-INFINITY"
> id="drbd-fence-by-handler-r0-rule-WebDataClone">
> Jan  1 18:26:44 an-c03n01 cib[1762]: notice: cib:diff: -- <expression
> attribute="#uname" operation="ne" value="an-c03n02.alteeve.ca"
> id="drbd-fence-by-handler-r0-expr-WebDataClone"/>
> Jan  1 18:26:44 an-c03n01 cib[1762]: notice: cib:diff: --         </rule>
> Jan  1 18:26:44 an-c03n01 cib[1762]: notice: cib:diff: -- </rsc_location>
> Jan  1 18:26:44 an-c03n01 cib[1762]: notice: cib:diff: ++ <cib
> admin_epoch="0" cib-last-written="Wed Jan  1 18:26:44 2014"
> crm_feature_set="3.0.7" epoch="193" have-quorum="1" num_updates="1"
> update-client="cibadmin" update-origin="an-c03n01.alteeve.ca"
> validate-with="pacemaker-1.2" dc-uuid="1"/>
> Jan  1 18:26:44 an-c03n01 crmd[1767]: notice: do_state_transition: State
> transition S_IDLE -> S_POLICY_ENGINE [ input=I_PE_CALC
> cause=C_FSA_INTERNAL origin=abort_transition_graph ]
> Jan  1 18:26:44 an-c03n01 stonith-ng[1763]: notice: unpack_config: On
> loss of CCM Quorum: Ignore
> Jan  1 18:26:44 an-c03n01 crmd[1767]: notice: do_state_transition: State
> transition S_ELECTION -> S_INTEGRATION [ input=I_ELECTION_DC
> cause=C_FSA_INTERNAL origin=do_election_check ]
> Jan  1 18:26:44 an-c03n01 stonith-ng[1763]: notice:
> stonith_device_register: Added 'fence_n01_virsh' to the device list (2
> active devices)
> Jan  1 18:26:44 an-c03n01 stonith-ng[1763]: notice:
> stonith_device_register: Added 'fence_n02_virsh' to the device list (2
> active devices)
> Jan  1 18:26:45 an-c03n01 attrd[1765]: notice: attrd_local_callback:
> Sending full refresh (origin=crmd)
> Jan  1 18:26:45 an-c03n01 attrd[1765]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: master-WebData (10000)
> Jan  1 18:26:45 an-c03n01 attrd[1765]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: probe_complete (true)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: LogActions: Promote
> WebData:0    (Slave -> Master an-c03n01.alteeve.ca)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: LogActions: Demote
> WebData:1    (Master -> Slave an-c03n02.alteeve.ca)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: LogActions: Move
> WebFS    (Started an-c03n02.alteeve.ca -> an-c03n01.alteeve.ca)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebSite    (an-c03n01.alteeve.ca)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 45: /var/lib/pacemaker/pengine/pe-input-183.bz2
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 2: cancel WebData_cancel_60000 on an-c03n01.alteeve.ca (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 47: stop WebFS_stop_0 on an-c03n02.alteeve.ca
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 67: notify WebData_pre_notify_demote_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 69: notify WebData_pre_notify_demote_0 on an-c03n02.alteeve.ca
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=111, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: run_graph: Transition 45
> (Complete=5, Pending=0, Fired=0, Skipped=15, Incomplete=10,
> Source=/var/lib/pacemaker/pengine/pe-input-183.bz2): Stopped
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: unpack_config: On loss
> of CCM Quorum: Ignore
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: LogActions: Promote
> WebData:0    (Slave -> Master an-c03n01.alteeve.ca)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: LogActions: Demote
> WebData:1    (Master -> Slave an-c03n02.alteeve.ca)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebFS    (an-c03n01.alteeve.ca)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: LogActions: Start
> WebSite    (an-c03n01.alteeve.ca)
> Jan  1 18:26:46 an-c03n01 pengine[1766]: notice: process_pe_message:
> Calculated Transition 46: /var/lib/pacemaker/pengine/pe-input-184.bz2
> Jan  1 18:26:46 an-c03n01 crmd[1767]: warning: destroy_action:
> Cancelling timer for action 2 (src=476)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 64: notify WebData_pre_notify_demote_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 66: notify WebData_pre_notify_demote_0 on an-c03n02.alteeve.ca
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=112, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 18: demote WebData_demote_0 on an-c03n02.alteeve.ca
> Jan  1 18:26:46 an-c03n01 kernel: [ 3524.517000] block drbd0: peer(
> Primary -> Secondary )
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 65: notify WebData_post_notify_demote_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 67: notify WebData_post_notify_demote_0 on an-c03n02.alteeve.ca
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=113, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 60: notify WebData_pre_notify_promote_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 62: notify WebData_pre_notify_promote_0 on an-c03n02.alteeve.ca
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=114, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 16: promote WebData_promote_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:26:46 an-c03n01 kernel: [ 3524.605607] block drbd0: role(
> Secondary -> Primary )
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_promote_0 (call=115, rc=0, cib-update=324,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 61: notify WebData_post_notify_promote_0 on an-c03n01.alteeve.ca
> (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 63: notify WebData_post_notify_promote_0 on an-c03n02.alteeve.ca
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=116, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 45: start WebFS_start_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 20: monitor WebData_monitor_60000 on an-c03n02.alteeve.ca
> Jan  1 18:26:46 an-c03n01 Filesystem(WebFS)[17211]: INFO: Running start
> for /dev/drbd0 on /var/www/html
> Jan  1 18:26:46 an-c03n01 kernel: [ 3524.688633] EXT4-fs (drbd0):
> mounted filesystem with ordered data mode. Opts: (null)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebFS_start_0 (call=117, rc=0, cib-update=325, confirmed=true) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 46: monitor WebFS_monitor_60000 on an-c03n01.alteeve.ca (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 47: start WebSite_start_0 on an-c03n01.alteeve.ca (local)
> Jan  1 18:26:46 an-c03n01 apache(WebSite)[17276]: INFO: apache already
> running (pid 16404)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebSite_start_0 (call=119, rc=0, cib-update=326,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebFS_monitor_60000 (call=118, rc=0, cib-update=327,
> confirmed=false) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: te_rsc_command: Initiating
> action 48: monitor WebSite_monitor_60000 on an-c03n01.alteeve.ca (local)
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: process_lrm_event: LRM
> operation WebSite_monitor_60000 (call=120, rc=0, cib-update=328,
> confirmed=false) ok
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: run_graph: Transition 46
> (Complete=27, Pending=0, Fired=0, Skipped=0, Incomplete=0,
> Source=/var/lib/pacemaker/pengine/pe-input-184.bz2): Complete
> Jan  1 18:26:46 an-c03n01 crmd[1767]: notice: do_state_transition: State
> transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS
> cause=C_FSA_INTERNAL origin=notify_crmd ]
>
> ==[an-c03n02 logs]===
> Jan  1 18:26:44 an-c03n02 crmd[16388]: notice: do_state_transition:
> State transition S_NOT_DC -> S_PENDING [ input=I_PENDING
> cause=C_FSA_INTERNAL origin=do_election_count_vote ]
> Jan  1 18:26:44 an-c03n02 stonith-ng[16384]: notice: unpack_config: On
> loss of CCM Quorum: Ignore
> Jan  1 18:26:44 an-c03n02 stonith-ng[16384]: notice:
> stonith_device_register: Device 'fence_n01_virsh' already existed in
> device list (2 active devices)
> Jan  1 18:26:44 an-c03n02 stonith-ng[16384]: notice:
> stonith_device_register: Device 'fence_n02_virsh' already existed in
> device list (2 active devices)
> Jan  1 18:26:45 an-c03n02 crmd[16388]: notice: do_state_transition:
> State transition S_PENDING -> S_NOT_DC [ input=I_NOT_DC
> cause=C_HA_MESSAGE origin=do_cl_join_finalize_respond ]
> Jan  1 18:26:45 an-c03n02 attrd[16386]: notice: attrd_local_callback:
> Sending full refresh (origin=crmd)
> Jan  1 18:26:45 an-c03n02 attrd[16386]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: master-WebData (10000)
> Jan  1 18:26:45 an-c03n02 attrd[16386]: notice: attrd_trigger_update:
> Sending flush op to all hosts for: probe_complete (true)
> Jan  1 18:26:46 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=112, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n02 Filesystem(WebFS)[4152]: INFO: Running stop
> for /dev/drbd0 on /var/www/html
> Jan  1 18:26:46 an-c03n02 Filesystem(WebFS)[4152]: INFO: Trying to
> unmount /var/www/html
> Jan  1 18:26:46 an-c03n02 Filesystem(WebFS)[4152]: INFO: unmounted
> /var/www/html successfully
> Jan  1 18:26:46 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebFS_stop_0 (call=111, rc=0, cib-update=73, confirmed=true) ok
> Jan  1 18:26:46 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=113, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n02 kernel: [20086.520840] block drbd0: role(
> Primary -> Secondary )
> Jan  1 18:26:46 an-c03n02 kernel: [20086.520911] block drbd0: bitmap
> WRITE of 0 pages took 0 jiffies
> Jan  1 18:26:46 an-c03n02 kernel: [20086.520914] block drbd0: 0 KB (0
> bits) marked out-of-sync by on disk bit-map.
> Jan  1 18:26:46 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_demote_0 (call=114, rc=0, cib-update=74,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=115, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=116, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n02 kernel: [20086.610152] block drbd0: peer(
> Secondary -> Primary )
> Jan  1 18:26:46 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_notify_0 (call=117, rc=0, cib-update=0,
> confirmed=true) ok
> Jan  1 18:26:46 an-c03n02 crmd[16388]: notice: process_lrm_event: LRM
> operation WebData_monitor_60000 (call=118, rc=0, cib-update=75,
> confirmed=false) ok
> =====
>
> Now WebSite is working:
> =====
> [root at an-c03n01 ~]# pcs status
> Cluster name: an-cluster-03
> Last updated: Wed Jan  1 18:28:04 2014
> Last change: Wed Jan  1 18:26:44 2014 via cibadmin on an-c03n01.alteeve.ca
> Stack: corosync
> Current DC: an-c03n01.alteeve.ca (1) - partition with quorum
> Version: 1.1.10-19.el7-368c726
> 2 Nodes configured
> 7 Resources configured
>
>
> Online: [ an-c03n01.alteeve.ca an-c03n02.alteeve.ca ]
>
> Full list of resources:
>
>   fence_n01_virsh    (stonith:fence_virsh):    Started an-c03n02.alteeve.ca
>   fence_n02_virsh    (stonith:fence_virsh):    Started an-c03n02.alteeve.ca
>   ClusterIP    (ocf::heartbeat:IPaddr2):    Started an-c03n01.alteeve.ca
>   Master/Slave Set: WebDataClone [WebData]
>       Masters: [ an-c03n01.alteeve.ca ]
>       Slaves: [ an-c03n02.alteeve.ca ]
>   WebFS    (ocf::heartbeat:Filesystem):    Started an-c03n01.alteeve.ca
>   WebSite    (ocf::heartbeat:apache):    Started an-c03n01.alteeve.ca
>
> PCSD Status:
> an-c03n01.alteeve.ca:
>    an-c03n01.alteeve.ca: Online
> an-c03n02.alteeve.ca:
>    an-c03n02.alteeve.ca: Online
>
> Daemon Status:
>    corosync: active/disabled
>    pacemaker: active/disabled
>    pcsd: active/enabled
> =====
>
> Is this a bug?
>


-- 
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without 
access to education?




More information about the Pacemaker mailing list