[Pacemaker] Resource monitor attempting to run on 'other' node.

Andrew Beekhof andrew at beekhof.net
Fri Apr 1 08:12:29 UTC 2011


On Tue, Mar 29, 2011 at 6:43 PM, David Coulson <david at davidcoulson.net> wrote:
> Pretty simple configuration - Two nodes running cman backed pacemaker. I
> have three resources which are group together to support an application.
> Filesystem, IP, and the app itself. My app is currently misconfiguration, so
> I expect it to blow up when it tries to start.
>
> In crm_mon, I have a consistent monitor failure for the failed service on
> the other node. If it was trying to move the group over to the other node, I
> would expect it to switch the filesystem mount over to the other host before
> trying to monitor, but it doesn't appear to (otherwise it would not give me
> the 'not installed' error). Is this the expected behavior?

Yes.  We're not trying to move it, we're trying to verify that its not
_already_ running there before we start it somewhere.

>
> Configuration and crm_mon output is below.
>
> primitive ip_openfire ocf:heartbeat:IPaddr \
>        params ip="10.250.53.80" cidr_netmask="23" \
>        op monitor interval="30s"
> primitive lv_openfire ocf:heartbeat:Filesystem \
>        params fstype="gfs2" device="/dev/vg_gfs00/openfire"
> directory="/opt/openfire" \
>        op monitor interval="10" timeout="5" on-fail="stop"
> primitive svc_openfire lsb:openfire \
>        op monitor interval="30" timeout="5s" \
>        op start interval="30" timeout="30s" \
>        op stop interval="30" timeout="30s"
> group openfire lv_openfire ip_openfire svc_openfire
> order openfire_svc_lv inf: lv_openfire svc_openfire
>
>
> Relevant output from crm_mon -1
>
> # crm_mon -1A
> ============
> Last updated: Tue Mar 29 12:42:43 2011
> Stack: cman
> Current DC: rhesproddns01 - partition with quorum
> Version: 1.1.2-f059ec7ced7a86f18e5490b67ebf4a0b963bccfe
> 2 Nodes configured, 2 expected votes
> 7 Resources configured.
> ============
>
> Online: [ rhesproddns01 rhesproddns02 ]
>
>  Resource Group: openfire
>     lv_openfire        (ocf::heartbeat:Filesystem):    Started rhesproddns02
>     ip_openfire        (ocf::heartbeat:IPaddr):        Started rhesproddns02
>     svc_openfire       (lsb:openfire): Stopped
>
> Failed actions:
>    svc_openfire_monitor_0 (node=rhesproddns01, call=66, rc=5,
> status=complete): not installed
>
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs:
> http://developerbugs.linux-foundation.org/enter_bug.cgi?product=Pacemaker
>




More information about the Pacemaker mailing list