[Pacemaker] Pacemaker error trying to add Apache resource

Jake Smith jsmith at argotec.com
Wed Jun 26 12:07:50 EDT 2013




----- Original Message -----
> From: "Colin Blair" <cblair at technicacorp.com>
> To: "The Pacemaker cluster resource manager" <pacemaker at oss.clusterlabs.org>
> Sent: Wednesday, June 26, 2013 10:56:49 AM
> Subject: [Pacemaker] Pacemaker error trying to add Apache resource
> 
> All,
> Couldn't find a solution in the forum. Configuration info:
> 
> Ubuntu 12.04 Server
> Corosync 1.4.2 cman plugin
> Pacemaker 1.1.6
> Apache 2.2.22
> 
> I have an active/passive 2-node cluster running.
> 
> I am receiving the following error when adding the web-server
> resource:
> 
> Failed actions:
> web-server_start_0 (node=funl-pear, call37, rc=1, status=complete):
> unknown error
> web-server_start_0 (node=funl-pear2, call38, rc=1, status=complete):
> unknown error
> 

rc=1 is not an LSB-compliant response to a status check of a stopped service (that should return 3).  I assume you are using the LSB init script for Apache in your cluster?

Test the init script as indicated here:
http://clusterlabs.org/doc/en-US/Pacemaker/1.0/html/Pacemaker_Explained/ap-lsb.html
and here:
http://oss.clusterlabs.org/pipermail/pacemaker/2010-July/007008.html
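
The checks in the first link boil down to running the init script through a start/status/stop sequence and comparing exit codes. Here is a rough Python sketch of that, not a definitive test: the CHECKS table reflects my reading of the Pacemaker Explained appendix, check_script and run_action are names made up for this example, and /etc/init.d/apache2 is assumed to be the init script path on Ubuntu.

```python
import subprocess

# Expected exit codes, per the LSB appendix of Pacemaker Explained.
# Each entry: (action, expected exit code, description).
CHECKS = [
    ("start", 0, "start a stopped service"),
    ("status", 0, "status of a running service"),
    ("start", 0, "start an already running service"),
    ("stop", 0, "stop a running service"),
    ("status", 3, "status of a stopped service"),
    ("stop", 0, "stop an already stopped service"),
]


def run_action(script, action):
    """Run the init script with a single action and return its exit code."""
    return subprocess.call([script, action])


def check_script(script, runner=run_action):
    """Run every check in order; return (description, expected, actual) for each mismatch."""
    failures = []
    for action, expected, description in CHECKS:
        actual = runner(script, action)
        if actual != expected:
            failures.append((description, expected, actual))
    return failures
```

Run it as root on a node where the cluster is not managing Apache, for example print(check_script("/etc/init.d/apache2")). An empty list means every check returned the expected code.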


It could possibly be related to this bug too, but I doubt it:
https://bugs.launchpad.net/ubuntu/+source/apache2/+bug/1018171

> -My configfile=/etc/apache2/apache2.conf
> -My server-status is allowed from all and is tested to work.
> -There are no errors in the apache log.
> 
> Event from the corosync.log:
> 
> Jun 26 10:13:28 funl-pear crmd: [4576]: info: update_dc: Unset DC
> funl-pear2
> Jun 26 10:13:28 funl-pear crmd: [4576]: info: do_state_transition:
> State transition S_NOT_DC -> S_PENDING [ input=I_PENDING
> cause=C_FSA_INTERNAL origin=do_election_count_vote ]
> Jun 26 10:13:28 funl-pear crmd: [4576]: info: update_dc: Set DC to
> funl-pear2 (3.0.5)
> Jun 26 10:13:28 funl-pear crmd: [4576]: info: do_state_transition:
> State transition S_PENDING -> S_NOT_DC [ input=I_NOT_DC
> cause=C_HA_MESSAGE origin=do_cl_join_finalize_respond ]
> Jun 26 10:13:28 funl-pear crmd: [4576]: info: do_lrm_rsc_op:
> Performing key=5:633:7:0209fb9d-82d7-488c-9eec-3a2070d4f4b2
> op=web-server_monitor_0 )
> Jun 26 10:13:28 funl-pear lrmd: [4572]: info: rsc:web-server
> probe[29] (pid 30943)
> Jun 26 10:13:28 funl-pear lrmd: [4572]: info: operation monitor[29]
> on web-server for client 4576: pid 30943 exited with return code 1
> THIS LOOKS FISHY***

Yes, it is.
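
If the resource is actually the OCF apache agent (the RA output path further down in your log, /usr/lib/ocf/resource.d//heartbeat/apache, suggests it might be), then rc=1 is OCF_ERR_GENERIC, whereas a probe of a cleanly stopped resource should return 7 (OCF_NOT_RUNNING). To pull every non-zero operation out of a long corosync.log, something like the sketch below may help. It assumes lrmd lines in the format you pasted; failed_operations and OP_RE are hypothetical names for this example only.

```python
import re

# OCF resource agent exit codes and their standard names.
OCF_CODES = {
    0: "OCF_SUCCESS",
    1: "OCF_ERR_GENERIC",
    2: "OCF_ERR_ARGS",
    3: "OCF_ERR_UNIMPLEMENTED",
    4: "OCF_ERR_PERM",
    5: "OCF_ERR_INSTALLED",
    6: "OCF_ERR_CONFIGURED",
    7: "OCF_NOT_RUNNING",
}

# Matches lrmd lines such as:
#   operation monitor[29] on web-server for client 4576: pid 30943 exited with return code 1
OP_RE = re.compile(
    r"operation\s+(\w+)\[(\d+)\]\s+on\s+(\S+)\s+for\s+client\s+\d+:\s+"
    r"pid\s+\d+\s+exited\s+with\s+return\s+code\s+(\d+)"
)


def failed_operations(log_text):
    """Return (operation, call id, resource, rc, code name) for each non-zero exit."""
    # Undo mail-client wrapping: drop leading "> " quote markers and join the lines.
    flat = " ".join(line.lstrip("> ").strip() for line in log_text.splitlines())
    results = []
    for op, call, rsc, rc in OP_RE.findall(flat):
        rc = int(rc)
        if rc != 0:
            results.append((op, int(call), rsc, rc, OCF_CODES.get(rc, "unknown")))
    return results
```

Feed it the contents of the log file as a string. It only reports operations that exited non-zero, so the successful stop operations are skipped.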

> Jun 26 10:13:28 funl-pear crmd: [4576]: info: process_lrm_event: LRM
> operation web-server_monitor_0 (call=29, rc=1, cib-update=113,
> confirmed=true) unknown error
> 
> Jun 26 10:13:28 funl-pear crmd: [4576]: info: do_lrm_rsc_op:
> Performing key=2:634:0:0209fb9d-82d7-488c-9eec-3a2070d4f4b2
> op=web-server_stop_0 )
> Jun 26 10:13:28 funl-pear lrmd: [4572]: info: rsc:web-server stop[30]
> (pid 31010)
> Jun 26 10:13:29 funl-pear lrmd: [4572]: info: RA output:
> (web-server:stop:stderr) /usr/lib/ocf/resource.d//heartbeat/apache:
> 442: kill: No such process

This looks odd too: not only that the process is not running, but also the double // in the path.
But figure out the monitor problem first, then re-evaluate.

Your cluster config would also help.

HTH

Jake

> Jun 26 10:13:29 funl-pear lrmd: [4572]: info: operation stop[30] on
> web-server for client 4576: pid 31010 exited with return code 0
> Jun 26 10:13:29 funl-pear crmd: [4576]: info: process_lrm_event: LRM
> operation web-server_stop_0 (call=30, rc=0, cib-update=114,
> confirmed=true) ok
> Jun 26 10:13:33 funl-pear crmd: [4576]: info: do_lrm_rsc_op:
> Performing key=9:636:0:0209fb9d-82d7-488c-9eec-3a2070d4f4b2
> op=web-server_start_0 )
> Jun 26 10:13:33 funl-pear lrmd: [4572]: info: rsc:web-server
> start[31] (pid 31110)
> Jun 26 10:13:36 funl-pear lrmd: [4572]: info: RA output:
> (web-server:start:stderr) /usr/lib/ocf/resource.d//heartbeat/apache:
> 442: kill: No such process
> Jun 26 10:13:36 funl-pear lrmd: [4572]: info: operation start[31] on
> web-server for client 4576: pid 31110 exited with return code 1
> Jun 26 10:13:36 funl-pear crmd: [4576]: info: process_lrm_event: LRM
> operation web-server_start_0 (call=31, rc=1, cib-update=115,
> confirmed=true) unknown error
> Jun 26 10:13:36 funl-pear crmd: [4576]: info: do_lrm_rsc_op:
> Performing key=2:638:0:0209fb9d-82d7-488c-9eec-3a2070d4f4b2
> op=web-server_stop_0 )
> Jun 26 10:13:36 funl-pear lrmd: [4572]: info: rsc:web-server stop[32]
> (pid 31272)
> Jun 26 10:13:36 funl-pear lrmd: [4572]: info: operation stop[32] on
> web-server for client 4576: pid 31272 exited with return code 0
> Jun 26 10:13:36 funl-pear crmd: [4576]: info: process_lrm_event: LRM
> operation web-server_stop_0 (call=32, rc=0, cib-update=116,
> confirmed=true) ok
> 
> Any ideas?
> R,
> CB
> 



