[Pacemaker] PSQL 9.3.4 and Pacemaker
Andrew Beekhof
andrew at beekhof.net
Fri Jun 13 01:54:50 CEST 2014
On 12 Jun 2014, at 9:51 pm, Christian Gebler <geblerchristian at googlemail.com> wrote:
> Hi all,
>
> I'm trying to configure my PSQL primitive. I have a 2 node cluster running with pacemaker 1.1.10, corosync 1.3.3 on Ubuntu Server 14.04 LTS. It is an active/passive cluster, so PostgreSQL (and all other resources) just have to run on the left or the right node.
>
> It's strange...PSQL is running fine on node1, but Pacemaker want to migrate it to the second node and I don't no why.
I see no indication that pacemaker wants to move it.
> Everytime I clear the failed Actions:
>
> Failed actions (of cource it is not installed, because of the DRBD....I want to run ALL resources on the same node):
> p_psql-server_start_0 (node=node2, call=265, rc=5, status=complete, last-rc-change=Thu Jun 12 10:55:03 2014, queued=34ms, exec=0ms ): not installed
>
> I get the following log message:
> Jun 12 12:03:50 node1 pgsql(p_psql-server)[6003]: INFO: Don't check /var/lib/pgsql/data during probe
> Jun 12 12:03:50 node1 crmd[16412]: notice: process_lrm_event: LRM operation p_psql-server_monitor_0 (call=443, rc=0, cib-update=295, confirmed=true) ok
> Jun 12 12:03:50 node1 crmd[16412]: warning: status_from_rc: Action 7 (p_psql-server_monitor_0) on node1 failed (target: 7 vs. rc: 0): Error
> Jun 12 12:03:50 node1 crmd[16412]: notice: te_rsc_command: Initiating action 6: probe_complete probe_complete on node1 (local) - no waiting
> Jun 12 12:03:50 node1 crmd[16412]: notice: run_graph: Transition 65 (Complete=3, Pending=0, Fired=0, Skipped=10, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-292.bz2): Stopped
> Jun 12 12:03:50 node1 pengine[16411]: notice: unpack_config: On loss of CCM Quorum: Ignore
> Jun 12 12:03:50 node1 pengine[16411]: notice: unpack_rsc_op: Operation monitor found resource p_drbd_r0:0 active in master mode on node1
> Jun 12 12:03:50 node1 pengine[16411]: notice: unpack_rsc_op: Preventing p_psql-server from re-starting on node2: operation start failed 'not installed' (rc=5)
> Jun 12 12:03:50 node1 pengine[16411]: warning: unpack_rsc_op: Processing failed op start for p_psql-server on node2: not installed (5)
>
> But it's started:
> Node node1 (739670273): online
> fs_postgres (ocf::heartbeat:Filesystem): Started
> p_tomcat-server (lsb:tomcat-server): Started
> p_lvm_r0 (ocf::heartbeat:LVM): Started
> p_drbd_r0 (ocf::linbit:drbd): Master
> p_ip (ocf::heartbeat:IPaddr2): Started
> fs_opt (ocf::heartbeat:Filesystem): Started
> fs_srv (ocf::heartbeat:Filesystem): Started
> p_psql-server (ocf::heartbeat:pgsql): Started
> Node snode2 (739670274): online
> p_drbd_r0 (ocf::linbit:drbd): Started
>
>
> My Config:
>
> node $id="739670273" node1 \
> attributes standby="off"
> node $id="739670274" node2 \
> attributes standby="off"
> primitive fs_opt ocf:heartbeat:Filesystem \
> params device="/dev/mapper/vgdata-opt--lv" directory="/opt" fstype="ext4" \
> meta target-role="Started"
> primitive fs_postgres ocf:heartbeat:Filesystem \
> params device="/dev/mapper/vgdata-postgres--lv" directory="/postgres" fstype="ext4" options="acl,noatime,nodiratime" \
> meta target-role="Started"
> primitive fs_srv ocf:heartbeat:Filesystem \
> params device="/dev/mapper/vgdata-srv--lv" directory="/srv" fstype="ext4" \
> meta target-role="Started"
> primitive p_drbd_r0 ocf:linbit:drbd \
> params drbd_resource="r0-lvm-drbd" \
> op monitor interval="29s" role="Master" \
> op monitor interval="31s" role="Slave"
> primitive p_ip ocf:heartbeat:IPaddr2 \
> params ip="192.168.10.150" cidr_netmask="24" nic="eth0" \
> meta target-role="Started"
> primitive p_lvm_r0 ocf:heartbeat:LVM \
> params volgrpname="vgdata" \
> meta target-role="Started"
> primitive p_psql-server ocf:heartbeat:pgsql \
> op monitor interval="30" timeout="30" \
> meta is-managed="true" target-role="Started"
> primitive p_tomcat-server lsb:tomcat-server \
> op monitor interval="30s" timeout="20s" \
> meta is-managed="true" target-role="Started"
> group g_webserver p_psql-server p_tomcat-server \
> meta is-managed="true" target-role="Started"
> ms ms_drbd_r0 p_drbd_r0 \
> meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true" target-role="Started" is-managed="true"
> colocation c_lvm_on_drbd inf: p_lvm_r0 p_ip ms_drbd_r0:Master
> order o_drbd_before_lvm inf: ms_drbd_r0:promote p_lvm_r0:start fs_srv:start fs_postgres:start fs_opt:start p_ip:start g_webserver
> property $id="cib-bootstrap-options" \
> dc-version="1.1.10-42f2063" \
> cluster-infrastructure="corosync" \
> stonith-enabled="false" \
> stonith-action="poweroff" \
> no-quorum-policy="ignore" \
> default-resource-stickiness="100" \
> last-lrm-refresh="1402567892"
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 841 bytes
Desc: Message signed with OpenPGP using GPGMail
URL: <http://oss.clusterlabs.org/pipermail/pacemaker/attachments/20140613/ce4e5e15/attachment.sig>
More information about the Pacemaker
mailing list