[Pacemaker] Master won't get promoted

Charles Richard chachi.richard at gmail.com
Mon Sep 26 08:28:33 EDT 2011


Hi,

I'm making some headway finally with my pacemaker install but now that
crm_mon doesn't return errors any more and crm_verify is clear, I'm having a
problem where my master won't get promoted.  Not sure what to do with this
one, any suggestions?   Here's the log snippet and config files:

Sep 26 04:06:12 staging1 crmd: [1686]: info: crm_timer_popped: PEngine
Recheck Timer (I_PE_CALC) just popped!
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: State
transition S_IDLE -> S_POLICY_ENGINE [ input=I_PE_CALC cause=C_TIMER_POPPED
origin=crm_timer_popped ]
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: Progressed
to state S_POLICY_ENGINE after C_TIMER_POPPED
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: All 2
cluster nodes are eligible to run resources.
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_pe_invoke: Query 106:
Requesting the current CIB: S_POLICY_ENGINE
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_pe_invoke_callback: Invoking
the PE: query=106, ref=pe_calc-dc-1317020772-95, seq=2564, quorate=1
Sep 26 04:06:12 staging1 pengine: [1685]: info: unpack_config: Startup
probes: enabled
Sep 26 04:06:12 staging1 pengine: [1685]: notice: unpack_config: On loss of
CCM Quorum: Ignore
Sep 26 04:06:12 staging1 pengine: [1685]: info: unpack_config: Node scores:
'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Sep 26 04:06:12 staging1 pengine: [1685]: info: unpack_domains: Unpacking
domains
Sep 26 04:06:12 staging1 pengine: [1685]: info: determine_online_status:
Node staging1.dev.applepeak.com is online
Sep 26 04:06:12 staging1 pengine: [1685]: info: determine_online_status:
Node staging2.dev.applepeak.com is online
Sep 26 04:06:12 staging1 pengine: [1685]: notice: group_print:  Resource
Group: mysql
Sep 26 04:06:12 staging1 pengine: [1685]: notice: native_print:
fs_mysql#011(ocf::heartbeat:Filesystem):#011Stopped
Sep 26 04:06:12 staging1 pengine: [1685]: notice: native_print:
ip_mysql#011(ocf::heartbeat:IPaddr2):#011Stopped
Sep 26 04:06:12 staging1 pengine: [1685]: notice: native_print:
mysqld#011(lsb:mysqld):#011Stopped
Sep 26 04:06:12 staging1 pengine: [1685]: notice: clone_print:  Master/Slave
Set: ms_drbd_mysql
Sep 26 04:06:12 staging1 pengine: [1685]: notice: short_print:      Stopped:
[ drbd_mysql:0 drbd_mysql:1 ]
Sep 26 04:06:12 staging1 pengine: [1685]: info: master_color: ms_drbd_mysql:
Promoted 0 instances of a possible 1 to master
Sep 26 04:06:12 staging1 pengine: [1685]: info: native_merge_weights:
fs_mysql: Rolling back scores from ip_mysql
Sep 26 04:06:12 staging1 pengine: [1685]: info: native_merge_weights:
ip_mysql: Rolling back scores from mysqld
Sep 26 04:06:12 staging1 pengine: [1685]: info: master_color: ms_drbd_mysql:
Promoted 0 instances of a possible 1 to master
Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource
fs_mysql#011(Stopped)
Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource
ip_mysql#011(Stopped)
Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource
mysqld#011(Stopped)
Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource
drbd_mysql:0#011(Stopped)
Sep 26 04:06:12 staging1 pengine: [1685]: notice: LogActions: Leave resource
drbd_mysql:1#011(Stopped)
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: State
transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS
cause=C_IPC_MESSAGE origin=handle_response ]
Sep 26 04:06:12 staging1 crmd: [1686]: info: unpack_graph: Unpacked
transition 72: 0 actions in 0 synapses
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_te_invoke: Processing graph
72 (ref=pe_calc-dc-1317020772-95) derived from
/var/lib/pengine/pe-input-3802.bz2
Sep 26 04:06:12 staging1 crmd: [1686]: info: run_graph:
====================================================
Sep 26 04:06:12 staging1 crmd: [1686]: notice: run_graph: Transition 72
(Complete=0, Pending=0, Fired=0, Skipped=0, Incomplete=0,
Source=/var/lib/pengine/pe-input-3802.bz2): Complete
Sep 26 04:06:12 staging1 crmd: [1686]: info: te_graph_trigger: Transition 72
is now complete
Sep 26 04:06:12 staging1 crmd: [1686]: info: notify_crmd: Transition 72
status: done - <null>
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: State
transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS
cause=C_FSA_INTERNAL origin=notify_crmd ]
Sep 26 04:06:12 staging1 crmd: [1686]: info: do_state_transition: Starting
PEngine Recheck Timer
Sep 26 04:06:12 staging1 pengine: [1685]: info: process_pe_message:
Transition 72: PEngine Input stored in: /var/lib/pengine/pe-input-3802.bz2
Sep 26 04:15:09 staging1 cib: [1682]: info: cib_stats: Processed 1
operations (0.00us average, 0% utilization) in the last 10min

My drbd config file:

resource mysqld {

protocol C;

startup { wfc-timeout 0; degr-wfc-timeout 120; }

disk { on-io-error detach; }


on staging1 {

device /dev/drbd0;

disk /dev/vg_staging1/lv_data;

meta-disk internal;

address 10.10.20.1:7788;

}

on staging2 {

device /dev/drbd0;

disk /dev/vg_staging2/lv_data;

meta-disk internal;

address 10.10.20.2:7788;

}

}

corosync.conf:

compatibility: whitetank

aisexec {
  user: root
  group: root
}

totem {
        version: 2
        secauth: off
        threads: 0
        interface {
                ringnumber: 0
                bindnetaddr: 10.10.10.0
                mcastaddr: 226.94.1.1
                mcastport: 5405
        }
}

logging {
        fileline: off
        to_stderr: no
        to_logfile: no
        to_syslog: yes
        logfile: /var/log/cluster/corosync.log
        debug: off
        timestamp: on
        logger_subsys {
                subsys: AMF
                debug: off
        }
}

amf {
        mode: disabled
}

service {
#Load Pacemaker
name: pacemaker
ver: 0
use_mgmtd: yes
}

And my crm config:

node staging1.dev.applepeak.com
node staging2.dev.applepeak.com
primitive drbd_mysql ocf:linbit:drbd \
        params drbd_resource="mysqld" \
        op monitor interval="15s" \
        op start interval="0" timeout="240s" \
        op stop interval="0" timeout="100s"
primitive fs_mysql ocf:heartbeat:Filesystem \
        params device="/dev/drbd0" directory="/opt/data/mysql/data/mysql"
fstype="ext4" \
        op start interval="0" timeout="60s" \
        op stop interval="0" timeout="60s"
primitive ip_mysql ocf:heartbeat:IPaddr2 \
        params ip="10.10.10.31" nic="eth0"
primitive mysqld lsb:mysqld
group mysql fs_mysql ip_mysql mysqld
ms ms_drbd_mysql drbd_mysql \
        meta master-max="1" master-node-max="1" clone-max="2"
clone-node-max="1" notify="true"
colocation mysql_on_drbd inf: mysql ms_drbd_mysql:Master
order mysql_after_drbd inf: ms_drbd_mysql:promote mysql:start
property $id="cib-bootstrap-options" \
        dc-version="1.1.2-f059ec7ced7a86f18e5490b67ebf4a0b963bccfe" \
        cluster-infrastructure="openais" \
        expected-quorum-votes="2" \
        stonith-enabled="false" \
        last-lrm-refresh="1316961847" \
        stop-all-resources="true" \
        no-quorum-policy="ignore"
rsc_defaults $id="rsc-options" \
        resource-stickiness="100"

Thanks,
Charles
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20110926/d86deb24/attachment-0002.html>


More information about the Pacemaker mailing list