[Pacemaker] why does pacemaker migrate a vm by stopping and starting instead of migrating action?

bin chen free2coder at gmail.com
Wed Dec 19 02:11:02 EST 2012


Hi,all
        My cluster is pacemaker 1.1.7 + corosync 2.0. I have write a
resource agent to manage the virtual machine.The RA supports
start,stop,migrate_from,migrate_to,monitor.
        But when I try to migrate a running cluster vm(resource name is
dcbh6f1c-GtNhnB-8597) from the host(h66)  to a target host (h65),the
cluster just stops it in host A,and then start it in host B.
        I check the pacemaker log and find that ,it just don`t execute the
migrate action!
        My command is crm resource migrate dcbh6f1c-GtNhnB-8597 h65.
--------------------------------- pcmk log in host h66(source)
------------------------------------------------------
2012-12-19T14:36:27.687027+08:00 h66 cib[2744]:     info:
cib_process_request: Operation complete: op cib_delete for section
constraints (origin=h65/crm_resource/3, version=0.438.2): ok (rc=0)
2012-12-19T14:36:27.696603+08:00 h66 crmd[2749]:     info:
abort_transition_graph: te_update_diff:126 - Triggered transition abort
(complete=1, tag=diff, id=(null), magic=NA, cib=0.439.1) : Non-status change
2012-12-19T14:36:27.696915+08:00 h66 crmd[2749]:   notice:
do_state_transition: State transition S_IDLE -> S_POLICY_ENGINE [
input=I_PE_CALC cause=C_FSA_INTERNAL origin=abort_transition_graph ]
2012-12-19T14:36:27.697075+08:00 h66 cib[2744]:     info: cib:diff: - <cib
admin_epoch="0" epoch="438" num_updates="2" />
2012-12-19T14:36:27.697122+08:00 h66 cib[2744]:     info: cib:diff: + <cib
epoch="439" num_updates="1" admin_epoch="0" validate-with="pacemaker-1.2"
crm_feature_set="3.0.6" update-origin="h66" update-client="crm_resource"
cib-last-written="Wed Dec 19 11:43:42 2012" have-quorum="1"
dc-uuid="1107364362" >
2012-12-19T14:36:27.697158+08:00 h66 cib[2744]:     info: cib:diff: +
<configuration >
2012-12-19T14:36:27.697192+08:00 h66 cib[2744]:     info: cib:diff: +
<constraints >
2012-12-19T14:36:27.697229+08:00 h66 cib[2744]:     info: cib:diff: +
<rsc_location id="cli-prefer-dcbh6f1c-GtNhnB-8597"
rsc="dcbh6f1c-GtNhnB-8597" __crm_diff_marker__="added:top" >
2012-12-19T14:36:27.697330+08:00 h66 cib[2744]:     info: cib:diff: +
  <rule id="cli-prefer-rule-dcbh6f1c-GtNhnB-8597" score="INFINITY"
boolean-op="and" >
2012-12-19T14:36:27.697368+08:00 h66 cib[2744]:     info: cib:diff: +
    <expression id="cli-prefer-expr-dcbh6f1c-GtNhnB-8597"
attribute="#uname" operation="eq" value="h65" type="string" />
2012-12-19T14:36:27.697409+08:00 h66 cib[2744]:     info: cib:diff: +
  </rule>
2012-12-19T14:36:27.697442+08:00 h66 cib[2744]:     info: cib:diff: +
</rsc_location>
2012-12-19T14:36:27.697477+08:00 h66 cib[2744]:     info: cib:diff: +
</constraints>
2012-12-19T14:36:27.697511+08:00 h66 cib[2744]:     info: cib:diff: +
</configuration>
2012-12-19T14:36:27.697781+08:00 h66 cib[2744]:     info: cib:diff: + </cib>
2012-12-19T14:36:27.697854+08:00 h66 cib[2744]:     info:
cib_process_request: Operation complete: op cib_modify for section
constraints (origin=h65/crm_resource/4, version=0.439.1): ok (rc=0)
2012-12-19T14:36:27.702575+08:00 h66 pengine[2748]:     info:
unpack_config: Startup probes: enabled
2012-12-19T14:36:27.702640+08:00 h66 pengine[2748]:     info:
unpack_config: Node scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
2012-12-19T14:36:27.702645+08:00 h66 pengine[2748]:     info:
unpack_domains: Unpacking domains
2012-12-19T14:36:27.702649+08:00 h66 pengine[2748]:     info:
determine_online_status: Node h66 is online
2012-12-19T14:36:27.702654+08:00 h66 pengine[2748]:     info:
determine_online_status: Node h65 is online
2012-12-19T14:36:27.702827+08:00 h66 pengine[2748]:   notice:
unpack_rsc_op: Operation monitor found resource dcbh6f1c-GtNhnB-8597 active
on h66
2012-12-19T14:36:27.702840+08:00 h66 pengine[2748]:     info: clone_print:
 Clone Set: fence-clone [fence]
2012-12-19T14:36:27.703147+08:00 h66 pengine[2748]:     info: short_print:
     Started: [ h66 h65 ]
2012-12-19T14:36:27.703151+08:00 h66 pengine[2748]:     info: native_print:
ip#011(ocf::heartbeat:IPaddr):#011Stopped
2012-12-19T14:36:27.703155+08:00 h66 pengine[2748]:     info: native_print:
dcbh6f1c-GtNhnB-8597#011(ocf::heartbeat:FronVm):#011Started h66
2012-12-19T14:36:27.705574+08:00 h66 pengine[2748]:     info: native_color:
Resource ip cannot run anywhere
2012-12-19T14:36:27.706136+08:00 h66 pengine[2748]:     info: LogActions:
Leave   fence:0#011(Started h66)
2012-12-19T14:36:27.706141+08:00 h66 pengine[2748]:     info: LogActions:
Leave   fence:1#011(Started h65)
2012-12-19T14:36:27.706146+08:00 h66 pengine[2748]:     info: LogActions:
Leave   ip#011(Stopped)
2012-12-19T14:36:27.706153+08:00 h66 pengine[2748]:   notice: LogActions:
Move    dcbh6f1c-GtNhnB-8597#011(Started h66 -> h65)
2012-12-19T14:36:27.706520+08:00 h66 crmd[2749]:   notice:
do_state_transition: State transition S_POLICY_ENGINE ->
S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE
origin=handle_response ]
2012-12-19T14:36:27.706675+08:00 h66 crmd[2749]:     info: do_te_invoke:
Processing graph 56 (ref=pe_calc-dc-1355898987-152) derived from
/var/lib/pengine/pe-input-144.bz2
2012-12-19T14:36:27.706680+08:00 h66 crmd[2749]:     info: te_rsc_command:
Initiating action 15: stop dcbh6f1c-GtNhnB-8597_stop_0 on h66 (local)
2012-12-19T14:36:27.707855+08:00 h66 lrmd: [2746]: info:
rsc:dcbh6f1c-GtNhnB-8597:25: stop
2012-12-19T14:36:27.746224+08:00 h66 pengine[2748]:   notice:
process_pe_message: Transition 56: PEngine Input stored in:
/var/lib/pengine/pe-input-144.bz2
2012-12-19T14:36:34.054989+08:00 h66 lrmd: [2746]: info: Managed
dcbh6f1c-GtNhnB-8597:stop process 20868 exited with return code 0.
2012-12-19T14:36:34.055752+08:00 h66 crmd[2749]:     info:
process_lrm_event: LRM operation dcbh6f1c-GtNhnB-8597_stop_0 (call=25,
rc=0, cib-update=194, confirmed=true) ok
2012-12-19T14:36:34.058078+08:00 h66 crmd[2749]:     info: te_rsc_command:
Initiating action 16: start dcbh6f1c-GtNhnB-8597_start_0 on h65
2012-12-19T14:36:40.367183+08:00 h66 crmd[2749]:   notice: run_graph: ====
Transition 56 (Complete=3, Pending=0, Fired=0, Skipped=0, Incomplete=0,
Source=/var/lib/pengine/pe-input-144.bz2): Complete
2012-12-19T14:36:40.367230+08:00 h66 crmd[2749]:   notice:
do_state_transition: State transition S_TRANSITION_ENGINE -> S_IDLE [
input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd ]
2012-12-19T14:36:40.619226+08:00 h66 Fronqd[2728]: Fronqd: 'ping
192.168.1.1 -W 1 -c 1' missed (1/3)
2012-12-19T14:39:07.654122+08:00 h66 Fronqd[2728]: Fronqd: 'ping
192.168.1.1 -W 1 -c 1' missed (1/3)
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

------------------------------------------------pacemaker log in host
B(target
host)----------------------------------------------------------------------------------
[[3;5~2012-12-19T14:25:52.602285+08:00 h65 lrmd: [2466]: info:
rsc:dcbh6f1c-GtNhnB-8597:28: start
2012-12-19T14:25:58.875182+08:00 h65 lrmd: [2466]: info: Managed
dcbh6f1c-GtNhnB-8597:start process 14036 exited with return code 0.
2012-12-19T14:25:58.903940+08:00 h65 crmd[2469]:     info:
process_lrm_event: LRM operation dcbh6f1c-GtNhnB-8597_start_0 (call=28,
rc=0, cib-update=48, confirmed=true) ok
2012-12-19T14:27:46.356118+08:00 h65 Fronqd[2450]: Fronqd: 'ping
192.168.1.1 -W 1 -c 1' missed (1/3)
2012-12-19T14:28:43.407999+08:00 h65 Fronqd[2450]: Fronqd: 'ping
192.168.1.1 -W 1 -c 1' missed (1/3)
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------

Thanks
Cherish
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.clusterlabs.org/pipermail/pacemaker/attachments/20121219/412ef8c0/attachment-0002.html>


More information about the Pacemaker mailing list