[Pacemaker] the behavior of clone resource
Junko IKEDA
ikedaj at intellilink.co.jp
Tue Mar 16 13:20:16 UTC 2010
Hi,
I noticed some slightly strange clone behavior.
This is what I found:
(1) Start a group that contains three primitive resources,
and a clone set.
# crm_mon -1
============
Last updated: Tue Mar 16 21:39:10 2010
Stack: openais
Current DC: cspm01 - partition with quorum
Version: 1.0.8-a77303a7adce stable-1.0 tip
4 Nodes configured, 4 expected votes
2 Resources configured.
============
Online: [ cspm01 cspm02 cspm03 cspm04 ]
Resource Group: UMgroup01
UmDummy01 (ocf::heartbeat:Dummy): Started cspm01
UmDummy02 (ocf::heartbeat:Dummy): Started cspm01
UmDummy03 (ocf::heartbeat:Dummy): Started cspm01
Clone Set: clnUMgroup01
Started: [ cspm01 cspm04 ]
(2) Edit the Dummy RA so that the stop operation of clnUMgroup01 fails.
# vim /usr/lib/ocf/resource.d/heartbeat/Dummy01
-----------------------------------------------
dummy_stop() {
    exit $OCF_ERR_GENERIC # intentional error
    dummy_monitor
    if [ $? = $OCF_SUCCESS ]; then
        rm ${OCF_RESKEY_state}
    fi
    return $OCF_SUCCESS
}
-----------------------------------------------
(on cspm01)
# rm -f /var/run/heartbeat/rsctmp/Dummy-clnUMdummy01:0.state
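The effect of step (2) can be reproduced outside the cluster. What follows is only a minimal sketch, not the real Dummy RA: the state file is a temporary file created with mktemp, and dummy_monitor, dummy_stop and the OCF_* variables are redefined locally for illustration (the numeric values follow the usual OCF return code convention). It suggests why stop is reported with rc=1 and why monitor reports rc=7 once the state file is gone.

```shell
#!/bin/sh
# Standalone sketch; nothing here touches the cluster.
OCF_SUCCESS=0
OCF_ERR_GENERIC=1
OCF_NOT_RUNNING=7

# Hypothetical state file, standing in for the RA's real state file
OCF_RESKEY_state=$(mktemp)

dummy_monitor() {
    # "running" simply means that the state file exists
    if [ -f "$OCF_RESKEY_state" ]; then
        return $OCF_SUCCESS
    fi
    return $OCF_NOT_RUNNING
}

dummy_stop() {
    exit $OCF_ERR_GENERIC       # intentional error, as in the edited RA
    rm -f "$OCF_RESKEY_state"   # never reached
    return $OCF_SUCCESS
}

# Run stop in a subshell so that "exit" does not end this script
stop_rc=0
( dummy_stop ) || stop_rc=$?

# Same idea as the manual "rm -f" of the state file on cspm01
rm -f "$OCF_RESKEY_state"
monitor_rc=0
dummy_monitor || monitor_rc=$?

echo "stop rc=$stop_rc monitor rc=$monitor_rc"
```

Because exit is the first statement of dummy_stop, the rest of the function is never executed, so the agent always ends the stop operation with OCF_ERR_GENERIC.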
(3) Check the status of each resource.
# crm_mon -1
============
Last updated: Tue Mar 16 21:40:11 2010
Stack: openais
Current DC: cspm01 - partition with quorum
Version: 1.0.8-a77303a7adce stable-1.0 tip
4 Nodes configured, 4 expected votes
2 Resources configured.
============
Online: [ cspm01 cspm02 cspm03 cspm04 ]
Clone Set: clnUMgroup01
Resource Group: clnUmResource:0
clnUMdummy01:0 (ocf::heartbeat:Dummy01): Started cspm01 (unmanaged) FAILED
clnUMdummy02:0 (ocf::heartbeat:Dummy02): Stopped
Started: [ cspm04 ]
Failed actions:
clnUMdummy01:0_monitor_10000 (node=cspm01, call=8, rc=7, status=complete): not running
clnUMdummy01:0_stop_0 (node=cspm01, call=18, rc=1, status=complete): unknown error
UmDummy03_monitor_10000 (node=cspm01, call=16, rc=7, status=complete): not running
UmDummy01_monitor_10000 (node=cspm01, call=12, rc=7, status=complete): not running
clnUMdummy02:0_monitor_10000 (node=cspm01, call=10, rc=7, status=complete): not running
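For reference when reading the rc values in the failed actions, here is a small sketch that maps the numeric codes to the standard OCF names. ocf_rc_name and rc_from_line are hypothetical helpers written only for this mail, not part of Pacemaker or of the resource agents, and the sample line is copied from the output above.

```shell
#!/bin/sh
# Hypothetical helper: translate an OCF return code into its name.
ocf_rc_name() {
    case "$1" in
        0) echo "OCF_SUCCESS" ;;
        1) echo "OCF_ERR_GENERIC" ;;
        2) echo "OCF_ERR_ARGS" ;;
        3) echo "OCF_ERR_UNIMPLEMENTED" ;;
        4) echo "OCF_ERR_PERM" ;;
        5) echo "OCF_ERR_INSTALLED" ;;
        6) echo "OCF_ERR_CONFIGURED" ;;
        7) echo "OCF_NOT_RUNNING" ;;
        *) echo "unknown rc $1" ;;
    esac
}

# Hypothetical helper: pull the rc=N field out of a failed-action line.
rc_from_line() {
    echo "$1" | sed -n 's/.*rc=\([0-9]*\).*/\1/p'
}

line='clnUMdummy01:0_stop_0 (node=cspm01, call=18, rc=1,'
rc=$(rc_from_line "$line")
echo "rc=$rc means $(ocf_rc_name "$rc")"
```

So rc=7 on the monitor operations is OCF_NOT_RUNNING ("not running"), and rc=1 on the stop operation is OCF_ERR_GENERIC ("unknown error"), which is the error injected in step (2).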
In this three-resource case, the clone instance on cspm04 keeps running.
But when I added another resource to the group, like this:
============
Last updated: Tue Mar 16 21:53:26 2010
Stack: openais
Current DC: cspm01 - partition with quorum
Version: 1.0.8-a77303a7adce stable-1.0 tip
4 Nodes configured, 4 expected votes
2 Resources configured.
============
Online: [ cspm01 cspm02 cspm03 cspm04 ]
Resource Group: UMgroup01
UmDummy01 (ocf::heartbeat:Dummy): Started cspm01
UmDummy02 (ocf::heartbeat:Dummy): Started cspm01
UmDummy03 (ocf::heartbeat:Dummy): Started cspm01
UmDummy04 (ocf::heartbeat:Dummy): Started cspm01
Clone Set: clnUMgroup01
Started: [ cspm01 cspm04 ]
After the same error as above,
the result of crm_mon was strange:
============
Last updated: Tue Mar 16 21:54:46 2010
Stack: openais
Current DC: cspm01 - partition with quorum
Version: 1.0.8-a77303a7adce stable-1.0 tip
4 Nodes configured, 4 expected votes
2 Resources configured.
============
Online: [ cspm01 cspm02 cspm03 cspm04 ]
Clone Set: clnUMgroup01
Resource Group: clnUmResource:0
clnUMdummy01:0 (ocf::heartbeat:Dummy01): Started cspm01 (unmanaged) FAILED
clnUMdummy02:0 (ocf::heartbeat:Dummy02): Stopped
Stopped: [ clnUmResource:1 ]
Failed actions:
clnUMdummy01:0_monitor_10000 (node=cspm01, call=9, rc=7, status=complete): not running
clnUMdummy01:0_stop_0 (node=cspm01, call=21, rc=1, status=complete): unknown error
In this four-resource case, the clone instance on cspm04 was stopped.
I didn't change the rsc_colocation or order settings.
Which behavior is the expected one?
By the way, I tried to collect the logs with hb_report,
but it failed to gather ha_log.txt;
its size was 0 bytes...
Anyway, I have attached the hb_report output.
Thanks,
Junko
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Dummy_x4.tar.bz2
Type: application/octet-stream
Size: 38804 bytes
Desc: not available
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20100316/8fe892e7/attachment-0002.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Dummy_x3.tar.bz2
Type: application/octet-stream
Size: 66242 bytes
Desc: not available
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20100316/8fe892e7/attachment-0003.obj>