[Pacemaker] [BUG] Clone + group = orphan(s) ?
Thomas Guthmann
tguthmann at iseek.com.au
Thu Jan 7 01:31:11 UTC 2010
Re,
> I've just had 2 orphans (tom-DNS:2,tom-DNS:3) for a clone containing 2
> groups (tom-DNS:0, tom-DNS:1). Anyway the situation is far better than
> before, no more craziness during cleanups. I will dig and test more
> tomorrow and give you an update to see if I can reproduce the issue.
1. I was wondering if my always 2 orphans and no more (displayed by
hb_gui) that appears sometimes after the clone creation are linked to
the fact that I have 4 nodes in the cluster. Indeed we are using an
asymmetrical cluster and only 2 nodes can be a DNS servers so the 2
others can't. I was wondering if the 2 "orphans" could be the 2 other
ones. I found that in the DC logs :
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: unpack_config: Node
scores: 'red' = -INFINITY, 'yellow' = 0, 'green' = 0
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: determine_online_status:
Node tom-lb2.clone.wars.com.au is online
Jan 7 09:38:19 tom-dns1 pengine: [3227]: WARN: process_orphan_resource:
Nothing known about resource tom-lo-vip-ns1 running on
tom-lb2.clone.wars.com.au
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: log_data_element:
create_fake_resource: Orphan resource <primitive id="tom-lo-vip-ns1"
type="IPaddr" class="ocf" provider="heartbeat" />
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: process_orphan_resource:
Making sure orphan tom-lo-vip-ns1 is stopped
Jan 7 09:38:19 tom-dns1 pengine: [3227]: WARN: process_orphan_resource:
Nothing known about resource tom-lo-vip-ns2 running on
tom-lb2.clone.wars.com.au
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: log_data_element:
create_fake_resource: Orphan resource <primitive id="tom-lo-vip-ns2"
type="IPaddr" class="ocf" provider="heartbeat" />
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: process_orphan_resource:
Making sure orphan tom-lo-vip-ns2 is stopped
Jan 7 09:38:19 tom-dns1 pengine: [3227]: WARN: process_orphan_resource:
Nothing known about resource tom-lo-vip-dns1 running on
tom-lb2.clone.wars.com.au
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: log_data_element:
create_fake_resource: Orphan resource <primitive id="tom-lo-vip-dns1"
type="IPaddr" class="ocf" provider="heartbeat" />
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: process_orphan_resource:
Making sure orphan tom-lo-vip-dns1 is stopped
Jan 7 09:38:19 tom-dns1 pengine: [3227]: WARN: process_orphan_resource:
Nothing known about resource tom-lo-vip-dns2 running on
tom-lb2.clone.wars.com.au
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: log_data_element:
create_fake_resource: Orphan resource <primitive id="tom-lo-vip-dns2"
type="IPaddr" class="ocf" provider="heartbeat" />
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: process_orphan_resource:
Making sure orphan tom-lo-vip-dns2 is stopped
Jan 7 09:38:19 tom-dns1 pengine: [3227]: notice: unpack_rsc_op: Hard
error - tom-named:0_monitor_0 failed with rc=5: Preventing tom-DNS-clone
from re-starting on tom-lb2.clone.wars.com.au
Jan 7 09:38:19 tom-dns1 pengine: [3227]: WARN: process_orphan_resource:
Nothing known about resource tom-named running on tom-lb2.clone.wars.com.au
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: log_data_element:
create_fake_resource: Orphan resource <primitive id="tom-named"
type="named" class="ocf" provider="iseek" />
Jan 7 09:38:19 tom-dns1 pengine: [3227]: info: process_orphan_resource:
Making sure orphan tom-named is stopped
Is it possible that they are the one displayed by hb_gui ? Is there a
text way to see that ? crm_mon -1 doesn't display them and show me only
that. So do these orphans really exists ? Do you need the CIB to have a
look ?
Clone Set: tom-DNS-clone
Started: [ tom-dns2.clone.wars.com.au tom-dns1.clone.wars.com.au ]
2. Is it a normal behaviour to still have these when you have just
deleted a clone ? Extract from the HTML output of crm_mon (see 3. as well)
(Partially) Inactive Resources
tom-lo-vip-ns2:0 (ocf::heartbeat:IPaddr): ORPHANED Stopped
tom-lo-vip-dns1:0 (ocf::heartbeat:IPaddr): ORPHANED Stopped
tom-named:0 (ocf::iseek:named): ORPHANED Stopped
tom-lo-vip-dns2:0 (ocf::heartbeat:IPaddr): ORPHANED Stopped
tom-lo-vip-ns1:0 (ocf::heartbeat:IPaddr): ORPHANED Stopped
tom-named:1 (ocf::iseek:named): ORPHANED Stopped
tom-lo-vip-dns2:1 (ocf::heartbeat:IPaddr): ORPHANED Stopped
tom-lo-vip-ns1:1 (ocf::heartbeat:IPaddr): ORPHANED Stopped
tom-lo-vip-ns2:1 (ocf::heartbeat:IPaddr): ORPHANED Stopped
tom-lo-vip-dns1:1 (ocf::heartbeat:IPaddr): ORPHANED Stopped
3. Weird issue, definitely a small bug, "crm_mon -r -n -1" and "crm_mon
-r -n -h /tmp/tom.html" doesn't exactly give the same results. In the
HTML version I can see the orphans (written in red) and in console mode
I have nothing except the title 'Inactive resources:' (note it's a
different title than the HTML version, there is no 'partially' word -
see 2. above for the HTML version)
Sorry guys to bother you with these issues but we'd like to master and
to understand pacemaker well before going in production. It's a nice
product but need a bit of practice to ride it :)
Cheers,
Thomas
More information about the Pacemaker
mailing list