[Pacemaker] stonith woes (ignore my other message with this title)
Dejan Muhamedagic
dejanmm at fastmail.fm
Mon Mar 1 11:47:48 UTC 2010
Hi,
On Fri, Feb 26, 2010 at 08:45:48PM +0100, Sander van Vugt wrote:
> Hi list,
>
> My daily call for help with my stonith woes. This is a problem that I've
> been struggling with for a while and I just can't get rid of it. The
> basic configuration SLES 11 with HAE, configured for high availability
> of xen virtual machines, using clvm as the storage backend and sbd
> stonith to guarantee integrity. For those that are willing to have a
> look, the output of hb_report and supportconfig (with all the required
> documentation) are at http://www.sandervanvugt.nl/novellsupport
>
> The short situation description: I'm using sbd stonith and I've got a
> stonith request roaming around in the cluster (but not being executed).
> It gives messages like:
>
> nd1 stonithd: ... tengine requests a STONITH operation RESET on node
> nd2.
> nd1 stonithd: info: we can't manage nd2, broadcast request to other
> nodes.
Looks like sbd when retrieving the list of hosts from the device
didn't find nd2. Though in that case the stonith resource
shouldn't have started. On monitor, it takes a list of nodes from
crm_node -l and then checks if they have slots on the device.
> At the same time, all my three nodes are unclean and no STONITH is
> happening at all.
How comes that all are unclean?
> So basically I have two questions now:
> * Is there any way to get rid of the stonith action which
> shouldn't be there?
Normally, there's a good reason to fence a node. Why do you think
the requests are wrong?
> * Is there any way to get my nodes back to a clean status?
Only by fencing or restarting the cluster. Alternatively, a node
may be deleted.
Thanks,
Dejan
> Thanks in advance,
> Sander van Vugt
>
>
>
>
> _______________________________________________
> Pacemaker mailing list
> Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
More information about the Pacemaker
mailing list