[Pacemaker] Help with OCFS2 / DLM Stability

Darren.Mansell at opengi.co.uk
Tue Mar 9 11:37:02 UTC 2010


Hi everyone.

Further to some discussions a couple of weeks ago regarding OCFS2
on SLES 11 HAE, I'm looking to finally nail down this problem.

We have a 3-node cluster that has a STONITH shootout every week. This
morning one node got stuck in a state where it couldn't be fenced because
the RSA was not responding.

I'm not sure if the problem is due to:

* Network interruption causing Totem failures.
* Java (Tomcat) processes falling over.
* DLM falling over.
* Any of the above in any combination.

I've attached an hb_report. Could someone take a look and see if
anything stands out?

Thanks

Darren Mansell

-------------- next part --------------
A non-text attachment was scrubbed...
Name: hb_report.tar.bz2
Type: application/octet-stream
Size: 216521 bytes
Desc: hb_report.tar.bz2
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20100309/38a92433/attachment-0001.obj>

