Hello,<br><br>please found attach to this mail the corosync logs.<br>If you have any tips :)<br><br><br><br>Regards,<br><br>Hugo<br><br><div class="gmail_quote">On 8 February 2012 15:39, Florian Haas <span dir="ltr">&lt;<a href="mailto:florian@hastexo.com" target="_blank">florian@hastexo.com</a>&gt;</span> wrote:<br>


<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div>On Wed, Feb 8, 2012 at 2:29 PM, Hugo Deprez &lt;<a href="mailto:hugo.deprez@gmail.com" target="_blank">hugo.deprez@gmail.com</a>&gt; wrote:<br>


&gt; Dear community,<br>

&gt;<br>

&gt; I am currently running different corosync / drbd cluster using VM running on<br>

&gt; vmware esxi host.<br>

&gt; Guest Os are Debian Squeeze.<br>

&gt;<br>

&gt; the active member of the cluster just freeze the VM was unreachable.<br>

&gt; But the resources didn&#39;t achieved to move to the other node.<br>

&gt;<br>

&gt; My cluster has the following ressources :<br>

&gt;<br>

&gt; Resource Group: grp<br>

&gt;      fs-data    (ocf::heartbeat:Filesystem):<br>

&gt;      nagios-ip  (ocf::heartbeat:IPaddr2):<br>

&gt;      apache2    (ocf::heartbeat:apache):<br>

&gt;      nagios     (lsb:nagios3):<br>

&gt;      pnp        (lsb:npcd):<br>

&gt;<br>

&gt;<br>

&gt; I am currently troubleshooting this issue. I don&#39;t really know where to<br>

&gt; look. Of course I had a look at the logs, but it is pretty hard for me to<br>

&gt; understand what happen.<br>

<br>

</div>It&#39;s pretty hard for anyone else to understand _without_ logs. :)<br>

<div><br>

&gt; I noticed that the VM crash at 12:09 and that the cluster only try to move<br>

&gt; the ressources at  12:58, this does not make sens for me. Or maybe the host<br>

&gt; wasn&#39;t totaly down ?<br>

&gt;<br>

&gt; Do you have any idea how I can troubleshoot ?<br>

<br>

</div>Log analysis is where I would start.<br>

<div><br>

&gt; Last thing, I notice that If I start apache2 on the slave server, corosync<br>

&gt; didn&#39;t detect that the resource is started, could that be an issue ?<br>

<br>

</div>Sure it could, but Pacemaker should happily recover from that.<br>

<br>

Cheers,<br>

Florian<br>

<span><font color="#888888"><br>

--<br>

Need help with High Availability?<br>

<a href="http://www.hastexo.com/now" target="_blank">http://www.hastexo.com/now</a><br>

<br>

_______________________________________________<br>

Pacemaker mailing list: <a href="mailto:Pacemaker@oss.clusterlabs.org" target="_blank">Pacemaker@oss.clusterlabs.org</a><br>

<a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br>

<br>

Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br>

Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>

Bugs: <a href="http://bugs.clusterlabs.org" target="_blank">http://bugs.clusterlabs.org</a><br>

</font></span></blockquote></div><br>