<div dir="ltr"><div><div><div><div>Hi Andrew, <br></div>Checked the logs and I felt OCFS2 taking time to recover, can anyone please verify my log and confirm if I&#39;m correct.<br><br></div>And if OCFS2 is the reason for delay in failover may I know a way to reduce that delay caused.<br>

<br></div>Attached is my syslog and pacemaker configuration<br><br></div>Looking forward for a solution <br></div><div class="gmail_extra"><br><br><div class="gmail_quote">On Fri, Jun 13, 2014 at 8:55 AM, kamal kishi <span dir="ltr">&lt;<a href="mailto:kamal.kishi@gmail.com" target="_blank">kamal.kishi@gmail.com</a>&gt;</span> wrote:<br>

<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div><div><div><div><div>Fine Andrew, will check it out but does the timeouts provided for pacemaker affect this??<br>

</div>Which part of the time configuration will be considered by pacemaker to decide if the other node is actually down and the resources should be taken over by it.<br>

<br></div>And Alexis, I&#39;m not facing any issue while putting node to standby mode.<br></div>I&#39;m using DRBD 8.3.11 (apt-get install drbd8-utils=2:8.3.11-0ubuntu1)<br></div>Had to force the download to particular version as the current download/patch is not compatible with pacemaker.<br>


</div>You too try to install 8.3.11 and check once, all the best<br></div><div class="HOEnZb"><div class="h5"><div class="gmail_extra"><br><br><div class="gmail_quote">On Fri, Jun 13, 2014 at 5:22 AM, Andrew Beekhof <span dir="ltr">&lt;<a href="mailto:andrew@beekhof.net" target="_blank">andrew@beekhof.net</a>&gt;</span> wrote:<br>


<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><br>

On 12 Jun 2014, at 9:15 pm, kamal kishi &lt;<a href="mailto:kamal.kishi@gmail.com" target="_blank">kamal.kishi@gmail.com</a>&gt; wrote:<br>

<br>

&gt; Hi All,<br>

&gt;<br>

&gt; This might be a basic question but I&#39;m not sure whats taking time for failover switching.<br>

&gt; Hope anyone can figure it out.<br>

<br>

</div>How about looking in the logs and seeing when the various stop/start actions occur and which ones take the longest?<br>

<div><div><br>

&gt;<br>

&gt; Scenario -<br>

&gt; Pacemaker running DRBD(Dual primary mode)+OCFS2+XEN for Virtual windows machine<br>

&gt;<br>

&gt; Pacemaker startup starts -<br>

&gt; DRBD -&gt; OCFS2 -&gt; XEN<br>

&gt; Lets consider under Server1  - DRBD, OCFS2(clone) and XEN are started<br>

&gt;<br>

&gt; Server2 - DRBD, OCFS2(clone) are started<br>

&gt;<br>

&gt; Now if Server1 power is OFF<br>

&gt;<br>

&gt; The XEN resource which was running under Server1 should be failed over to Server2.<br>

&gt;<br>

&gt; In my case, its taking almost 90 to 110 seconds to do this.<br>

&gt;<br>

&gt; Can anyone suggest me ways to reduce it to within 30 to 40 seconds<br>

&gt;<br>

&gt; My pacemaker configuration is -<br>

&gt; crm configure<br>

&gt; property no-quorum-policy=ignore<br>

&gt; property stonith-enabled=false<br>

&gt; property default-resource-stickiness=1000<br>

&gt;<br>

&gt; primitive resDRBDr1 ocf:linbit:drbd \<br>

&gt; params drbd_resource=&quot;r0&quot; \<br>

&gt; op start interval=&quot;0&quot; timeout=&quot;240s&quot; \<br>

&gt; op stop interval=&quot;0&quot; timeout=&quot;100s&quot; \<br>

&gt; op monitor interval=&quot;20s&quot; role=&quot;Master&quot; timeout=&quot;240s&quot; \<br>

&gt; op monitor interval=&quot;30s&quot; role=&quot;Slave&quot; timeout=&quot;240s&quot; \<br>

&gt; meta migration-threshold=&quot;3&quot; failure-timeout=&quot;60s&quot;<br>

&gt; primitive resOCFS2r1 ocf:heartbeat:Filesystem \<br>

&gt; params device=&quot;/dev/drbd/by-res/r0&quot; directory=&quot;/cluster&quot; fstype=&quot;ocfs2&quot; \<br>

&gt; op monitor interval=&quot;10s&quot; timeout=&quot;60s&quot; \<br>

&gt; op start interval=&quot;0&quot; timeout=&quot;90s&quot; \<br>

&gt; op stop interval=&quot;0&quot; timeout=&quot;60s&quot; \<br>

&gt; meta migration-threshold=&quot;3&quot; failure-timeout=&quot;60s&quot;<br>

&gt; primitive resXen1 ocf:heartbeat:Xen \<br>

&gt; params xmfile=&quot;/home/cluster/xen/win7.cfg&quot; name=&quot;xenwin7&quot; \<br>

&gt; op monitor interval=&quot;20s&quot; timeout=&quot;60s&quot; \<br>

&gt; op start interval=&quot;0&quot; timeout=&quot;90s&quot; \<br>

&gt; op stop interval=&quot;0&quot; timeout=&quot;60s&quot; \<br>

&gt; op migrate_from interval=&quot;0&quot; timeout=&quot;120s&quot; \<br>

&gt; op migrate_to interval=&quot;0&quot; timeout=&quot;120s&quot; \<br>

&gt; meta allow-migrate=&quot;true&quot; target-role=&quot;started&quot;<br>

&gt;<br>

&gt; ms msDRBDr1 resDRBDr1 \<br>

&gt; meta notify=&quot;true&quot; master-max=&quot;2&quot; interleave=&quot;true&quot; target-role=&quot;Started&quot;<br>

&gt; clone cloOCFS2r1 resOCFS2r1 \<br>

&gt; meta interleave=&quot;true&quot; ordered=&quot;true&quot; target-role=&quot;Started&quot;<br>

&gt;<br>

&gt; colocation colOCFS12-with-DRBDrMaster inf: cloOCFS2r1 msDRBDr1:Master<br>

&gt; colocation colXen-with-OCFSr1 inf: resXen1 cloOCFS2r1<br>

&gt; order ordDRBD-before-OCFSr1 inf: msDRBDr1:promote cloOCFS2r1:start<br>

&gt; order ordOCFS2r1-before-Xen1 inf: cloOCFS2r1:start resXen1:start<br>

&gt;<br>

&gt; commit<br>

&gt; bye<br>

&gt;<br>

&gt; --<br>

&gt; Regards,<br>

&gt; Kamal Kishore B V<br>

&gt;<br>

</div></div><div><div>&gt; _______________________________________________<br>

&gt; Pacemaker mailing list: <a href="mailto:Pacemaker@oss.clusterlabs.org" target="_blank">Pacemaker@oss.clusterlabs.org</a><br>

&gt; <a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br>

&gt;<br>

&gt; Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br>

&gt; Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>

&gt; Bugs: <a href="http://bugs.clusterlabs.org" target="_blank">http://bugs.clusterlabs.org</a><br>

<br>

</div></div><br>_______________________________________________<br>

Pacemaker mailing list: <a href="mailto:Pacemaker@oss.clusterlabs.org" target="_blank">Pacemaker@oss.clusterlabs.org</a><br>

<a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br>

<br>

Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br>

Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br>

Bugs: <a href="http://bugs.clusterlabs.org" target="_blank">http://bugs.clusterlabs.org</a><br>

<br></blockquote></div><br><br clear="all"><br>-- <br>Regards,<br>Kamal Kishore B V<br>

</div>

</div></div></blockquote></div><br><br clear="all"><br>-- <br>Regards,<br>Kamal Kishore B V<br>

</div>