<html><head></head><body><div style="font-family: Verdana;font-size: 12.0px;"><div>

<div>I can agree, Master monitor operation is broken in 1.1.10 release.</div>


<div>When the slave monitor action is started, the master monitor action is not called any more.</div>


<div>&nbsp;</div>


<div>I have created a setup with Stateful resource with two nodes.</div>


<div>Then the Pacemaker installation is changed to different versions without changing the configuration part of the CIB.</div>


<div>

<div>&nbsp;</div>


<div>Result:</div>


<div>1.1.10-rc5, 1.1.10-rc6 and 1.1.10-rc7 does not have this error</div>


<div>1.1.10-1 release has the error</div>


<div>&nbsp;</div>


<div>Installation order (just that anybody know how it was done):</div>


<div>1.1.10-1 -&gt; error</div>


<div>1.1.10-rc5 -&gt; no error</div>


<div>1.1.10-rc6 -&gt; no error</div>


<div>1.1.10-rc7 -&gt; no error</div>


<div>1.1.10-1 -&gt; error</div>


<div>&nbsp;</div>


<div>Rainer</div>


<div name="quote" style="margin:10px 5px 5px 10px; padding: 10px 0 10px 10px; border-left:2px solid #C3D9E5; word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;">

<div style="margin:0 0 10px 0;"><b>Gesendet:</b>&nbsp;Freitag, 26. Juli 2013 um 09:32 Uhr<br/>

<b>Von:</b>&nbsp;&quot;Takatoshi MATSUO&quot; &lt;matsuo.tak@gmail.com&gt;<br/>

<b>An:</b>&nbsp;&quot;The Pacemaker cluster resource manager&quot; &lt;pacemaker@oss.clusterlabs.org&gt;<br/>

<b>Betreff:</b>&nbsp;Re: [Pacemaker] Announce: Pacemaker 1.1.10 now available</div>


<div name="quoted-content">Hi<br/>

<br/>

I used Stateful RA and caught a same issue.<br/>

<br/>

1. before starting slave<br/>

<br/>

# crm_simulate -VVV -S -x /var/lib/pacemaker/pengine/pe-input-1543.bz2<br/>

&#124; grep &quot;Resource action&quot;<br/>

* Resource action: stateful monitor=2000 on 16-sl6<br/>

<br/>

2. starting slave<br/>

# crm_simulate -VVV -S -x /var/lib/pacemaker/pengine/pe-input-1544.bz2<br/>

&#124; grep &quot;Resource action&quot;<br/>

* Resource action: stateful monitor on 17-sl6<br/>

* Resource action: stateful notify on 16-sl6<br/>

* Resource action: stateful start on 17-sl6<br/>

* Resource action: stateful notify on 16-sl6<br/>

* Resource action: stateful notify on 17-sl6<br/>

* Resource action: stateful monitor=3000 on 17-sl6<br/>

<br/>

3. after<br/>

# crm_simulate -VVV -S -x /var/lib/pacemaker/pengine/pe-input-1545.bz2<br/>

&#124; grep &quot;Resource action&quot;<br/>

* Resource action: stateful monitor=3000 on 17-sl6<br/>

<br/>

Monitor=2000 is deleted.<br/>

Is this correct ?<br/>

<br/>

<br/>

My setting<br/>

--------<br/>

property &#92;<br/>

no-quorum-policy=&quot;ignore&quot; &#92;<br/>

stonith-enabled=&quot;false&quot;<br/>

<br/>

rsc_defaults &#92;<br/>

resource-stickiness=&quot;INFINITY&quot; &#92;<br/>

migration-threshold=&quot;1&quot;<br/>

<br/>

ms msStateful stateful &#92;<br/>

meta &#92;<br/>

master-max=&quot;1&quot; &#92;<br/>

master-node-max=&quot;1&quot; &#92;<br/>

clone-max=&quot;2&quot; &#92;<br/>

clone-node-max=&quot;1&quot; &#92;<br/>

notify=&quot;true&quot;<br/>

<br/>

primitive stateful ocf:heartbeat:Stateful &#92;<br/>

op start timeout=&quot;60s&quot; interval=&quot;0s&quot; on-fail=&quot;restart&quot; &#92;<br/>

op monitor timeout=&quot;60s&quot; interval=&quot;3s&quot; on-fail=&quot;restart&quot; &#92;<br/>

op monitor timeout=&quot;60s&quot; interval=&quot;2s&quot; on-fail=&quot;restart&quot; role=&quot;Master&quot; &#92;<br/>

op promote timeout=&quot;60s&quot; interval=&quot;0s&quot; on-fail=&quot;restart&quot; &#92;<br/>

op demote timeout=&quot;60s&quot; interval=&quot;0s&quot; on-fail=&quot;stop&quot; &#92;<br/>

op stop timeout=&quot;60s&quot; interval=&quot;0s&quot; on-fail=&quot;block&quot;<br/>

--------<br/>

<br/>

Regards,<br/>

Takatoshi MATSUO<br/>

<br/>

2013/7/26 Takatoshi MATSUO &lt;matsuo.tak@gmail.com&gt;:<br/>

&gt; Hi<br/>

&gt;<br/>

&gt; My report is late for 1.1.10 :(<br/>

&gt;<br/>

&gt; I am using pacemaker 1.1.10-0.1.ab2e209.git.<br/>

&gt; It seems that master&#39;s monitor is stopped when slave is started.<br/>

&gt;<br/>

&gt; Does someone encounter same problem ?<br/>

&gt; I attach a log and settings.<br/>

&gt;<br/>

&gt;<br/>

&gt; Thanks,<br/>

&gt; Takatoshi MATSUO<br/>

&gt;<br/>

&gt; 2013/7/26 Digimer &lt;lists@alteeve.ca&gt;:<br/>

&gt;&gt; Congrats!! I know this was a long time in the making.<br/>

&gt;&gt;<br/>

&gt;&gt; digimer<br/>

&gt;&gt;<br/>

&gt;&gt;<br/>

&gt;&gt; On 25/07/13 20:43, Andrew Beekhof wrote:<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; Announcing the release of Pacemaker 1.1.10<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; <a href="https://github.com/ClusterLabs/pacemaker/releases/Pacemaker-1.1.10" target="_blank">https://github.com/ClusterLabs/pacemaker/releases/Pacemaker-1.1.10</a><br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; There were three changes of note since rc7:<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; + Bug cl#5161 - crmd: Prevent memory leak in operation cache<br/>

&gt;&gt;&gt; + cib: Correctly read back archived configurations if the primary is<br/>

&gt;&gt;&gt; corrupted<br/>

&gt;&gt;&gt; + cman: Do not pretend we know the state of nodes we&#39;ve never seen<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; Along with assorted bug fixes, the major topics for this release were:<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; - stonithd fixes<br/>

&gt;&gt;&gt; - fixing memory leaks, often caused by incorrect use of glib reference<br/>

&gt;&gt;&gt; counting<br/>

&gt;&gt;&gt; - supportability improvements (code cleanup and deduplication,<br/>

&gt;&gt;&gt; standardized error codes)<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; Release candidates for the next Pacemaker release (1.1.11) can be<br/>

&gt;&gt;&gt; expected some time around Novemeber.<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; A big thankyou to everyone that spent time testing the release<br/>

&gt;&gt;&gt; candidates and/or contributed patches. However now that Pacemaker is<br/>

&gt;&gt;&gt; perfect, anyone reporting bugs will be shot :-)<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; To build &#96;rpm&#96; packages:<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; 1. Clone the current sources:<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; # git clone --depth 0 git://github.com/ClusterLabs/pacemaker.git<br/>

&gt;&gt;&gt; # cd pacemaker<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; 1. Install dependancies (if you haven&#39;t already)<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; [Fedora] # sudo yum install -y yum-utils<br/>

&gt;&gt;&gt; [ALL] # make rpm-dep<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; 1. Build Pacemaker<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; # make release<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; 1. Copy and deploy as needed<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; ## Details - 1.1.10 - final<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; Changesets: 602<br/>

&gt;&gt;&gt; Diff: 143 files changed, 8162 insertions(+), 5159 deletions(-)<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; ## Highlights<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; ### Features added since Pacemaker-1.1.9<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; + Core: Convert all exit codes to positive errno values<br/>

&gt;&gt;&gt; + crm_error: Add the ability to list and print error symbols<br/>

&gt;&gt;&gt; + crm_resource: Allow individual resources to be reprobed<br/>

&gt;&gt;&gt; + crm_resource: Allow options to be set recursively<br/>

&gt;&gt;&gt; + crm_resource: Implement --ban for moving resources away from nodes<br/>

&gt;&gt;&gt; and --clear (replaces --unmove)<br/>

&gt;&gt;&gt; + crm_resource: Support OCF tracing when using<br/>

&gt;&gt;&gt; --force-(check&#124;start&#124;stop)<br/>

&gt;&gt;&gt; + PE: Allow active nodes in our current membership to be fenced without<br/>

&gt;&gt;&gt; quorum<br/>

&gt;&gt;&gt; + PE: Suppress meaningless IDs when displaying anonymous clone status<br/>

&gt;&gt;&gt; + Turn off auto-respawning of systemd services when the cluster starts<br/>

&gt;&gt;&gt; them<br/>

&gt;&gt;&gt; + Bug cl#5128 - pengine: Support maintenance mode for a single node<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; ### Changes since Pacemaker-1.1.9<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; + crmd: cib: stonithd: Memory leaks resolved and improved use of glib<br/>

&gt;&gt;&gt; reference counting<br/>

&gt;&gt;&gt; + attrd: Fixes deleted attributes during dc election<br/>

&gt;&gt;&gt; + Bug cf#5153 - Correctly display clone failcounts in crm_mon<br/>

&gt;&gt;&gt; + Bug cl#5133 - pengine: Correctly observe on-fail=block for failed<br/>

&gt;&gt;&gt; demote operation<br/>

&gt;&gt;&gt; + Bug cl#5148 - legacy: Correctly remove a node that used to have a<br/>

&gt;&gt;&gt; different nodeid<br/>

&gt;&gt;&gt; + Bug cl#5151 - Ensure node names are consistently compared without<br/>

&gt;&gt;&gt; case<br/>

&gt;&gt;&gt; + Bug cl#5152 - crmd: Correctly clean up fenced nodes during membership<br/>

&gt;&gt;&gt; changes<br/>

&gt;&gt;&gt; + Bug cl#5154 - Do not expire failures when on-fail=block is present<br/>

&gt;&gt;&gt; + Bug cl#5155 - pengine: Block the stop of resources if any depending<br/>

&gt;&gt;&gt; resource is unmanaged<br/>

&gt;&gt;&gt; + Bug cl#5157 - Allow migration in the absence of some colocation<br/>

&gt;&gt;&gt; constraints<br/>

&gt;&gt;&gt; + Bug cl#5161 - crmd: Prevent memory leak in operation cache<br/>

&gt;&gt;&gt; + Bug cl#5164 - crmd: Fixes crash when using pacemaker-remote<br/>

&gt;&gt;&gt; + Bug cl#5164 - pengine: Fixes segfault when calculating transition<br/>

&gt;&gt;&gt; with remote-nodes.<br/>

&gt;&gt;&gt; + Bug cl#5167 - crm_mon: Only print &quot;stopped&quot; node list for incomplete<br/>

&gt;&gt;&gt; clone sets<br/>

&gt;&gt;&gt; + Bug cl#5168 - Prevent clones from being bounced around the cluster<br/>

&gt;&gt;&gt; due to location constraints<br/>

&gt;&gt;&gt; + Bug cl#5170 - Correctly support on-fail=block for clones<br/>

&gt;&gt;&gt; + cib: Correctly read back archived configurations if the primary is<br/>

&gt;&gt;&gt; corrupted<br/>

&gt;&gt;&gt; + cib: The result is not valid when diffs fail to apply cleanly for CLI<br/>

&gt;&gt;&gt; tools<br/>

&gt;&gt;&gt; + cib: Restore the ability to embed comments in the configuration<br/>

&gt;&gt;&gt; + cluster: Detect and warn about node names with capitals<br/>

&gt;&gt;&gt; + cman: Do not pretend we know the state of nodes we&#39;ve never seen<br/>

&gt;&gt;&gt; + cman: Do not unconditionally start cman if it is already running<br/>

&gt;&gt;&gt; + cman: Support non-blocking CPG calls<br/>

&gt;&gt;&gt; + Core: Ensure the blackbox is saved on abnormal program termination<br/>

&gt;&gt;&gt; + corosync: Detect the loss of members for which we only know the<br/>

&gt;&gt;&gt; nodeid<br/>

&gt;&gt;&gt; + corosync: Do not pretend we know the state of nodes we&#39;ve never seen<br/>

&gt;&gt;&gt; + corosync: Ensure removed peers are erased from all caches<br/>

&gt;&gt;&gt; + corosync: Nodes that can persist in sending CPG messages must be<br/>

&gt;&gt;&gt; alive afterall<br/>

&gt;&gt;&gt; + crmd: Do not get stuck in S_POLICY_ENGINE if a node we couldn&#39;t fence<br/>

&gt;&gt;&gt; returns<br/>

&gt;&gt;&gt; + crmd: Do not update fail-count and last-failure for old failures<br/>

&gt;&gt;&gt; + crmd: Ensure all membership operations can complete while trying to<br/>

&gt;&gt;&gt; cancel a transition<br/>

&gt;&gt;&gt; + crmd: Ensure operations for cleaned up resources don&#39;t block recovery<br/>

&gt;&gt;&gt; + crmd: Ensure we return to a stable state if there have been too many<br/>

&gt;&gt;&gt; fencing failures<br/>

&gt;&gt;&gt; + crmd: Initiate node shutdown if another node claims to have<br/>

&gt;&gt;&gt; successfully fenced us<br/>

&gt;&gt;&gt; + crmd: Prevent messages for remote crmd clients from being relayed to<br/>

&gt;&gt;&gt; wrong daemons<br/>

&gt;&gt;&gt; + crmd: Properly handle recurring monitor operations for remote-node<br/>

&gt;&gt;&gt; agent<br/>

&gt;&gt;&gt; + crmd: Store last-run and last-rc-change for all operations<br/>

&gt;&gt;&gt; + crm_mon: Ensure stale pid files are updated when a new process is<br/>

&gt;&gt;&gt; started<br/>

&gt;&gt;&gt; + crm_report: Correctly collect logs when &#39;uname -n&#39; reports fully<br/>

&gt;&gt;&gt; qualified names<br/>

&gt;&gt;&gt; + fencing: Fail the operation once all peers have been exhausted<br/>

&gt;&gt;&gt; + fencing: Restore the ability to manually confirm that fencing<br/>

&gt;&gt;&gt; completed<br/>

&gt;&gt;&gt; + ipc: Allow unpriviliged clients to clean up after server failures<br/>

&gt;&gt;&gt; + ipc: Restore the ability for members of the haclient group to connect<br/>

&gt;&gt;&gt; to the cluster<br/>

&gt;&gt;&gt; + legacy: Support &quot;crm_node --remove&quot; with a node name for corosync<br/>

&gt;&gt;&gt; plugin (bnc#805278)<br/>

&gt;&gt;&gt; + lrmd: Default to the upstream location for resource agent scratch<br/>

&gt;&gt;&gt; directory<br/>

&gt;&gt;&gt; + lrmd: Pass errors from lsb metadata generation back to the caller<br/>

&gt;&gt;&gt; + pengine: Correctly handle resources that recover before we operate on<br/>

&gt;&gt;&gt; them<br/>

&gt;&gt;&gt; + pengine: Delete the old resource state on every node whenever the<br/>

&gt;&gt;&gt; resource type is changed<br/>

&gt;&gt;&gt; + pengine: Detect constraints with inappropriate actions (ie. promote<br/>

&gt;&gt;&gt; for a clone)<br/>

&gt;&gt;&gt; + pengine: Ensure per-node resource parameters are used during probes<br/>

&gt;&gt;&gt; + pengine: If fencing is unavailable or disabled, block further<br/>

&gt;&gt;&gt; recovery for resources that fail to stop<br/>

&gt;&gt;&gt; + pengine: Implement the rest of get_timet_now() and rename to<br/>

&gt;&gt;&gt; get_effective_time<br/>

&gt;&gt;&gt; + pengine: Re-initiate _active_ recurring monitors that previously<br/>

&gt;&gt;&gt; failed but have timed out<br/>

&gt;&gt;&gt; + remote: Workaround for inconsistent tls handshake behavior between<br/>

&gt;&gt;&gt; gnutls versions<br/>

&gt;&gt;&gt; + systemd: Ensure we get shut down correctly by systemd<br/>

&gt;&gt;&gt; + systemd: Reload systemd after adding/removing override files for<br/>

&gt;&gt;&gt; cluster services<br/>

&gt;&gt;&gt; + xml: Check for and replace non-printing characters with their octal<br/>

&gt;&gt;&gt; equivalent while exporting xml text<br/>

&gt;&gt;&gt; + xml: Prevent lockups by setting a more reliable buffer allocation<br/>

&gt;&gt;&gt; strategy<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; _______________________________________________<br/>

&gt;&gt;&gt; Pacemaker mailing list: Pacemaker@oss.clusterlabs.org<br/>

&gt;&gt;&gt; <a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br/>

&gt;&gt;&gt;<br/>

&gt;&gt;&gt; Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br/>

&gt;&gt;&gt; Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br/>

&gt;&gt;&gt; Bugs: <a href="http://bugs.clusterlabs.org" target="_blank">http://bugs.clusterlabs.org</a><br/>

&gt;&gt;&gt;<br/>

&gt;&gt;<br/>

&gt;&gt;<br/>

&gt;&gt; --<br/>

&gt;&gt; Digimer<br/>

&gt;&gt; Papers and Projects: <a href="https://alteeve.ca/w/" target="_blank">https://alteeve.ca/w/</a><br/>

&gt;&gt; What if the cure for cancer is trapped in the mind of a person without<br/>

&gt;&gt; access to education?<br/>

&gt;&gt;<br/>

&gt;&gt;<br/>

&gt;&gt; _______________________________________________<br/>

&gt;&gt; Pacemaker mailing list: Pacemaker@oss.clusterlabs.org<br/>

&gt;&gt; <a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br/>

&gt;&gt;<br/>

&gt;&gt; Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br/>

&gt;&gt; Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br/>

&gt;&gt; Bugs: <a href="http://bugs.clusterlabs.org" target="_blank">http://bugs.clusterlabs.org</a><br/>

<br/>

_______________________________________________<br/>

Pacemaker mailing list: Pacemaker@oss.clusterlabs.org<br/>

<a href="http://oss.clusterlabs.org/mailman/listinfo/pacemaker" target="_blank">http://oss.clusterlabs.org/mailman/listinfo/pacemaker</a><br/>

<br/>

Project Home: <a href="http://www.clusterlabs.org" target="_blank">http://www.clusterlabs.org</a><br/>

Getting started: <a href="http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf" target="_blank">http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf</a><br/>

Bugs: <a href="http://bugs.clusterlabs.org" target="_blank">http://bugs.clusterlabs.org</a></div>

</div>

</div>

</div></div></body></html>