[Pacemaker] Service restoration in clone resource group
Andrew Beekhof
andrew at beekhof.net
Tue Oct 15 22:21:47 UTC 2013
On 10/10/2013, at 12:52 PM, Sean Lutner <sean at rentul.net> wrote:
>
> On Oct 8, 2013, at 9:45 AM, Sean Lutner <sean at rentul.net> wrote:
>
>>
>> On Oct 8, 2013, at 9:33 AM, Lars Marowsky-Bree <lmb at suse.com> wrote:
>>
>>> On 2013-10-08T09:29:14, Sean Lutner <sean at rentul.net> wrote:
>>>
>>>> The clone was created using the interleave=true option, yes.
You might want to trawl the raw xml to make sure pcs did the right thing.
cibadmin -Ql | grep interleave
would tell you.
>>>
>>> Ok, so pcs hides that (interesting to know).
>>>
>>>> Does this have an affect on what I'm trying to accomplish?
>>>
>>> Yes, if you hadn't set that, it might have been an explanation. My best
>>> guess right now would be to upgrade first; the PE has gotten quite a few
>>> fixes since 1.1.8 again.
>>
>> Are you indicating that the behavior I expect to see, which is the resource being marked as Started on the now passive node, is what pacemaker should be doing and this could be a bug?
>>
>> If it would help, I can provide a full cib configuration and logs while I execute the tests I've been running. I won't be able to do that until tonight (EST time) but can if it may help.
>>
>> Thanks
>> Sean
>
> Sorry for following up on my own post but I have a follow-up question about the failcount for a resource. Does a crm_resource --cleanup erase the failcount on the resource it's run against?
Older versions didn't but I don't exactly recall when we started doing that.
> I'm looking at making changes to the failure-timeout and cluster-recheck-interval which when combined with my values of resource-stickiness=100 and migration-threshold=1 should allow for the services on the now failed node to be restarted and be marked as Started in the cluster without causing an unnecessary failover.
>
> Does this make sense?
yes
>
>>
>>>
>>>
>>> Regards,
>>> Lars
>>>
>>> --
>>> Architect Storage/HA
>>> SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
>>> "Experience is the name everyone gives to their mistakes." -- Oscar Wilde
>>>
>>>
>>> _______________________________________________
>>> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
>>> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>>
>>> Project Home: http://www.clusterlabs.org
>>> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
>>> Bugs: http://bugs.clusterlabs.org
>>
>> _______________________________________________
>> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
>> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>>
>> Project Home: http://www.clusterlabs.org
>> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
>> Bugs: http://bugs.clusterlabs.org
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 841 bytes
Desc: Message signed with OpenPGP using GPGMail
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20131016/e6284fee/attachment-0004.sig>
More information about the Pacemaker
mailing list