[Pacemaker] libqb-0.16 instability with standby/unstandby ?

David Vossel dvossel at redhat.com
Wed Oct 23 13:44:44 UTC 2013





----- Original Message -----
> From: "Andrew Beekhof" <andrew at beekhof.net>
> To: "The Pacemaker cluster resource manager" <pacemaker at oss.clusterlabs.org>
> Sent: Wednesday, October 23, 2013 12:12:32 AM
> Subject: Re: [Pacemaker] libqb-0.16 instability with standby/unstandby ?
> 
> 
> On 23 Oct 2013, at 8:39 am, David Vossel <dvossel at redhat.com> wrote:
> 
> > ----- Original Message -----
> >> From: "Mike Pomraning" <mjp at pilcrow.madison.wi.us>
> >> To: pacemaker at oss.clusterlabs.org
> >> Sent: Tuesday, October 22, 2013 10:49:28 AM
> >> Subject: [Pacemaker] libqb-0.16 instability with standby/unstandby ?
> >> 
> >> Regarding Justin Burnham's recent "Pacemaker crash on node
> >> unstandby/standby"[0] message, is anyone else seeing this behavior with
> >> libqb-0.16?
> >> 
> >> I'm getting anecdotal reports of the same behavior from a team at work
> >> using RHEL-derived pcmk-1.1.8 and corosync-1.4.1 with libqb-0.16.
> >> Reverting to libqb-0.14 appears to have solved the issue. Sorry, I don't
> >> have enough to reproduce yet, but the similarities in symptoms are
> >> suggestive.
> >> 
> >> FWIW, Justin also noted off list that his problems appear to have begun
> >> after updating to 0.16 a short time ago.
> > 
> > I've tracked this down. Don't use libqb v0.16.0 with any pacemaker version
> > less than 1.1.10.
> 
> So this isn't just a question of older pacemaker versions needing a rebuild?

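A rebuild won't help. The race condition is in pacemaker itself, so
recompiling an older pacemaker against libqb 0.16.0 doesn't remove it.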

> > 
> > > There are multiple elements involved in this problem. Libqb had
> > > reference count leaks in 0.14.4. Once those were resolved, we discovered
> > > a race condition in pacemaker 1.1.8 that caused a double free. The
> > > reference count leaks appear to have been masking the problem in
> > > pacemaker. Updating to libqb 0.16.0 while using pacemaker 1.1.8 exposes
> > > the race condition, which is what you are all seeing.
> > 
> > -- Vossel
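
To illustrate how a reference count leak can hide a double free, here is a
toy sketch in Python. It is not libqb or pacemaker code: the RefCounted
class, racy_teardown() and run() are hypothetical names invented for this
example, and the "race" is reduced to two sequential unref calls that each
assume they hold the last reference.

```python
class RefCounted:
    """Toy reference-counted object (hypothetical, not the libqb API)."""

    def __init__(self):
        self.refs = 1        # the creator holds the first reference
        self.freed = False

    def ref(self):
        if self.freed:
            raise RuntimeError("use after free")
        self.refs += 1

    def unref(self):
        if self.freed:
            raise RuntimeError("double free")
        self.refs -= 1
        if self.refs == 0:
            self.freed = True


def racy_teardown(obj):
    """Stand-in for two code paths that both drop 'the last' reference."""
    obj.unref()
    obj.unref()


def run(leak_reference):
    obj = RefCounted()
    if leak_reference:
        obj.ref()            # reference taken and never released (the leak)
    try:
        racy_teardown(obj)
        return "no crash"
    except RuntimeError as err:
        return str(err)
```

With the leak, run(True) returns "no crash" because the stray reference
absorbs the extra unref. Without it, run(False) returns "double free",
which corresponds to the situation after the leaks were fixed.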
> > 
> >> -Mike
> >> 
> >> [0] http://comments.gmane.org/gmane.linux.highavailability.pacemaker/19289
> >> 
> >> _______________________________________________
> >> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> >> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
> >> 
> >> Project Home: http://www.clusterlabs.org
> >> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> >> Bugs: http://bugs.clusterlabs.org
> >> 
> > 



