[Pacemaker] Following the clusters from scratch v2 document, and coming up with weird (erroneous?) errors...

Colin Hines colinhines at gmail.com
Sat Apr 9 03:18:56 CET 2011


Okey dokey, I've done some further troubleshooting and started again from
scratch on a new node.  I'm performing this setup on a CentOS 5.5 node.
Here's an excerpt from my messages file, taken right after running
"yum -y install pacemaker corosync":

Apr  8 11:50:19 cvt-db-003 yum: Updated: bzip2-libs-1.0.3-6.el5_5.x86_64
....many packages removed......
Apr  8 11:50:34 cvt-db-003 yum: Installed: corosync-1.2.7-1.1.el5.i386
Apr  8 11:50:34 cvt-db-003 yum: Installed: corosynclib-1.2.7-1.1.el5.x86_64
Apr  8 11:50:34 cvt-db-003 yum: Installed: pacemaker-libs-1.0.10-1.4.el5.x86_64
Apr  8 11:50:34 cvt-db-003 yum: Installed: corosync-1.2.7-1.1.el5.x86_64
Apr  8 11:50:35 cvt-db-003 yum: Installed: heartbeat-stonith-2.1.4-11.el5.x86_64
Apr  8 11:50:35 cvt-db-003 yum: Installed: pacemaker-1.0.10-1.4.el5.i386
Apr  8 11:50:35 cvt-db-003 yum: Updated: rpm-libs-4.4.2.3-20.el5_5.1.x86_64
Apr  8 11:50:35 cvt-db-003 yum: Updated: rpm-4.4.2.3-20.el5_5.1.x86_64
Apr  8 11:50:35 cvt-db-003 yum: Updated: rpm-python-4.4.2.3-20.el5_5.1.x86_64
Apr  8 11:50:36 cvt-db-003 yum: Installed: pacemaker-1.0.10-1.4.el5.x86_64
Apr  8 11:50:39 cvt-db-003 cl_status: [18858]: ERROR: Cannot signon with heartbeat
Apr  8 11:50:39 cvt-db-003 cl_status: [18858]: ERROR: REASON: hb_api_signon: Can't initiate connection  to heartbeat
Apr  8 11:50:39 cvt-db-003 cl_status: [18859]: ERROR: Cannot signon with heartbeat
Apr  8 11:50:39 cvt-db-003 cl_status: [18859]: ERROR: REASON: hb_api_signon: Can't initiate connection  to heartbeat
Apr  8 11:51:39 cvt-db-003 cl_status: [18971]: ERROR: Cannot signon with heartbeat
...many more follow....
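
For anyone who wants to check how often cl_status is being fired off on
their own box, here is a rough Python 3 sketch that tallies the failed
sign-ons per minute.  It assumes the syslog line format shown in the excerpt
above and treats each distinct PID in the square brackets as one cl_status
run; the function name cl_status_runs_per_minute is made up for this
example, and the sample lines are copied from the excerpt.

```python
import re

# Matches the first error line of each cl_status run, e.g.
# "Apr  8 11:50:39 cvt-db-003 cl_status: [18858]: ERROR: Cannot signon with heartbeat"
# The follow-up "ERROR: REASON: ..." line is deliberately not matched.
LINE_RE = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d{2}:\d{2}):\d{2}\s+\S+\s+"
    r"cl_status:\s+\[(?P<pid>\d+)\]:\s+ERROR:\s+Cannot signon"
)


def cl_status_runs_per_minute(lines):
    """Return {"Mon D HH:MM": number of distinct cl_status PIDs that failed}."""
    pids_by_minute = {}
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            # Collapse the padded day-of-month ("Apr  8") to single spaces.
            minute = " ".join(match.group("stamp").split())
            pids_by_minute.setdefault(minute, set()).add(match.group("pid"))
    return {minute: len(pids) for minute, pids in pids_by_minute.items()}


# Sample lines copied from the excerpt above.
sample = [
    "Apr  8 11:50:36 cvt-db-003 yum: Installed: pacemaker-1.0.10-1.4.el5.x86_64",
    "Apr  8 11:50:39 cvt-db-003 cl_status: [18858]: ERROR: Cannot signon with heartbeat",
    "Apr  8 11:50:39 cvt-db-003 cl_status: [18858]: ERROR: REASON: hb_api_signon: Can't initiate connection  to heartbeat",
    "Apr  8 11:50:39 cvt-db-003 cl_status: [18859]: ERROR: Cannot signon with heartbeat",
    "Apr  8 11:51:39 cvt-db-003 cl_status: [18971]: ERROR: Cannot signon with heartbeat",
]
print(cl_status_runs_per_minute(sample))
```

To run it against a real log, pass an open file instead of the sample list,
e.g. cl_status_runs_per_minute(open("/var/log/messages")).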


What's weird to me is that I hadn't started ANY services or run any commands
by this point, so I'm thinking something in the RPM is kicking off that
cl_status command.
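
One way to catch whatever is launching cl_status is to poll /proc and print
the parent chain of any process with that name.  The following is only a
sketch under a few assumptions: it needs Python 3 (not the stock interpreter
on CentOS 5), it relies on the Linux /proc/<pid>/status layout, and the
helper names read_status, ancestry and watch are invented for this example.
Because cl_status exits quickly, a polling loop can miss it, so treat an
empty result as inconclusive.

```python
import os
import time


def read_status(pid):
    """Return (name, parent_pid) from /proc/<pid>/status, or None if the process is gone."""
    fields = {}
    try:
        with open(f"/proc/{pid}/status") as fh:
            for line in fh:
                key, _, value = line.partition(":")
                fields[key] = value.strip()
    except OSError:
        return None
    return fields.get("Name", "?"), int(fields.get("PPid", 0))


def ancestry(pid):
    """Return [(pid, name), ...] from the given process up towards PID 1."""
    chain = []
    while pid > 0:
        info = read_status(pid)
        if info is None:
            break
        name, parent_pid = info
        chain.append((pid, name))
        pid = parent_pid
    return chain


def watch(target="cl_status", interval=0.05, duration=120.0):
    """Poll /proc for `duration` seconds; print the ancestry of each new `target` process."""
    seen = set()
    deadline = time.monotonic() + duration
    while time.monotonic() < deadline:
        for entry in os.listdir("/proc"):
            if entry.isdigit() and entry not in seen:
                info = read_status(int(entry))
                if info and info[0] == target:
                    seen.add(entry)
                    chain = ancestry(int(entry))
                    print(" <- ".join(f"{name}[{pid}]" for pid, name in chain))
        time.sleep(interval)
```

Calling watch() as root in a second terminal and leaving it running for a
couple of minutes should cover at least one of the once-a-minute bursts seen
in the log.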

I believe I've determined that the errors start as soon as the
heartbeat-3.0.3-2.3.el5.x86_64.rpm package is installed.  It seems to be a
required dependency of the latest pacemaker RPM on
http://www.clusterlabs.org/rpm/epel-5/.  I removed the pacemaker and
heartbeat packages using yum and then re-added them from the individual
RPMs, but found that pacemaker needs the heartbeat-libs package, otherwise
tools such as crm_verify fail.  After re-installing heartbeat-libs,
pacemaker, and pacemaker-libs with --nodeps, the error messages stopped.  I
can break/fix the issue at will by installing and removing the
heartbeat-3.0.3-2.3.el5.x86_64 package.

c


On Fri, Apr 8, 2011 at 9:48 AM, Lars Ellenberg <lars.ellenberg at linbit.com> wrote:

> On Fri, Apr 08, 2011 at 09:13:45AM +0200, Andrew Beekhof wrote:
> > On Thu, Apr 7, 2011 at 11:48 PM, Colin Hines <colinhines at gmail.com> wrote:
> > > I've recently followed the clusters from scratch v2 document for RHEL and
> > > although my cluster works and fails over correctly using corosync, I have
> > > the following error message coming up in my logs consistently, twice a
> > > minute:
> > > Apr  7 17:44:41 cvt-db-005 cl_status: [5901]: ERROR: Cannot signon with heartbeat
> > > Apr  7 17:44:41 cvt-db-005 cl_status: [5901]: ERROR: REASON: hb_api_signon: Can't initiate connection  to heartbeat
> >
> > Someone/something is running cl_status.
> > Find out who/what and stop them - it has no place in a corosync based cluster.
>
> That could be the status action of the SBD stonith plugin,
> between commits
> http://hg.linux-ha.org/glue/rev/faada7f3d069    (Apr 2010)
> http://hg.linux-ha.org/glue/rev/1448deafdf79    (May 2010)
>
> if so, upgrade your "cluster glue".
>
> > > I can send my configs, but they're pretty vanilla, has anyone seen anything
> > > like this before.   I did have a heartbeat installation on this host before
> > > I followed the CFSv2 document, but heartbeat is stopped and I've verified
> > > that cl_status doesn't output those errors if I stop corosync.
> > > c
>
> --
> : Lars Ellenberg
> : LINBIT | Your Way to High Availability
> : DRBD/HA support and consulting http://www.linbit.com
>
> DRBD® and LINBIT® are registered trademarks of LINBIT, Austria.