[Pacemaker] Getting split brain after all reboot of a cluster node

Wed Mar 5 11:26:12 CET 2014

Because you don't have fencing configured (your Pacemaker configuration has stonith-enabled="false").
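
As a rough sketch of what configuring fencing could look like in crm shell syntax: this assumes both nodes have IPMI-capable management boards and that the external/ipmi STONITH plugin is installed, neither of which is stated in your mail. The resource names, IP addresses and credentials below are placeholders, not values from your setup.

```
# Hypothetical fencing devices, one per node; ipaddr/userid/passwd are placeholders
primitive fence-dzacupsvr stonith:external/ipmi \
        params hostname="dzacupsvr" ipaddr="10.0.0.1" userid="admin" passwd="secret" interface="lan" \
        op monitor interval="60s"
primitive fence-dzacupsvr2 stonith:external/ipmi \
        params hostname="dzacupsvr2" ipaddr="10.0.0.2" userid="admin" passwd="secret" interface="lan" \
        op monitor interval="60s"
# A node should not run the device that is meant to fence itself
location fence-dzacupsvr-placement fence-dzacupsvr -inf: dzacupsvr
location fence-dzacupsvr2-placement fence-dzacupsvr2 -inf: dzacupsvr2
property stonith-enabled="true"
```

Any other STONITH agent that matches your hardware would follow the same pattern; the important part is that stonith-enabled ends up "true" with a working device for each node.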


2014-03-05 9:28 GMT+01:00 Anne Nicolas <ennael1 at gmail.com>:

> Hi
>
> I'm having trouble setting up a very simple cluster with 2 nodes. After
> every reboot I get a split brain that I then have to resolve by hand.
> I'm looking for a solution to this...
>
> Both nodes have 4 network interfaces. We use 3 of them: one for the
> cluster IP, one for a bridge for a VM and the last one for the private
> network of the cluster.
>
> I'm using
> drbd : 8.3.9
> drbd-utils: 8.3.9
>
> DRBD configuration:
> ============
> $ cat global_common.conf
> global {
>         usage-count no;
>         disable-ip-verification;
> }
> common { syncer { rate 500M; } }
>
> $ cat server.res
> resource server {
>         protocol C;
>         net {
>                 cram-hmac-alg sha1;
>                 shared-secret "eafcupps";
>         }
>         on dzacupsvr {
>                 device     /dev/drbd0;
>                 disk       /dev/vg0/server;
>                 address    172.16.1.1:7788;
>                 flexible-meta-disk  internal;
>         }
>         on dzacupsvr2 {
>                 device     /dev/drbd0;
>                 disk       /dev/vg0/server;
>                 address    172.16.1.2:7788;
>                 flexible-meta-disk  internal;
>         }
> }
>
> Pacemaker configuration
> ===============
> node $id="16847020" dzacupsvr
> node $id="33624236" dzacupsvr2
> primitive apache ocf:heartbeat:apache \
>         params configfile="/etc/httpd/conf/httpd.conf" \
>         op start interval="0" timeout="40s" \
>         op stop interval="0" timeout="60s"
> primitive clusterip ocf:heartbeat:IPaddr2 \
>         params ip="192.168.1.5" cidr_netmask="24" nic="eth0"
> primitive drbdserv ocf:linbit:drbd \
>         params drbd_resource="server" \
>         op monitor interval="60s"
> primitive fsserv ocf:heartbeat:Filesystem \
>         params device="/dev/drbd/by-res/server" directory="/EdgeServer" fstype="ext4"
> primitive libvirt-guests lsb:libvirt-guests
> primitive libvirtd lsb:libvirtd
> primitive mysql ocf:heartbeat:mysql \
>         params binary="/usr/bin/mysqld_safe" config="/etc/my.cnf" datadir="/EdgeServer/mysql" \
>         op start interval="0" timeout="40s" \
>         op stop interval="0" timeout="60s" \
>         meta target-role="Started"
> primitive named lsb:named
> primitive samba lsb:smb
> group services fsserv clusterip libvirtd samba apache mysql
> ms drbdservClone drbdserv \
>         meta master-max="1" master-node-max="1" clone-max="2" clone-node-max="1" notify="true"
> colocation fs_on_drbd inf: fsserv drbdservClone:Master
> order fsserv-after-drbdserv inf: drbdservClone:promote fsserv:start
> property $id="cib-bootstrap-options" \
>         dc-version="1.1.7-2.mga1-ee0730e13d124c3d58f00016c3376a1de5323cff" \
>         cluster-infrastructure="corosync" \
>         stonith-enabled="false" \
>         no-quorum-policy="ignore"
>
> and here are the logs
> http://pastebin.com/DcV2WM3M
>
> After looking for more information, I added fencing handlers to the DRBD configuration:
>
> handlers {
>     fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
>     after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
>   }
> but still without any success...
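
One thing worth checking here, offered as a guess since the full resource file with these handlers is not shown: in DRBD 8.3 the fence-peer handler is only invoked when a fencing policy is set in the disk section. A minimal sketch of how the two pieces fit together in the "server" resource (the handler paths are the ones quoted above; the disk section is my assumption about what may be missing):

```
resource server {
        disk {
                # without a fencing policy, DRBD never calls fence-peer
                fencing resource-only;
        }
        handlers {
                fence-peer "/usr/lib/drbd/crm-fence-peer.sh";
                after-resync-target "/usr/lib/drbd/crm-unfence-peer.sh";
        }
}
```

If node-level fencing (STONITH) is available in Pacemaker, "resource-and-stonith" is the stricter policy to use instead of "resource-only".
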
>
> Any help appreciated
>
> Cheers
>
> --
> Anne
> http://mageia.org
>
> _______________________________________________
> Pacemaker mailing list: Pacemaker at oss.clusterlabs.org
> http://oss.clusterlabs.org/mailman/listinfo/pacemaker
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
>


-- 
This is my life and I live it for as long as God wills