[ClusterLabs] multiple drives looks like balancing but why and causing troubles
Digimer
lists at alteeve.ca
Wed Aug 26 19:01:19 UTC 2015
On 26/08/15 02:46 PM, Streeter, Michelle N wrote:
> I have a two node cluster. Both nodes are virtual and have five shared
> drives attached via sas controller. For some reason, the cluster shows
> both nodes have half the drives started on them. Not sure if this is
> called split brain or not. It definitely looks load balancing. But I
> did not set up load balancing. On my client, I only see the data for
> the shares on the active cluster node. But they should all be on the
> active cluster node. Any suggestions as to why this is happening? Is
> there a setting so that everything works on only one node at a time?
Can you explain what you mean by "shared drives"? Are these iSCSI LUNs
or direct connections to either port on SAS drives?
A split-brain is when either node things the other is dead and is
operating without coordinating with the peer. It is a disasterous
situation with shared storage and it is what fencing (stonith) prevents,
which you don't have configured.
If you are using KVM, use fence_virsh or fence_virt. If you're using
vmware, use fence_vmware. Please make this a priority before solving
your storage issue.
> pcs cluster status:
>
> Cluster name: CNAS
>
> Last updated: Wed Aug 26 13:35:47 2015
>
> Last change: Wed Aug 26 13:28:55 2015
>
> Stack: classic openais (with plugin)
>
> Current DC: nas02 - partition with quorum
>
> Version: 1.1.11-97629de
>
> 2 Nodes configured, 2 expected votes
>
> 11 Resources configured
>
>
>
>
>
> Online: [ nas01 nas02 ]
>
>
>
> Full list of resources:
>
>
>
> NAS (ocf::heartbeat:IPaddr2): Started nas01
>
> Resource Group: datag
>
> datashare (ocf::heartbeat:Filesystem): Started nas02
>
> dataserver (ocf::heartbeat:nfsserver): Started nas02
>
> Resource Group: oomtlg
>
> oomtlshare (ocf::heartbeat:Filesystem): Started nas01
>
> oomtlserver (ocf::heartbeat:nfsserver): Started nas01
>
> Resource Group: oomtrg
>
> oomtrshare (ocf::heartbeat:Filesystem): Started nas02
>
> oomtrserver (ocf::heartbeat:nfsserver): Started as02
>
> Resource Group: oomblg
>
> oomblshare (ocf::heartbeat:Filesystem): Started nas01
>
> oomblserver (ocf::heartbeat:nfsserver): Started nas01
>
> Resource Group: oombrg
>
> oombrshare (ocf::heartbeat:Filesystem): Started nas02
>
> oombrserver (ocf::heartbeat:nfsserver): Started nas02
>
>
>
> pcs config show:
>
> Cluster Name: CNAS
>
> Corosync Nodes:
>
> nas01 nas02
>
> Pacemaker Nodes:
>
> nas01 nas02
>
>
>
> Resources:
>
> Resource: NAS (class=ocf provider=heartbeat type=IPaddr2)
>
> Attributes: ip=192.168.56.110 cidr_netmask=24
>
> Operations: start interval=0s timeout=20s (NAS-start-timeout-20s)
>
> stop interval=0s timeout=20s (NAS-stop-timeout-20s)
>
> monitor interval=10s timeout=20s (NAS-monitor-interval-10s)
>
> Group: datag
>
> Resource: datashare (class=ocf provider=heartbeat type=Filesystem)
>
> Attributes: device=/dev/sdb1 directory=/data fstype=ext4
>
> Operations: start interval=0s timeout=60 (datashare-start-timeout-60)
>
> stop interval=0s timeout=60 (datashare-stop-timeout-60)
>
> monitor interval=20 timeout=40
> (datashare-monitor-interval-20)
>
> Resource: dataserver (class=ocf provider=heartbeat type=nfsserver)
>
> Attributes: nfs_shared_infodir=/data/nfsinfo nfs_no_notify=true
>
> Operations: start interval=0s timeout=40 (dataserver-start-timeout-40)
>
> stop interval=0s timeout=20s (dataserver-stop-timeout-20s)
>
> monitor interval=10 timeout=20s
> (dataserver-monitor-interval-10)
>
> Group: oomtlg
>
> Resource: oomtlshare (class=ocf provider=heartbeat type=Filesystem)
>
> Attributes: device=/dev/sdc1 directory=/oomtl fstype=ext4
>
> Operations: start interval=0s timeout=60 (oomtlshare-start-timeout-60)
>
> stop interval=0s timeout=60 (oomtlshare-stop-timeout-60)
>
> monitor interval=20 timeout=40
> (oomtlshare-monitor-interval-20)
>
> Resource: oomtlserver (class=ocf provider=heartbeat type=nfsserver)
>
> Attributes: nfs_shared_infodir=/oomtl/nfsinfo nfs_no_notify=true
>
> Operations: start interval=0s timeout=40 (oomtlserver-start-timeout-40)
>
> stop interval=0s timeout=20s (oomtlserver-stop-timeout-20s)
>
> monitor interval=10 timeout=20s
> (oomtlserver-monitor-interval-10)
>
> Group: oomtrg
>
> Resource: oomtrshare (class=ocf provider=heartbeat type=Filesystem)
>
> Attributes: device=/dev/sdd1 directory=/oomtr fstype=ext4
>
> Operations: start interval=0s timeout=60 (oomtrshare-start-timeout-60)
>
> stop interval=0s timeout=60 (oomtrshare-stop-timeout-60)
>
> monitor interval=20 timeout=40
> (oomtrshare-monitor-interval-20)
>
> Resource: oomtrserver (class=ocf provider=heartbeat type=nfsserver)
>
> Attributes: nfs_shared_infodir=/oomtr/nfsinfo nfs_no_notify=true
>
> Operations: start interval=0s timeout=40 (oomtrserver-start-timeout-40)
>
> stop interval=0s timeout=20s (oomtrserver-stop-timeout-20s)
>
> monitor interval=10 timeout=20s
> (oomtrserver-monitor-interval-10)
>
> Group: oomblg
>
> Resource: oomblshare (class=ocf provider=heartbeat type=Filesystem)
>
> Attributes: device=/dev/sde1 directory=/oombl fstype=ext4
>
> Operations: start interval=0s timeout=60 (oomblshare-start-timeout-60)
>
> stop interval=0s timeout=60 (oomblshare-stop-timeout-60)
>
> monitor interval=20 timeout=40
> (oomblshare-monitor-interval-20)
>
> Resource: oomblserver (class=ocf provider=heartbeat type=nfsserver)
>
> Attributes: nfs_shared_infodir=/oombl/nfsinfo nfs_no_notify=true
>
> Operations: start interval=0s timeout=40 (oomblserver-start-timeout-40)
>
> stop interval=0s timeout=20s (oomblserver-stop-timeout-20s)
>
> monitor interval=10 timeout=20s
> (oomblserver-monitor-interval-10)
>
> Group: oombrg
>
> Resource: oombrshare (class=ocf provider=heartbeat type=Filesystem)
>
> Attributes: device=/dev/sdf1 directory=/oombr fstype=ext4
>
> Operations: start interval=0s timeout=60 (oombrshare-start-timeout-60)
>
> stop interval=0s timeout=60 (oombrshare-stop-timeout-60)
>
> monitor interval=20 timeout=40
> (oombrshare-monitor-interval-20)
>
> Resource: oombrserver (class=ocf provider=heartbeat type=nfsserver)
>
> Attributes: nfs_shared_infodir=/oombr/nfsinfo nfs_no_notify=true
>
> Operations: start interval=0s timeout=40 (oombrserver-start-timeout-40)
>
> stop interval=0s timeout=20s (oombrserver-stop-timeout-20s)
>
> monitor interval=10 timeout=20s
> (oombrserver-monitor-interval-10)
>
>
>
> Stonith Devices:
>
> Fencing Levels:
>
>
>
> Location Constraints:
>
> Ordering Constraints:
>
> Colocation Constraints:
>
>
>
> Cluster Properties:
>
> cluster-infrastructure: classic openais (with plugin)
>
> dc-version: 1.1.11-97629de
>
> expected-quorum-votes: 2
>
> no-quorum-policy: ignore
>
> stonith-enabled: false
>
>
>
> Michelle Streeter
>
> ASC2 MCS – SDE/ACL/SDL/EDL OKC Software Engineer
> The Boeing Company
>
>
>
>
>
> _______________________________________________
> Users mailing list: Users at clusterlabs.org
> http://clusterlabs.org/mailman/listinfo/users
>
> Project Home: http://www.clusterlabs.org
> Getting started: http://www.clusterlabs.org/doc/Cluster_from_Scratch.pdf
> Bugs: http://bugs.clusterlabs.org
>
--
Digimer
Papers and Projects: https://alteeve.ca/w/
What if the cure for cancer is trapped in the mind of a person without
access to education?
More information about the Users
mailing list