[ClusterLabs] Error creating a fence mechanism

Douglas Restrepo lucas_max2002 at yahoo.es
Mon Mar 21 21:17:48 CET 2016


Hi guys, 

Im trying to create a new cluster using pacemaker but Im having a problem with the fence mechanism.Until de moment I have perform the following tasks:
+ I have installed two different machines with centos 7 (infrastructure mode).+ Each machine have been configured with two network cards (one for management and a second one for service)
+ Corosync pacemaker pcs dlm dlm-lib fence-agents-all lvm2-cluster have been installed 
+ The corosync.conf file is configured.+ The cluster have been initialized using pcs cluster start --all 
+ At executing the pcs status command, I get the following output
Cluster name: mycluster-ha
WARNING: no stonith devices and stonith-enabled is not false
Last updated: Sat Mar 19 10:01:43 2016          Last change: Fri Mar 18 03:36:23 2016 by root via cibadmin on node-01
Stack: corosync
Current DC: node-01 (version 1.1.13-10.el7_2.2-44eb2dd) - partition with quorum
2 nodes and 0 resources configured

Online: [ node-01 node-02 ]

Full list of resources:

PCSD Status:
  node-01: Online
  node-02: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled


The problem what Im having is that I don't have a physical PDU, so I have to simulate one.For this purpose I have created a fence script in python  and it has been added to the /usr/sbin folder (with its respective permissions), where are located the all fence scripts.
Now, when a execute the command pcs stonith list I get the outputError: no metadata for /usr/sbin/fence_manual
Any way, to add the fence mechanism I executed the command 

#pcs cluster cib stonith_cfg
#pcs -f stonith_cfg stonith create fence_node_01 fence_manual ipadd="node-01" action="off" passwd="passwd" login="admin" pcmk_host_list="node-01" op monitor interval=60s

but after this I get the outputError: Unable to create resource stonith fence_manual, it is not installed on this system (use --force to override)
By using the --option, the resource is added, but its status is always stopped. 
By executing again the command pcs status give me the following output
Cluster name: mycluster-ha
Last updated: Thu Mar 17 06:38:52 2016          Last change: Thu Mar 17 06:26:17 2016 by root via cibadmin on node-01
Stack: corosync
Current DC: node-01 (version 1.1.13-10.el7_2.2-44eb2dd) - partition with quorum
2 nodes and 1 resource configured

Online: [ node-01 node-02 ]

Full list of resources:

 fence_node-01  (stonith:fence_manual): Stopped

Failed Actions:
* fence_node-01_start_0 on node-02 'unknown error' (1): call=6, status=Error, exitreason='none',
    last-rc-change='Thu Mar 17 23:11:30 2016', queued=0ms, exec=2008ms
* fence_node-01_start_0 on node-01 'unknown error' (1): call=6, status=Error, exitreason='none',
    last-rc-change='Thu Mar 17 06:30:01 2016', queued=0ms, exec=2007ms


PCSD Status:
  node-01: Online
  node-02: Online

Daemon Status:
  corosync: active/disabled
  pacemaker: active/disabled
  pcsd: active/enabled
So to remove this error and go back to the previous status I executed the command 

pcs resource cleanup fence_node_01
but now, when I execute the command 
pcs stonith list
Im getting the error 
Error: unable to locate command: /usr/sbin/fence_manual

Can someone guide me with this process?I don´t know why im getting this error configuring the fence mechanism.
How can I add a new fence mechanism to the system, and it to be detected by the pcs?

Thanks in advance


PD: SELinux is disabled (temporally) to avoid any problem in communications with the nodes.


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://clusterlabs.org/pipermail/users/attachments/20160321/393e6cd2/attachment.html>


More information about the Users mailing list