[Pacemaker] 2 sbd devices and stonith-ng is showing (1 active devices)
Lars Marowsky-Bree
lmb at suse.com
Thu Mar 15 21:08:48 UTC 2012
On 2012-03-15T15:39:23, "Janec, Jozef" <jozef.janec at hp.com> wrote:
> We have configured pacemaker on HAE from novell:
>
> cat /etc/sysconfig/sbd
> SBD_DEVICE="/dev/mapper/SHARED1_part1;/dev/mapper/SHARED2_part1"
> SBD_OPTS="-W"
>
> I'm running 2 instances of watcher
>
> root 9157 1 0 11:00 pts/0 00:00:00 sbd: inquisitor
> root 9158 9157 0 11:00 pts/0 00:00:00 sbd: watcher: /dev/mapper/SHARED1_part1 - slot: 0
> root 9159 9157 0 11:00 pts/0 00:00:00 sbd: watcher: /dev/mapper/SHARED2_part1 - slot: 1
That looks fine, but did you read
http://www.linux-ha.org/wiki/SBD_Fencing about the limitations of using
2 devices?
> I have running one resource per node
>
> Online: [ b300ple0 b400ple0 ]
>
> sbd_fense_b400 (stonith:external/sbd): Started b400ple0
> sbd_fense_b300 (stonith:external/sbd): Started b300ple0
Why that? Did you read http://www.linux-ha.org/wiki/SBD_Fencing?
You only have to have one external/sbd per cluster. A single primitive
is sufficient. No need to run several, nor to clone them.
> Mar 17 11:03:51 b400ple0 stonith-ng: [9467]: info: stonith_device_register: Added 'sbd_fense_b400' to the device list (1 active devices)
Yes, because from the point of view of the stonith-ng, there is only one
"sbd" device, though that internally uses two storage devices - which
stonith-ng doesn't know about.
> I got:
>
> 15 11:15:33 b300ple0 stonith-ng: [8546]: debug: exec_child_done: Got 60 more bytes: Performing: stonith -t external/sbd -S failed: 0.05859375
> Mar 15 11:15:33 b300ple0 stonith-ng: [8546]: notice: log_operation: Operation 'monitor' [15803] for device 'sbd_fense_b300' returned: 1
> Mar 15 11:15:33 b300ple0 stonith-ng: [8546]: debug: log_operation: sbd_fense_b300 output: Performing: stonith -t external/sbd -S
> Mar 15 11:15:33 b300ple0 stonith-ng: [8546]: debug: log_operation: sbd_fense_b300 output: failed: 0.05859375
> Mar 15 11:15:33 b300ple0 lrm-stonith: [15802]: debug: execra: sbd_fense_b300_monitor returned 1
> Mar 15 11:15:33 b300ple0 stonith-ng: [8546]: debug: log_operation: sbd_fense_b300 output: (total 60 bytes)
The agent itself should also have logged something.
Have you, by chance, configured one external/sbd instance per device?
That would be wrong; you need to run one external/sbd instance per
cluster for all devices.
Did you read http://www.linux-ha.org/wiki/SBD_Fencing?
Regards,
Lars
--
Architect Storage/HA
SUSE LINUX Products GmbH, GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer, HRB 21284 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
More information about the Pacemaker
mailing list