[Pacemaker] Pacemaker fencing and DLM/cLVM

Christine Caulfield ccaulfie at redhat.com
Tue Nov 25 16:47:49 UTC 2014


On 25/11/14 10:45, Daniel Dehennin wrote:
> Daniel Dehennin <daniel.dehennin at baby-gnu.org> writes:
>
>> I'm using Ubuntu 14.04:
>>
>> - corosync 2.3.3-1ubuntu1
>> - pacemaker 1.1.10+git20130802-1ubuntu2.1
>>
>> I thought everything was integrated in such configuration.
>
> Here are some more informations:
>
> - the pacemaker configuration
> - the log of the DC nebula1 with marks for each step
> - the log of the nebula2 with marks for each step
> - the log of the nebula3 with marks for each step
> - the output of “dlm_tool ls” and dlm_tool status” before/during/after
>    nebula2 fencing
>
> The steps are:
>
> 1. All nodes up, cluster down
> 2. Start corosync on all nodes
> 3. Start pacemaker on all nodes
> 4. Start resource ONE-Storage-Clone (dlm, cLVM, VG, GFS2)
> 5. Crash nebula2
> 6. Start corosync on nebula2 after reboot
> 7. Start pacemaker on nebula2 after reboot
>
> Does someone understand why DLM did not get the ACK of the fencing
> automatically from stonith?
>
> Why ONE-Storage-Clone does not manage to start on nebula2 after fencing.
>

It seems to me that fencing is failing for some reason, though I can't 
tell from the logs exactly why, so you might have to investgate your 
setup for IPMI to see just what is happening (I'm no IPMI expert, sorry).

The logs files tell me this though:

Nov 25 10:56:32 nebula3 dlm_controld[6465]: 1035 fence request 
1084811079 pid 7358 nodedown time 1416909392 fence_all dlm_stonith
Nov 25 10:56:32 nebula3 dlm_controld[6465]: 1035 fence result 1084811079 
pid 7358 result 1 exit status
Nov 25 10:56:32 nebula3 dlm_controld[6465]: 1035 fence status 1084811079 
receive 1 from 1084811080 walltime 1416909392 local 1035
Nov 25 10:56:32 nebula3 dlm_controld[6465]: 1035 fence request 
1084811079 no actor


Showing a status code '1' from dlm_stonith - the result should be 0 if 
fencing completed succesfully.


Chrissie




More information about the Pacemaker mailing list