[Pacemaker] hangs pending

Andrew Beekhof andrew at beekhof.net
Wed Feb 19 00:45:54 EST 2014


On 19 Feb 2014, at 4:18 pm, Andrey Groshev <greenx at yandex.ru> wrote:

> 
> 
> 19.02.2014, 09:08, "Andrew Beekhof" <andrew at beekhof.net>:
>> On 19 Feb 2014, at 4:00 pm, Andrey Groshev <greenx at yandex.ru> wrote:
>> 
>>>  19.02.2014, 06:48, "Andrew Beekhof" <andrew at beekhof.net>:
>>>>  On 18 Feb 2014, at 11:05 pm, Andrey Groshev <greenx at yandex.ru> wrote:
>>>>>   Hi, ALL and Andrew!
>>>>> 
>>>>>   Today is a good day - I killed a lot, and a lot of shooting at me.
>>>>>   In general - I am happy (almost like an elephant)   :)
>>>>>   Except resources on the node are important to me eight processes: corosync,pacemakerd,cib,stonithd,lrmd,attrd,pengine,crmd.
>>>>>   I killed them with different signals (4,6,11 and even 9).
>>>>>   Behavior does not depend of number signal - it's good.
>>>>>   If STONITH send reboot to the node - it rebooted and rejoined the cluster - too it's good.
>>>>>   But the behavior is different from killing various demons.
>>>>> 
>>>>>   Turned four groups:
>>>>>   1. corosync,cib - STONITH work 100%.
>>>>>   Kill via any signals - call STONITH and reboot.
>>>>> 
>>>>>   2. lrmd,crmd - strange behavior STONITH.
>>>>>   Sometimes called STONITH - and the corresponding reaction.
>>>>>   Sometimes restart daemon and restart resources with large delay MS:pgsql.
>>>>>   One time after restart crmd - pgsql don't restart.
>>>>> 
>>>>>   3. stonithd,attrd,pengine - not need STONITH
>>>>>   This daemons simple restart, resources - stay running.
>>>>> 
>>>>>   4. pacemakerd - nothing happens.
>>>>>   And then I can kill any process of the third group. They do not restart.
>>>>>   Generaly don't touch corosync,cib and maybe lrmd,crmd.
>>>>> 
>>>>>   What do you think about this?
>>>>>   The main question of this topic - we decided.
>>>>>   But this varied behavior - another big problem.
>>>>> 
>>>>>   Forgоt logs http://send2me.ru/pcmk-Tue-18-Feb-2014.tar.bz2
>>>>  Which of the various conditions above do the logs cover?
>>>  All various in day.
>> 
>> Are you trying to torture me?
>> Can you give me a rough idea what happened when?
> 
> No, there is 8 processes on the 4th signal and repeats the experiments with unknown outcome :) 
> Easier to conduct new experiments and individual new logs .
> Which variant is more interesting?
> 

The long delay in restarting pgsql.
Everything else seems correct.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: signature.asc
Type: application/pgp-signature
Size: 841 bytes
Desc: Message signed with OpenPGP using GPGMail
URL: <https://lists.clusterlabs.org/pipermail/pacemaker/attachments/20140219/9b8889b9/attachment-0003.sig>


More information about the Pacemaker mailing list