[Pacemaker] hangs pending

Andrew Beekhof andrew at beekhof.net
Tue Feb 18 21:43:28 EST 2014


On 18 Feb 2014, at 11:05 pm, Andrey Groshev <greenx at yandex.ru> wrote:

> Hi all, and Andrew!
> 
> Today was a good day: I killed a lot, and was shot at a lot in return.
> Overall I am happy (almost as happy as an elephant)   :)
> Apart from the resources, eight processes on the node matter to me: corosync, pacemakerd, cib, stonithd, lrmd, attrd, pengine, crmd.
> I killed them with different signals (4, 6, 11 and even 9).
> The behavior does not depend on the signal number, which is good.
> If STONITH sends a reboot to the node, it reboots and rejoins the cluster, which is also good.
> But the behavior differs depending on which daemon is killed.
> 
> They fall into four groups:
> 1. corosync, cib - STONITH works 100% of the time.
> Killing them with any signal triggers STONITH and a reboot.
> 
> 2. lrmd, crmd - STONITH behaves strangely.
> Sometimes STONITH is called, with the corresponding reaction.
> Sometimes the daemon restarts and the resources restart, with a long delay for MS:pgsql.
> Once, after crmd restarted, pgsql did not restart.
> 
> 3. stonithd, attrd, pengine - STONITH is not needed.
> These daemons simply restart, and the resources stay running.
> 
> 4. pacemakerd - nothing happens.
> After that I can kill any process from the third group, and it does not restart.
> Generally, don't touch corosync, cib and maybe lrmd, crmd.
> 
> What do you think about this?
> The main question of this thread is resolved.
> But this varied behavior is another big problem.
> 
> I forgot the logs: http://send2me.ru/pcmk-Tue-18-Feb-2014.tar.bz2

Which of the various conditions above do the logs cover?
