[Pacemaker] Cannot start VirtualDomain resource after restart

Dejan Muhamedagic dejanmm at fastmail.fm
Thu Jun 28 07:38:38 EDT 2012


Hi,

On Thu, Jun 21, 2012 at 09:32:22AM +0200, Kadlecsik József wrote:
> On Thu, 21 Jun 2012, Andrew Beekhof wrote:
> 
> > Ah, I see you subsequently got some good advice from Phil.  Glad to hear 
> > your problem is resolved. In general, when you run into "why isn't 
> > resource X starting?" or "why is it starting in the wrong place?", it's 
> > always a good idea to include a dump of the current state of the cluster 
> > (the output of cibadmin -Ql).
> 
> Yes, but please note: in spite of plenty of capacity on every node, 
> utilization prevented one resource from being (re)started. Originally the 
> stonith resources had no memory value set, but even after setting those to 
> an explicit zero, the resource still wasn't started. Only disabling the 
> placement strategy helped. Only one type of capacity (memory) is defined.

I didn't follow the thread closely, but if you think you've found a
bug in utilization-based placement, please file a bug report.

Thanks,

Dejan

> At the moment utilization is disabled and "crm_simulate -U -L" produces:
> 
> Current cluster status:
> Online: [ atlas0 atlas1 atlas2 atlas3 atlas4 atlas5 atlas6 ]
> 
>  kerberos       (ocf::heartbeat:VirtualDomain): Started atlas0
>  stonith-atlas3 (stonith:ipmilan):      Started atlas4
>  stonith-atlas1 (stonith:ipmilan):      Started atlas2
>  stonith-atlas2 (stonith:ipmilan):      Started atlas1
>  stonith-atlas0 (stonith:ipmilan):      Started atlas3
>  stonith-atlas4 (stonith:ipmilan):      Started atlas5
>  mailman        (ocf::heartbeat:VirtualDomain): Started atlas6
>  indico (ocf::heartbeat:VirtualDomain): Started atlas0
>  papi   (ocf::heartbeat:VirtualDomain): Started atlas1
>  wwwd   (ocf::heartbeat:VirtualDomain): Started atlas2
>  webauth        (ocf::heartbeat:VirtualDomain): Started atlas3
>  caladan        (ocf::heartbeat:VirtualDomain): Started atlas4
>  radius (ocf::heartbeat:VirtualDomain): Started atlas5
>  mail0  (ocf::heartbeat:VirtualDomain): Started atlas6
>  stonith-atlas5 (stonith:apcmastersnmp):        Started atlas4
>  stonith-atlas6 (stonith:apcmastersnmp):        Started atlas0
>  w0     (ocf::heartbeat:VirtualDomain): Started atlas2
>  lx0    (ocf::heartbeat:VirtualDomain): Started atlas1
> 
> Utilization information:
> Original: atlas0 capacity: memory=24576
> Original: atlas1 capacity: memory=24576
> Original: atlas2 capacity: memory=24576
> Original: atlas3 capacity: memory=24576
> Original: atlas4 capacity: memory=24576
> Original: atlas5 capacity: memory=20480
> Original: atlas6 capacity: memory=20480
> calculate_utilization: kerberos utilization on atlas0: memory=4608
> calculate_utilization: stonith-atlas3 utilization on atlas4: memory=0
> calculate_utilization: stonith-atlas1 utilization on atlas2: memory=0
> calculate_utilization: stonith-atlas2 utilization on atlas1: memory=0
> calculate_utilization: stonith-atlas0 utilization on atlas3: memory=0
> calculate_utilization: stonith-atlas4 utilization on atlas5: memory=0
> calculate_utilization: mailman utilization on atlas6: memory=5120
> calculate_utilization: indico utilization on atlas0: memory=5120
> calculate_utilization: papi utilization on atlas1: memory=6144
> calculate_utilization: wwwd utilization on atlas2: memory=5120
> calculate_utilization: webauth utilization on atlas3: memory=4608
> calculate_utilization: caladan utilization on atlas4: memory=4608
> calculate_utilization: radius utilization on atlas5: memory=4608
> calculate_utilization: mail0 utilization on atlas6: memory=4608
> calculate_utilization: stonith-atlas5 utilization on atlas4: memory=0
> calculate_utilization: stonith-atlas6 utilization on atlas0: memory=0
> calculate_utilization: w0 utilization on atlas2: memory=4608
> calculate_utilization: lx0 utilization on atlas1: memory=4608
> Remaining: atlas0 capacity: memory=14848
> Remaining: atlas1 capacity: memory=13824
> Remaining: atlas2 capacity: memory=14848
> Remaining: atlas3 capacity: memory=19968
> Remaining: atlas4 capacity: memory=19968
> Remaining: atlas5 capacity: memory=15872
> Remaining: atlas6 capacity: memory=10752
> 
> Transition Summary:
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   kerberos  (Started atlas0)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   stonith-atlas3     (Started atlas4)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   stonith-atlas1     (Started atlas2)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   stonith-atlas2     (Started atlas1)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   stonith-atlas0     (Started atlas3)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   stonith-atlas4     (Started atlas5)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   mailman   (Started atlas6)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   indico    (Started atlas0)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   papi      (Started atlas1)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   wwwd      (Started atlas2)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   webauth   (Started atlas3)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   caladan   (Started atlas4)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   radius    (Started atlas5)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   mail0     (Started atlas6)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   stonith-atlas5     (Started atlas4)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   stonith-atlas6     (Started atlas0)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   w0 (Started atlas2)
> crm_simulate[36390]: 2012/06/21_09:12:21 notice: LogActions: Leave   lx0       (Started atlas1)
> 
> Best regards,
> Jozsef
> --
> E-mail : kadlecsik.jozsef at wigner.mta.hu
> PGP key: http://www.kfki.hu/~kadlec/pgp_public_key.txt
> Address: Wigner Research Centre for Physics, Hungarian Academy of Sciences
>          H-1525 Budapest 114, POB. 49, Hungary
> 
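
The "Remaining" figures in the quoted crm_simulate output can be cross-checked
by hand. Below is a minimal sketch, assuming the remaining capacity of a node
is simply its configured capacity minus the memory of every resource placed on
it. This is not Pacemaker's own code; the function name remaining_capacity is
hypothetical, and the numbers are copied from the output above. The stonith
resources report memory=0 and are left out.

```python
# Node capacities, copied from the "Original:" lines of the quoted output.
capacity = {
    "atlas0": 24576, "atlas1": 24576, "atlas2": 24576, "atlas3": 24576,
    "atlas4": 24576, "atlas5": 20480, "atlas6": 20480,
}

# (resource, node, memory) from the "calculate_utilization:" lines.
# Stonith resources use memory=0 and are omitted.
placements = [
    ("kerberos", "atlas0", 4608),
    ("mailman", "atlas6", 5120),
    ("indico", "atlas0", 5120),
    ("papi", "atlas1", 6144),
    ("wwwd", "atlas2", 5120),
    ("webauth", "atlas3", 4608),
    ("caladan", "atlas4", 4608),
    ("radius", "atlas5", 4608),
    ("mail0", "atlas6", 4608),
    ("w0", "atlas2", 4608),
    ("lx0", "atlas1", 4608),
]


def remaining_capacity(capacity, placements):
    """Subtract each resource's memory from the node it runs on (hypothetical helper)."""
    remaining = dict(capacity)
    for _resource, node, memory in placements:
        remaining[node] -= memory
    return remaining


remaining = remaining_capacity(capacity, placements)
for node in sorted(remaining):
    print(f"Remaining: {node} capacity: memory={remaining[node]}")

# Largest single resource requirement versus the tightest node.
largest = max(memory for _resource, _node, memory in placements)
tightest = min(remaining.values())
print(f"largest resource: {largest}, least free node: {tightest}")
```

Under that assumption the sketch reproduces the "Remaining:" lines of the
output, and even the fullest node (atlas6) has more free memory than the
largest single resource needs, which is consistent with the observation that
every node had plenty of capacity.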



