[Pacemaker] CRMd exits because of internal error

Andrew Beekhof andrew at beekhof.net
Wed Jul 24 22:42:25 EDT 2013


On 19/07/2013, at 12:20 AM, K Mehta <kiranmehta1981 at gmail.com> wrote:

> Hi,
> 
> I have a two-node cluster with a few resources configured on it. On vqa12, CRMd dies due to some internal error. It is not clear why CRMd decides to die on May 05 at 22:14:50 on system vqa12.

It's because of:

May 05 22:14:50 [3518] vqa12       crmd:    error: cib_quorum_update_complete:  Quorum update 135 failed

Could the machine have been overloaded?  That's usually the reason.
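
For anyone who wants to find that triggering line in a long log without reading it all, here is a minimal sketch in Python. It is not part of Pacemaker; the regular expression is only an assumption based on the line layout visible in the logs quoted below, and the names LOG_LINE, parse and first_error are made up for illustration. It returns the first error-level line logged by a given daemon, which in this case is the quorum update failure rather than the later "Could not recover from internal error".

```python
import re

# Assumed layout, inferred from the quoted logs only:
#   "May 05 22:14:50 [3518] vqa12       crmd:    error: func_name:  message"
# An optional leading "> " (mail quoting) is tolerated.
LOG_LINE = re.compile(
    r"^(?:>\s*)?"
    r"(?P<ts>\w{3} \d{2} \d{2}:\d{2}:\d{2}) "
    r"\[(?P<pid>\d+)\] (?P<host>\S+)\s+"
    r"(?P<daemon>[\w-]+):\s+(?P<level>\w+):\s+"
    r"(?P<func>\w+):\s+(?P<msg>.*)$"
)


def parse(line):
    """Return a dict of fields, or None if the line has a different layout
    (for example the 'corosync [pcmk  ]' lines, which carry no pid field)."""
    m = LOG_LINE.match(line.strip())
    return m.groupdict() if m else None


def first_error(lines, daemon="crmd"):
    """Return the first error-level record logged by `daemon`, or None."""
    for line in lines:
        rec = parse(line)
        if rec and rec["daemon"] == daemon and rec["level"] == "error":
            return rec
    return None


# Three lines copied from the vqa12 log quoted in this thread.
SAMPLE = [
    "> May 05 22:12:56 [3518] vqa12       crmd:   notice: run_graph:   Transition 7 (Complete=8, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-99.bz2): Complete",
    "> May 05 22:14:50 [3518] vqa12       crmd:    error: cib_quorum_update_complete:  Quorum update 135 failed",
    "> May 05 22:14:50 [3518] vqa12       crmd:    error: do_exit:     Could not recover from internal error",
]

if __name__ == "__main__":
    rec = first_error(SAMPLE)
    if rec:
        print(rec["ts"], rec["func"], rec["msg"])
```

To run it against a real log, pass the open file instead of SAMPLE, e.g. first_error(open(path)), where path is wherever your corosync/pacemaker log is written on your system.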

> ================
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_exit:     Performing A_EXIT_0 - gracefully exiting the CRMd
> ================
> 
> 
> Corosync logs vqa12
> ==============
> 
> May 05 22:12:53 [3517] vqa12    pengine:     info: determine_online_status:     Node vqa12 is online
> May 05 22:12:53 [3517] vqa12    pengine:     info: determine_online_status:     Node vqa11 is online
> May 05 22:12:53 [3517] vqa12    pengine:     info: find_anonymous_clone:        Internally renamed vha-94b33532-15ba-4923-a920-ab9268ccd856 on vqa12 to vha-94b33532-15ba-4923-a920-ab9268ccd856:0
> May 05 22:12:53 [3517] vqa12    pengine:     info: native_print:        vgc_virtual_ip  (ocf::heartbeat:IPaddr2):       Started vqa12
> May 05 22:12:53 [3517] vqa12    pengine:     info: clone_print:          Master/Slave Set: ms-94b33532-15ba-4923-a920-ab9268ccd856 [vha-94b33532-15ba-4923-a920-ab9268ccd856]
> May 05 22:12:53 [3517] vqa12    pengine:     info: short_print:              Masters: [ vqa12 ]
> May 05 22:12:53 [3517] vqa12    pengine:     info: short_print:              Stopped: [ vha-94b33532-15ba-4923-a920-ab9268ccd856:1 ]
> May 05 22:12:53 [3517] vqa12    pengine:     info: master_color:        Promoting vha-94b33532-15ba-4923-a920-ab9268ccd856:0 (Master vqa12)
> May 05 22:12:53 [3517] vqa12    pengine:     info: master_color:        ms-94b33532-15ba-4923-a920-ab9268ccd856: Promoted 1 instances of a possible 1 to master
> May 05 22:12:53 [3517] vqa12    pengine:     info: RecurringOp:          Start recurring monitor (31s) for vha-94b33532-15ba-4923-a920-ab9268ccd856:1 on vqa11
> May 05 22:12:53 [3517] vqa12    pengine:     info: RecurringOp:          Start recurring monitor (31s) for vha-94b33532-15ba-4923-a920-ab9268ccd856:1 on vqa11
> May 05 22:12:53 [3517] vqa12    pengine:     info: LogActions:  Leave   vgc_virtual_ip  (Started vqa12)
> May 05 22:12:53 [3517] vqa12    pengine:     info: LogActions:  Leave   vha-94b33532-15ba-4923-a920-ab9268ccd856:0      (Master vqa12)
> May 05 22:12:53 [3517] vqa12    pengine:   notice: LogActions:  Start   vha-94b33532-15ba-4923-a920-ab9268ccd856:1      (vqa11)
> May 05 22:12:53 [3517] vqa12    pengine:   notice: process_pe_message:  Calculated Transition 7: /var/lib/pacemaker/pengine/pe-input-99.bz2
> May 05 22:12:53 [3518] vqa12       crmd:     info: do_state_transition:         State transition S_POLICY_ENGINE -> S_TRANSITION_ENGINE [ input=I_PE_SUCCESS cause=C_IPC_MESSAGE origin=handle_response ]
> May 05 22:12:53 [3518] vqa12       crmd:     info: do_te_invoke:        Processing graph 7 (ref=pe_calc-dc-1367817173-63) derived from /var/lib/pacemaker/pengine/pe-input-99.bz2
> May 05 22:12:53 [3518] vqa12       crmd:     info: te_rsc_command:      Initiating action 5: monitor vgc_virtual_ip_monitor_0 on vqa11
> May 05 22:12:53 [3518] vqa12       crmd:     info: te_rsc_command:      Initiating action 6: monitor vha-94b33532-15ba-4923-a920-ab9268ccd856:1_monitor_0 on vqa11
> May 05 22:12:56 [3518] vqa12       crmd:     info: te_rsc_command:      Initiating action 4: probe_complete probe_complete on vqa11 - no waiting
> May 05 22:12:56 [3518] vqa12       crmd:     info: te_rsc_command:      Initiating action 14: start vha-94b33532-15ba-4923-a920-ab9268ccd856:1_start_0 on vqa11
> May 05 22:12:56 [3518] vqa12       crmd:     info: te_rsc_command:      Initiating action 15: monitor vha-94b33532-15ba-4923-a920-ab9268ccd856:1_monitor_31000 on vqa11
> May 05 22:12:56 [3518] vqa12       crmd:   notice: run_graph:   Transition 7 (Complete=8, Pending=0, Fired=0, Skipped=0, Incomplete=0, Source=/var/lib/pacemaker/pengine/pe-input-99.bz2): Complete
> May 05 22:12:56 [3518] vqa12       crmd:   notice: do_state_transition:         State transition S_TRANSITION_ENGINE -> S_IDLE [ input=I_TE_SUCCESS cause=C_FSA_INTERNAL origin=notify_crmd ]
> May 05 22:14:50 [3518] vqa12       crmd:    error: cib_quorum_update_complete:  Quorum update 135 failed
> May 05 22:14:50 [3518] vqa12       crmd:    error: do_log:      FSA: Input I_ERROR from cib_quorum_update_complete() received in state S_IDLE
> May 05 22:14:50 [3518] vqa12       crmd:   notice: do_state_transition:         State transition S_IDLE -> S_RECOVERY [ input=I_ERROR cause=C_FSA_INTERNAL origin=cib_quorum_update_complete ]
> May 05 22:14:50 [3518] vqa12       crmd:    error: do_recover:  Action A_RECOVER (0000000001000000) not supported
> May 05 22:14:50 [3518] vqa12       crmd:  warning: do_election_vote:    Not voting in election, we're in state S_RECOVERY
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_dc_release:       DC role released
> May 05 22:14:50 [3518] vqa12       crmd:     info: pe_ipc_destroy:      Connection to the Policy Engine released
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_te_control:       Transitioner is now inactive
> May 05 22:14:50 [3518] vqa12       crmd:    error: do_log:      FSA: Input I_TERMINATE from do_recover() received in state S_RECOVERY
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_state_transition:         State transition S_RECOVERY -> S_TERMINATE [ input=I_TERMINATE cause=C_FSA_INTERNAL orig
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_shutdown:         Disconnecting STONITH...
> May 05 22:14:50 [3518] vqa12       crmd:     info: tengine_stonith_connection_destroy:  Fencing daemon disconnected
> May 05 22:14:50 [3515] vqa12       lrmd:     info: cancel_recurring_action:     Cancelling operation vha-94b33532-15ba-4923-a920-ab9268ccd856_monitor_30000
> May 05 22:14:50 [3518] vqa12       crmd:    error: verify_stopped:      Resource vgc_virtual_ip was active at shutdown.  You may ignore this error if it is unmanaged.
> May 05 22:14:50 [3518] vqa12       crmd:    error: verify_stopped:      Resource vha-94b33532-15ba-4923-a920-ab9268ccd856 was active at shutdown.  You may ignore this error if it is unmanaged.
> May 05 22:14:50 [3518] vqa12       crmd:     info: lrmd_api_disconnect:         Disconnecting from lrmd service
> May 05 22:14:50 [3515] vqa12       lrmd:     info: lrmd_ipc_destroy:    LRMD client disconnecting 0x14bf9f0 - name: crmd id: a7e581bd-c0fe-4d3f-9734-acca62e868a8
> May 05 22:14:50 [3518] vqa12       crmd:     info: lrmd_connection_destroy:     connection destroyed
> May 05 22:14:50 [3518] vqa12       crmd:     info: lrm_connection_destroy:      LRM Connection disconnected
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_lrm_control:      Disconnected from the LRM
> May 05 22:14:50 [3518] vqa12       crmd:     info: crm_cluster_disconnect:      Disconnecting from cluster infrastructure: classic openais (with plugin)
> May 05 22:14:50 [3518] vqa12       crmd:   notice: terminate_cs_connection:     Disconnecting from Corosync
> May 05 22:14:50 [3518] vqa12       crmd:     info: crm_cluster_disconnect:      Disconnected from classic openais (with plugin)
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_ha_control:       Disconnected from the cluster
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_cib_control:      Disconnecting CIB
> May 05 22:14:50 corosync [pcmk  ] info: pcmk_ipc_exit: Client crmd (conn=0x231dd00, async-conn=0x231dd00) left
> May 05 22:14:50 [3513] vqa12        cib:     info: cib_process_readwrite:       We are now in R/O mode
> May 05 22:14:50 [3513] vqa12        cib:  warning: qb_ipcs_event_sendv:         new_event_notification (3513-3518-15): Broken pipe (32)
> May 05 22:14:50 [3513] vqa12        cib:     info: crm_ipcs_send:       Event 480 failed, size=162, to=0x2668ba0[3518], queue=1, retries=0, rc=-32: <cib-reply t="cib" cib_op="cib_slave" cib_callid="156" cib_clientid="4589219a-4c81-4a49-803c-df4cc8037f9a" cib_callopt="
> May 05 22:14:50 [3513] vqa12        cib:  warning: do_local_notify:     A-Sync reply to crmd failed: No message of desired type
> May 05 22:14:50 [3518] vqa12       crmd:     info: crmd_cib_connection_destroy:         Connection to the CIB terminated...
> May 05 22:14:50 [3518] vqa12       crmd:     info: qb_ipcs_us_withdraw:         withdrawing server sockets
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_exit:     Performing A_EXIT_0 - gracefully exiting the CRMd
> May 05 22:14:50 [3518] vqa12       crmd:    error: do_exit:     Could not recover from internal error
> May 05 22:14:50 [3518] vqa12       crmd:     info: do_exit:     [crmd] stopped (2)
> May 05 22:14:50 [3518] vqa12       crmd:     info: crmd_exit:   Dropping I_PENDING: [ state=S_TERMINATE cause=C_FSA_INTERNAL origin=do_election_vote ]
> May 05 22:14:50 [3518] vqa12       crmd:     info: crmd_exit:   Dropping I_RELEASE_SUCCESS: [ state=S_TERMINATE cause=C_FSA_INTERNAL origin=do_dc_release ]
> May 05 22:14:50 [3518] vqa12       crmd:     info: crmd_exit:   Dropping I_TERMINATE: [ state=S_TERMINATE cause=C_FSA_INTERNAL origin=do_stop ]
> May 05 22:14:50 [3518] vqa12       crmd:     info: lrmd_api_disconnect:         Disconnecting from lrmd service
> May 05 22:14:50 [3518] vqa12       crmd:     info: crm_xml_cleanup:     Cleaning up memory from libxml2
> May 05 22:14:50 [3507] vqa12 pacemakerd:    error: pcmk_child_exit:     Child process crmd exited (pid=3518, rc=2)
> May 05 22:14:50 [3514] vqa12 stonith-ng:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 [3513] vqa12        cib:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 [3507] vqa12 pacemakerd:   notice: pcmk_process_exit:   Respawning failed child process: crmd
> May 05 22:14:50 [3507] vqa12 pacemakerd:     info: start_child:         Forked child 7806 for process crmd
> May 05 22:14:50 [3514] vqa12 stonith-ng:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 [3513] vqa12        cib:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 corosync [pcmk  ] WARN: route_ais_message: Sending message to local.crmd failed: ipc delivery failed (rc=-2)
> May 05 22:14:50 [7806] vqa12       crmd:     info: crm_log_init:        Cannot change active directory to /var/lib/pacemaker/cores/hacluster: Permission denied (13)
> May 05 22:14:50 [7806] vqa12       crmd:   notice: main:        CRM Git Version: 394e906
> May 05 22:14:50 [7806] vqa12       crmd:     info: get_cluster_type:    Cluster type is: 'openais'
> 
> 
> 
> Corosync logs vqa11
> ===============
> 
> May 05 22:12:50 [3553] vqa11        cib:     info: cib_server_process_diff:     Requesting re-sync from peer
> May 05 22:12:50 [3553] vqa11        cib:   notice: cib_server_process_diff:     Not applying diff 0.7126.40 -> 0.7126.41 (sync in progress)
> May 05 22:12:50 corosync [CPG   ] chosen downlist: sender r(0) ip(192.168.1.1) ; members(old:1 left:0)
> May 05 22:12:50 corosync [MAIN  ] Completed service synchronization, ready to provide service.
> May 05 22:12:50 [3553] vqa11        cib:     info: cib_process_replace:         Digest matched on replace from vqa12: dfa194fcf61b3e86b6a79b2506c41a1c
> May 05 22:12:50 [3553] vqa11        cib:     info: cib_process_replace:         Replaced 0.7126.1 with 0.7126.42 from vqa12
> May 05 22:12:50 [3553] vqa11        cib:     info: cib_replace_notify:  Replaced: 0.7126.1 -> 0.7126.42 from vqa12
> May 05 22:12:51 [3554] vqa11 stonith-ng:     info: stonith_command:     Processed register from crmd.3558: OK (0)
> May 05 22:12:51 [3554] vqa11 stonith-ng:     info: stonith_command:     Processed st_notify from crmd.3558: OK (0)
> May 05 22:12:51 [3554] vqa11 stonith-ng:     info: stonith_command:     Processed st_notify from crmd.3558: OK (0)
> May 05 22:12:51 [3558] vqa11       crmd:     info: ais_dispatch_message:        Membership 4120: quorum still lost
> May 05 22:12:51 [3558] vqa11       crmd:     info: crm_get_peer:        Node <null> now has id: 33663168
> May 05 22:12:51 [3558] vqa11       crmd:   notice: crm_update_peer_state:       crm_update_ais_node: Node (null)[33663168] - state is now member
> May 05 22:12:51 [3558] vqa11       crmd:     info: crm_update_peer:     crm_update_ais_node: Node (null): id=33663168 state=member addr=r(0) ip(192.168.1.2)  (new) votes=0 born=0 seen=4120 proc=00000000000000000000000000000000
> May 05 22:12:51 [3558] vqa11       crmd:   notice: ais_dispatch_message:        Membership 4120: quorum acquired
> May 05 22:12:51 [3558] vqa11       crmd:     info: crm_get_peer:        Node 33663168 is now known as vqa12
> May 05 22:12:51 [3558] vqa11       crmd:     info: peer_update_callback:        vqa12 is now member
> May 05 22:12:51 [3558] vqa11       crmd:     info: crm_get_peer:        Node 33663168 has uuid vqa12
> May 05 22:12:51 [3558] vqa11       crmd:     info: crm_update_peer:     crm_update_ais_node: Node vqa12: id=33663168 state=member addr=r(0) ip(192.168.1.2)  votes=1 (new) born=4092 seen=4120 proc=00000000000000000000000000000000
> May 05 22:12:51 [3558] vqa11       crmd:    error: crmd_ais_dispatch:   Recieving messages from a node we think is dead: vqa12[33663168]
> May 05 22:12:51 [3558] vqa11       crmd:     info: crm_update_peer_proc:        crmd_ais_dispatch: Node vqa12[33663168] - ais is now online
> May 05 22:12:51 [3558] vqa11       crmd:     info: peer_update_callback:        Client vqa12/peer now has status [offline] (DC=<null>)
> May 05 22:12:51 [3547] vqa11 pacemakerd:   notice: update_node_processes:       0x13685e0 Node 33663168 now known as vqa12, was:
> May 05 22:12:51 [3558] vqa11       crmd:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:12:51 [3558] vqa11       crmd:     info: peer_update_callback:        Client vqa12/peer now has status [online] (DC=<null>)
> May 05 22:12:51 [3554] vqa11 stonith-ng:     info: crm_get_peer:        Node vqa12 now has id: 33663168
> May 05 22:12:51 [3554] vqa11 stonith-ng:     info: crm_get_peer:        Node 33663168 is now known as vqa12
> May 05 22:12:51 [3554] vqa11 stonith-ng:     info: crm_get_peer:        Node 33663168 has uuid vqa12
> May 05 22:12:51 [3554] vqa11 stonith-ng:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:12:51 [3558] vqa11       crmd:     info: update_dc:   Set DC to vqa12 (3.0.7)
> May 05 22:12:51 [3553] vqa11        cib:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:12:51 [3558] vqa11       crmd:     info: erase_status_tag:    Deleting xpath: //node_state[@uname='vqa11']/transient_attributes
> May 05 22:12:51 [3558] vqa11       crmd:     info: update_attrd:        Connecting to attrd... 5 retries remaining
> May 05 22:12:51 [3558] vqa11       crmd:   notice: do_state_transition:         State transition S_PENDING -> S_NOT_DC [ input=I_NOT_DC cause=C_HA_MESSAGE origin=do_cl_join_finalize_respond ]
> May 05 22:12:51 [3556] vqa11      attrd:   notice: attrd_local_callback:        Sending full refresh (origin=crmd)
> May 05 22:12:51 [3553] vqa11        cib:     info: cib_process_replace:         Digest matched on replace from vqa12: 8e663b021f35b1f922b50f7404cb7032
> May 05 22:12:51 [3553] vqa11        cib:     info: cib_process_replace:         Replaced 0.7126.47 with 0.7126.47 from vqa12
> May 05 22:12:53 [3555] vqa11       lrmd:     info: process_lrmd_get_rsc_info:   Resource 'vgc_virtual_ip' not found (0 active resources)
> May 05 22:12:53 [3555] vqa11       lrmd:     info: process_lrmd_rsc_register:   Added 'vgc_virtual_ip' to the rsc list (1 active resources)
> May 05 22:12:53 [3555] vqa11       lrmd:     info: process_lrmd_get_rsc_info:   Resource 'vha-94b33532-15ba-4923-a920-ab9268ccd856' not found (1 active resources)
> May 05 22:12:53 [3555] vqa11       lrmd:     info: process_lrmd_get_rsc_info:   Resource 'vha-94b33532-15ba-4923-a920-ab9268ccd856:1' not found (1 active resources)
> May 05 22:12:53 [3555] vqa11       lrmd:     info: process_lrmd_rsc_register:   Added 'vha-94b33532-15ba-4923-a920-ab9268ccd856' to the rsc list (2 active resources)
> May 05 22:12:53 [3555] vqa11       lrmd:   notice: operation_finished:  vgc_virtual_ip_monitor_0:3616 [ Converted dotted-quad netmask to CIDR as: 22 ]
> May 05 22:12:54 [3558] vqa11       crmd:     info: services_os_action_execute:  Managed vgc-cm-agent.ocf_meta-data_0 process 3633 exited with rc=0
> May 05 22:12:54 [3558] vqa11       crmd:   notice: process_lrm_event:   LRM operation vha-94b33532-15ba-4923-a920-ab9268ccd856_monitor_0 (call=10, rc=7, cib-update=8, confirmed=true) not running
> May 05 22:12:55 [3558] vqa11       crmd:     info: services_os_action_execute:  Managed IPaddr2_meta-data_0 process 3659 exited with rc=0
> May 05 22:12:55 [3558] vqa11       crmd:   notice: process_lrm_event:   LRM operation vgc_virtual_ip_monitor_0 (call=5, rc=7, cib-update=9, confirmed=true) not running
> May 05 22:12:55 [3556] vqa11      attrd:   notice: attrd_trigger_update:        Sending flush op to all hosts for: probe_complete (true)
> May 05 22:12:55 [3556] vqa11      attrd:   notice: attrd_perform_update:        Sent update 8: probe_complete=true
> May 05 22:12:55 [3558] vqa11       crmd:   notice: process_lrm_event:   LRM operation vha-94b33532-15ba-4923-a920-ab9268ccd856_start_0 (call=14, rc=0, cib-update=10, confirmed=true) ok
> May 05 22:12:56 [3558] vqa11       crmd:   notice: process_lrm_event:   LRM operation vha-94b33532-15ba-4923-a920-ab9268ccd856_monitor_31000 (call=17, rc=0, cib-update=11, confirmed=false) ok
> May 05 22:14:50 [3553] vqa11        cib:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 [3558] vqa11       crmd:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 [3558] vqa11       crmd:     info: peer_update_callback:        Client vqa12/peer now has status [offline] (DC=vqa12)
> May 05 22:14:50 [3554] vqa11 stonith-ng:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 [3558] vqa11       crmd:   notice: peer_update_callback:        Got client status callback - our DC is dead
> May 05 22:14:50 [3558] vqa11       crmd:   notice: do_state_transition:         State transition S_NOT_DC -> S_ELECTION [ input=I_ELECTION cause=C_CRMD_STATUS_CALLBACK origin=peer_update_callback ]
> May 05 22:14:50 [3558] vqa11       crmd:   notice: do_state_transition:         State transition S_ELECTION -> S_INTEGRATION [ input=I_ELECTION_DC cause=C_FSA_INTERNAL origin=do_election_check ]
> May 05 22:14:50 [3558] vqa11       crmd:     info: do_te_control:       Registering TE UUID: 8e6b7382-90eb-4ac5-80bf-d2feac7e0d7e
> May 05 22:14:50 [3554] vqa11 stonith-ng:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 [3553] vqa11        cib:     info: crm_update_peer_proc:        pcmk_mcp_dispatch: Node vqa12[33663168] - unknown is now (null)
> May 05 22:14:50 [3558] vqa11       crmd:     info: set_graph_functions:         Setting custom graph functions
> May 05 22:14:50 [3558] vqa11       crmd:     info: do_dc_takeover:      Taking over DC status for this partition
> May 05 22:14:50 [3553] vqa11        cib:     info: cib_process_readwrite:       We are now in R/W mode
> May 05 22:14:50 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_master for section 'all' (origin=local/crmd/12, version=0.7126.67): OK (rc=0)
> May 05 22:14:50 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_modify for section cib (origin=local/crmd/13, version=0.7126.68): OK (rc=0)
> May 05 22:14:50 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_modify for section crm_config (origin=local/crmd/15, version=0.7126.69): OK (rc=0)
> May 05 22:14:50 [3558] vqa11       crmd:     info: join_make_offer:     Making join offers based on membership 4120
> May 05 22:14:50 [3558] vqa11       crmd:     info: do_dc_join_offer_all:        join-1: Waiting on 1 outstanding join acks
> May 05 22:14:50 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_modify for section crm_config (origin=local/crmd/17, version=0.7126.70): OK (rc=0)
> May 05 22:14:50 [3558] vqa11       crmd:     info: crm_update_peer_expected:    do_dc_join_filter_offer: Node vqa11[16885952] - expected state is now member
> May 05 22:14:51 [3558] vqa11       crmd:     info: do_dc_join_offer_all:        A new node joined the cluster
> May 05 22:14:51 [3558] vqa11       crmd:     info: do_dc_join_offer_all:        join-3: Waiting on 2 outstanding join acks
> May 05 22:14:51 [3558] vqa11       crmd:     info: update_dc:   Set DC to vqa11 (3.0.7)
> May 05 22:14:52 [3558] vqa11       crmd:     info: crm_update_peer_expected:    do_dc_join_filter_offer: Node vqa12[33663168] - expected state is now member
> May 05 22:14:52 [3558] vqa11       crmd:     info: do_state_transition:         State transition S_INTEGRATION -> S_FINALIZE_JOIN [ input=I_INTEGRATED cause=C_FSA_INTERNAL origin=check_join_state ]
> May 05 22:14:52 [3558] vqa11       crmd:     info: do_dc_join_finalize:         join-3: Syncing the CIB from vqa11 to the rest of the cluster
> May 05 22:14:52 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_sync for section 'all' (origin=local/crmd/27, version=0.7126.73): OK (rc=0)
> May 05 22:14:52 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_modify for section nodes (origin=local/crmd/28, version=0.7126.74): OK (rc=0)
> May 05 22:14:52 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_modify for section nodes (origin=local/crmd/29, version=0.7126.75): OK (rc=0)
> May 05 22:14:52 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_delete for section //node_state[@uname='vqa12']/transient_attributes (origin=vqa12/crmd/8, version=0.7126.76): OK (rc=0)
> May 05 22:14:52 [3553] vqa11        cib:  warning: cib_process_request:         Operation complete: op cib_modify for section status (origin=vqa12/attrd/61, version=0.7126.76): No such device or address (rc=-6)
> May 05 22:14:53 [3558] vqa11       crmd:     info: services_os_action_execute:  Managed vgc-cm-agent.ocf_meta-data_0 process 4015 exited with rc=0
> May 05 22:14:53 [3558] vqa11       crmd:     info: do_dc_join_ack:      join-3: Updating node state to member for vqa12
> May 05 22:14:53 [3558] vqa11       crmd:     info: erase_status_tag:    Deleting xpath: //node_state[@uname='vqa12']/lrm
> May 05 22:14:53 [3558] vqa11       crmd:     info: do_dc_join_ack:      join-3: Updating node state to member for vqa11
> May 05 22:14:53 [3558] vqa11       crmd:     info: erase_status_tag:    Deleting xpath: //node_state[@uname='vqa11']/lrm
> May 05 22:14:53 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_delete for section //node_state[@uname='vqa12']/lrm (origin=local/crmd/30, version=0.7126.79): OK (rc=0)
> May 05 22:14:53 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_delete for section //node_state[@uname='vqa11']/lrm (origin=local/crmd/32, version=0.7126.81): OK (rc=0)
> May 05 22:14:53 [3558] vqa11       crmd:     info: do_state_transition:         State transition S_FINALIZE_JOIN -> S_POLICY_ENGINE [ input=I_FINALIZED cause=C_FSA_INTERNAL origin=check_join_state ]
> May 05 22:14:53 [3556] vqa11      attrd:   notice: attrd_local_callback:        Sending full refresh (origin=crmd)
> May 05 22:14:53 [3556] vqa11      attrd:   notice: attrd_trigger_update:        Sending flush op to all hosts for: probe_complete (true)
> May 05 22:14:53 [3558] vqa11       crmd:     info: abort_transition_graph:      do_te_invoke:156 - Triggered transition abort (complete=1) : Peer Cancelled
> May 05 22:14:53 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_modify for section nodes (origin=local/crmd/34, version=0.7126.83): OK (rc=0)
> May 05 22:14:53 [3553] vqa11        cib:     info: cib_process_request:         Operation complete: op cib_modify for section cib (origin=local/crmd/36, version=0.7126.85): OK (rc=0)
> 
> 
> 
> [root at vqa12 bug17873]# rpm -qa | grep pacemaker
> pacemaker-cluster-libs-1.1.8-7.el6.x86_64
> pacemaker-cli-1.1.8-7.el6.x86_64
> pacemaker-1.1.8-7.el6.x86_64
> pacemaker-libs-1.1.8-7.el6.x86_64
> [root at vqa12 bug17873]# rpm -qa | grep corosync
> corosync-1.4.1-15.el6.x86_64
> corosynclib-1.4.1-15.el6.x86_64
> [root at vqa12 bug17873]# cat /etc/redhat-release
> Red Hat Enterprise Linux Server release 6.2 (Santiago)
> 
> 
> Regards,
>  Kiran
> 




