[Pacemaker] The active trap of the SNMP is delayed.
renayama19661014 at ybb.ne.jp
Fri Jun 17 01:44:29 UTC 2011
Hi All,
I registered this problem in Bugzilla.
* http://developerbugs.linux-foundation.org/show_bug.cgi?id=2604
Best Regards,
Hideo Yamauchi.
--- On Wed, 2011/6/15, renayama19661014 at ybb.ne.jp <renayama19661014 at ybb.ne.jp> wrote:
> Hi All,
>
> I found a problem with an SNMP trap sent by hbagent.
>
> The "active" trap for a node can apparently be delayed.
>
> The problem is intermittent; it does not occur every time.
>
>
> I confirmed it with the following procedure.
>
> Step1) Start a node.
>
> ============
> Last updated: Wed Jun 15 19:23:39 2011
> Stack: Heartbeat
> Current DC: srv02 (afe72fff-b7b4-4663-b845-872df29c635d) - partition WITHOUT quorum
> Version: 1.0.11-6e010d6b0d49a6b929d17c0114e9d2d934dc8e04
> 2 Nodes configured, unknown expected votes
> 1 Resources configured.
> ============
>
> Online: [ srv01 srv02 ]
>
> Resource Group: group-1
> prmDummy1 (ocf::heartbeat:Dummy): Started srv01
>
> Migration summary:
> * Node srv02:
> * Node srv01:
>
>
> Step2) Block one of the interfaces used for Heartbeat communication.
>
> # iptables -A INPUT -i eth1 -s ! 192.168.10.110 -j DROP
> # iptables -A INPUT -i eth1 -s ! 192.168.10.120 -j DROP
>
>
> Step3) The SNMP manager receives the following traps.
>
> (snip)
> Jun 15 19:24:30 snmp-manager snmptrapd[4771]: 2011-06-15 19:24:30 <UNKNOWN> [UDP: [192.168.40.120]:59010]: DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (23014) 0:03:50.14 SNMPv2-MIB::snmpTrapOID.0 = OID: LINUX-HA-MIB::LHAIFStatusUpdate LINUX-HA-MIB::LHANodeName = STRING: srv01 LINUX-HA-MIB::LHAIFName = STRING: eth1 LINUX-HA-MIB::LHAIFStatus = INTEGER: down(2)
> ----> No problem.
> Jun 15 19:24:32 snmp-manager snmptrapd[4771]: 2011-06-15 19:24:32 <UNKNOWN> [UDP: [192.168.40.110]:44001]: DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (23597) 0:03:55.97 SNMPv2-MIB::snmpTrapOID.0 = OID: LINUX-HA-MIB::LHANodeStatusUpdate LINUX-HA-MIB::LHANodeName = STRING: srv02 LINUX-HA-MIB::LHANodeStatus = INTEGER: active(3)
> ----> The active trap should not arrive at this point.
> Jun 15 19:24:34 snmp-manager snmptrapd[4771]: 2011-06-15 19:24:34 <UNKNOWN> [UDP: [192.168.40.110]:44001]: DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (23803) 0:03:58.03 SNMPv2-MIB::snmpTrapOID.0 = OID: LINUX-HA-MIB::LHAIFStatusUpdate LINUX-HA-MIB::LHANodeName = STRING: srv02 LINUX-HA-MIB::LHAIFName = STRING: eth1 LINUX-HA-MIB::LHAIFStatus = INTEGER: down(2)
> ----> No problem.
> (snip)
>
> It is strange that the node's active trap arrives between the two interface-down traps.
>
> I think the active trap should have been sent earlier.
>
>
> This problem seems to occur with Heartbeat 2.1.4.
>
> I looked at some of the source code, and I suspect that Heartbeat's client_lib is the cause.
> The transmitted F_STATUS message seems to be handled late.
>
>
> Best Regards,
> Hideo Yamauchi.
>
>
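For anyone trying to reproduce this, the ordering check described in Step3 can be automated against the snmptrapd log. The following is only a minimal sketch in Python, assuming the log lines have the same layout as the three quoted above. The names Trap, parse_traps and interleaved_node_traps are hypothetical helpers written for this illustration; they are not part of Heartbeat, hbagent or Net-SNMP.

```python
import re
from datetime import datetime
from typing import List, NamedTuple


class Trap(NamedTuple):
    timestamp: datetime  # receive time logged by snmptrapd
    source: str          # IP address of the sending agent
    name: str            # trap name, e.g. LHANodeStatusUpdate
    node: str            # value of LHANodeName


# Assumed line layout: copied from the snmptrapd output in the report.
TRAP_RE = re.compile(
    r"(?P<ts>\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}) <UNKNOWN> "
    r"\[UDP: \[(?P<src>[\d.]+)\]:\d+\]:.*?"
    r"snmpTrapOID\.0 = OID: LINUX-HA-MIB::(?P<name>\w+) "
    r"LINUX-HA-MIB::LHANodeName = STRING: (?P<node>\S+)"
)

# The three trap lines from Step3, unchanged.
SAMPLE_LOG = """\
Jun 15 19:24:30 snmp-manager snmptrapd[4771]: 2011-06-15 19:24:30 <UNKNOWN> [UDP: [192.168.40.120]:59010]: DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (23014) 0:03:50.14 SNMPv2-MIB::snmpTrapOID.0 = OID: LINUX-HA-MIB::LHAIFStatusUpdate LINUX-HA-MIB::LHANodeName = STRING: srv01 LINUX-HA-MIB::LHAIFName = STRING: eth1 LINUX-HA-MIB::LHAIFStatus = INTEGER: down(2)
Jun 15 19:24:32 snmp-manager snmptrapd[4771]: 2011-06-15 19:24:32 <UNKNOWN> [UDP: [192.168.40.110]:44001]: DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (23597) 0:03:55.97 SNMPv2-MIB::snmpTrapOID.0 = OID: LINUX-HA-MIB::LHANodeStatusUpdate LINUX-HA-MIB::LHANodeName = STRING: srv02 LINUX-HA-MIB::LHANodeStatus = INTEGER: active(3)
Jun 15 19:24:34 snmp-manager snmptrapd[4771]: 2011-06-15 19:24:34 <UNKNOWN> [UDP: [192.168.40.110]:44001]: DISMAN-EVENT-MIB::sysUpTimeInstance = Timeticks: (23803) 0:03:58.03 SNMPv2-MIB::snmpTrapOID.0 = OID: LINUX-HA-MIB::LHAIFStatusUpdate LINUX-HA-MIB::LHANodeName = STRING: srv02 LINUX-HA-MIB::LHAIFName = STRING: eth1 LINUX-HA-MIB::LHAIFStatus = INTEGER: down(2)
"""


def parse_traps(log_text: str) -> List[Trap]:
    """Extract traps from snmptrapd log text, preserving arrival order."""
    traps = []
    for line in log_text.splitlines():
        m = TRAP_RE.search(line)
        if m:
            ts = datetime.strptime(m.group("ts"), "%Y-%m-%d %H:%M:%S")
            traps.append(Trap(ts, m.group("src"), m.group("name"), m.group("node")))
    return traps


def interleaved_node_traps(traps: List[Trap]) -> List[Trap]:
    """Return node status traps that arrived between the first and last
    interface status traps, which is the ordering reported as suspicious."""
    if_idx = [i for i, t in enumerate(traps) if t.name == "LHAIFStatusUpdate"]
    if not if_idx:
        return []
    first, last = if_idx[0], if_idx[-1]
    return [
        t for i, t in enumerate(traps)
        if t.name == "LHANodeStatusUpdate" and first < i < last
    ]


if __name__ == "__main__":
    for trap in interleaved_node_traps(parse_traps(SAMPLE_LOG)):
        print(trap.timestamp, trap.source, trap.name, trap.node)
```

Because the problem is intermittent, a check like this could be run over the manager's log after each repetition of Step2 to count how often the node status trap lands between the interface traps.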