NetXMS Support Forum

English Support => General Support => Topic started by: Dani@M3T on May 19, 2016, 08:52:13 PM

Title: SNMP agent unreachable after network outage
Post by: Dani@M3T on May 19, 2016, 08:52:13 PM
Hi

I observe the following problem for some time:
We have some sites connected by VPN connections.
When a VPN connection goes down and up again, the NetXMS server reports the remote VPN-gateway as 'snmp agent unreachable'. When I manually do a status Poll on a remote node I get 'node is connected' but also 'snmp agent unreachable'.

My findings so far:

[19-May-2016 18:28:50.789] [DEBUG] Node(vpn-gateway.domain.intern)->GetItemFromSNMP(.1.3.6.1.2.1.1.3.0): dwResult=17
[19-May-2016 18:28:50.789] [DEBUG] StatusPoll(vpn-gateway.domain.intern [951]): unable to get system uptime
[19-May-2016 18:28:50.789] [DEBUG] StatusPoll(vpn-gateway.domain.intern [951]): unable to get agent uptime
[19-May-2016 18:28:50.789] [DEBUG] StatusPoll(vpn-gateway.domain.intern [951]): unable to get system location
[19-May-2016 18:28:50.789] [DEBUG] Finished status poll for node vpn-gateway.domain.intern (ID: 951)
[19-May-2016 18:28:50.789] [DEBUG] ConfigReadStr: (cached) name=DeleteUnreachableNodesPeriod value="0"
...
[19-May-2016 18:30:07.540] [DEBUG] Node(vpn-gateway.domain.intern)->GetItemFromSNMP(.1.3.6.1.4.1.890.1.6.22.1.6.0): dwResult=4


What could be the best next steps for troubleshooting?

NetXMS server is V2.0.3 on Linux x64 built from sources.

thanks
Dani
Title: Re: SNMP agent unreachable after network outage
Post by: Victor Kirhenshtein on May 27, 2016, 10:12:03 AM
Hi,

error 4 is general communication error, and error 17 is SNMP engine ID mismatch. Does this problematic gateway use SNMP version 3? Could you please capture SNMP traffic between NetXMS server and gateway during unsuccessful status poll?

Best regards,
Victor
Title: Re: SNMP agent unreachable after network outage
Post by: Dani@M3T on May 27, 2016, 12:56:22 PM
Hi Victor

Yes these are SNMPV3 Devices.
I rebooted one of the remote gateways to activate the problem and captured the SNMP traffic between the remote gateway and the NetXMS server.

I get a lot of exactly this communication:
Server to gateway: "encryptedPDU: privKey Unknown"
Gateway to server: "report 1.3.6.1.6.3.15.1.1.4.0"
(I can also send you the tcpdump file but not in the forum)

Maybe the Engine ID changed when remote gateway was rebooted. But I cannot set a static Engine ID on these gateways.

kind regards
Dani
Title: Re: SNMP agent unreachable after network outage
Post by: Victor Kirhenshtein on May 27, 2016, 07:16:16 PM
Hi,

could you please try to apply attached patch and check if server will handle gateway restart correctly?

Best regards,
Victor
Title: Re: SNMP agent unreachable after network outage
Post by: Dani@M3T on May 28, 2016, 06:21:43 PM
Hi Victor

Your patch for reset of the engine ID fixes the problem. First test is ok!
Do you apply this to the 2.0.4 release?

Thanks!
Dani
Title: Re: SNMP agent unreachable after network outage
Post by: Victor Kirhenshtein on May 30, 2016, 02:39:46 PM
Hi,

yes, this patch will be included into 2.0.4 release.

Best regards,
Victor