News:

We really need your input in this questionnaire

Main Menu

SYS_AGENT_UNREACHABLE

Started by Nomis, January 15, 2016, 03:09:55 PM

Previous topic - Next topic

Nomis

Am I missing something obvious or is this the Twilight Zone. I once more opened the nxagentd.conf file, ensured that DebugLevel is 6, saved and closed, restarted the Agent, and the log file once again says "0". This is the content of the conf-file:
Quote#
# NetXMS agent configuration file
# Created by agent installer at Thu Sep 29 15:41:41 2011
#

MasterServers = 192.168.25.17

FileStore = F:\Program\NetXMS\var\
LogFile = F:\Program\NetXMS\etc\nxagentd.log
DebugLevel = 6
SubAgent = ecs.nsm
SubAgent = ping.nsm
SubAgent = logwatch.nsm
SubAgent = portcheck.nsm
SubAgent = winperf.nsm
SubAgent = wmi.nsm
SubAgent = ups.nsm

tomaskir

Please also check "F:\Program\NetXMS\etc\nxagentd.conf.d" if there is some config there or not.

Nomis


Victor Kirhenshtein

Hi,

check if you have -D0 in service command line. If yes, remove it.

Best regards,
Victor

Nomis

That did the trick, Victor. Now there's a lot of logging going on. I'll get back tomorrow with the result.

Nomis

I'm attaching an hour's worth of the log file, between 09:00 and 10:00. During that time there were five alarms, approximately 09:17, 09:21, 09:23, 09:32, 09:42, and 09:48. Thanks for taking time!

Nomis

Personally I can't find anything consistent with the alarms in the logs. There are "Session diconnected by timeout", but these are quite frequent and not necessarily at the same minutes that we get the alarms.

If anybody finds anything or has any idea of a next step, please don't hesitate to let me know. :-)

Victor Kirhenshtein

Activity looks strange actually. There are multiple occurences of this pattern:

[27-Jan-2016 09:19:38.747] [DEBUG] [session:1] Session disconnected by timeout (last activity timestamp is 1453882717)
[27-Jan-2016 09:19:38.747] [DEBUG] [session:1] Session with 192.168.25.17 closed
[27-Jan-2016 09:19:42.835] [DEBUG] Incoming connection from 192.168.25.17
[27-Jan-2016 09:19:42.835] [DEBUG] Connection from 192.168.25.17 accepted

then normal session init, parameter queries, and suddenly again session disconnect by timeout and almost immediate reconnect:

[27-Jan-2016 09:19:42.850] [DEBUG] [session:1] Requesting parameter "Agent.Uptime"
[27-Jan-2016 09:19:42.850] [DEBUG] [session:1] Sending message CMD_REQUEST_COMPLETED (size 56)
[27-Jan-2016 09:20:43.861] [DEBUG] [session:1] Session disconnected by timeout (last activity timestamp is 1453882782)
[27-Jan-2016 09:20:43.861] [DEBUG] [session:1] Session with 192.168.25.17 closed

and inactivity timeout seems to be very short. Could it be that you have agent configuration parameter SessionIdleTimeout set to 0?

Best regards,
Victor

Nomis

Thank you for taking time, Victor Kirhenshtein, I really appreciate it. Sorry to have to respond to every suggestion with a question, but where would that parameter be? - There's no such parameter in the agent's config file, and I can't find it in the object's Properties either.

Victor Kirhenshtein

It is in nxagentd.conf,  but it has default value of 60 seconds on Windows, and in the log we see "idle" disconnect after just one second. Try to add

SessionIdleTimeout = 600

to nxagentd.conf and check if idle disconnect messages will still appear.

Best regards,
Victor

Nomis

Thank you. I'll do that and report back later.

Nomis

The annoying SYS_AGENT_UNREACHABLE seems to have disappeared after the SessionIdleTimout configuration. Thank you very much, Victor.

jhonnyvey

Hi,

I have same issue too and found this topic :).
I made this change 'SessionIdleTimeout = 600' and now hope that the problem also be solved :)

Tks a lot! :)