DCI and Authentication disabled (after OS reboot)

Started by radw, May 23, 2016, 04:32:51 AM

Previous topic - Next topic

radw

Hello NETXMS team, I continue working with NETXMS and it start give me the required SNMP monitoring that I was looking for. Thanks you for this tool.

However I just find out that, for two nodes, even if I leave all their DCI activated, after a reboot of the Debian8 (where I installled NETXMS), their DCIs appear as "disabled" again:

1) Before Debian reboot: Their DCIs are activated (in fact, I activated again since they appeared disabled in the last reboot)





2) After Debian rebbot: The DCIs appear disabled again:





In addition , every time the OS is restarted, I have to reload the netxms service for enable access through the Management Console (on the contrary, a message "conection refused" appear). I found the netxms service status different after I restart it:



NETXMS-TEST:~$ sudo service netxmsd status
[sudo] password for admtelcisco:
â netxmsd.service - LSB: NetXMS server
   Loaded: loaded (/etc/init.d/netxmsd)
   Active: active (exited) since Sun 2016-05-22 19:12:52 COT; 5min ago
  Process: 613 ExecStart=/etc/init.d/netxmsd start (code=exited, status=0/SUCCESS)
NETXMS-TEST:~$
NETXMS-TEST:~$ sudo service netxmsd restart
NETXMS-TEST:~$
NETXMS-TEST:~$ sudo service netxmsd status
â netxmsd.service - LSB: NetXMS server
   Loaded: loaded (/etc/init.d/netxmsd)
   Active: active (running) since Sun 2016-05-22 19:18:21 COT; 3s ago
  Process: 1787 ExecStop=/etc/init.d/netxmsd stop (code=exited, status=0/SUCCESS)
  Process: 1796 ExecStart=/etc/init.d/netxmsd start (code=exited, status=0/SUCCESS)
   CGroup: /system.slice/netxmsd.service
           ââ1802 /usr/bin/netxmsd -d




How those two issues could be fixed??


Regards.

radw

Hello, I solved the DCI disabled, by copy them to an unmanaged node, then deleting it in the original node, and finally copy the DCI from the unmanaged node, activating and rediscover the instances again, reboot the system, and the DCIs keep activated.

The autentication issue persists after reboot.

Regards.

Victor Kirhenshtein

Hi,

it seems that server is started on OS boot and then stopped. One possible reason could be that it started before database is ready, so it cannot access database and exits. Can you please set debug level to 9 (by adding DebugLevel=9 to netxmsd.conf), restart the system, and post server's log file?

Best regards,
Victor

radw


Victor Kirhenshtein

It seems that netxmsd starts before MySQL is ready:

[31-May-2016 10:50:17.311] [ERROR] Unable to establish connection with database (Can't connect to local MySQL server through socket '/var/run/mysqld/mysqld.sock' (2))

try to add dependency on mysqld and/or startup delay to netxmsd startup script.

Best regards,
Victor

radw

Hello, thanks for your answer. In those cases, which is the best practice ??. Also, is there any procedure in the documentation to perform that action ??

Regards.

radw

Hello Victor, can you explain me how to perform those steps. Thanks.

Alex Kirhenshtein

Modify /etc/init.d/netxmsd, add "mysqld" to "Required-Start" section.

radw

Hello, I modify the file and restart the netxmsd service, however the issue persists after I reboot the Debian system:



NETXMS-TEST:~$ sudo vi /etc/init.d/netxmsd
#!/bin/sh
# vim: syntax=sh
### BEGIN INIT INFO
# Provides:          netxms-server
# Required-Start:    $network $local_fs nxagentd mysqld
# Required-Stop:
# Default-Start:     2 3 4 5
# Default-Stop:      0 1 6
# Short-Description: NetXMS server
# Description:       Network monitoring daemon, which collect and process metrics
#                    from native and SNMP agents.
### END INIT INFO

# Author: Alex Kirhenshtein <[email protected]>




Regards.

radw

Hello, I still have the authentication issue after rebooting or shutdown the OS.

The netxmsd service appears active, but is necessary to restart it for allow login.

Regards.

Victor Kirhenshtein

What is in the netxmsd log when you try to connect after OS restart?

Best regards,
Victor

radw


Victor Kirhenshtein

Hi,

still there is an issue with database not being ready:

[19-Aug-2016 09:55:11.539] [ERROR] Unable to establish connection with database (Can't connect to local MySQL server through socket '/var/run/mysqld/mysqld.sock' (2))

If there is dependency on mysqld in netxmsd startup script, then you need to add some code to wait while mysql becomes available in netxmsd startup script, or just add delay big enough to allow mysqld to initialize.

Best regards,
Victor

radw

Hello Victor, thanks for your answer and analysis.

In this case, ¿¿can you guide me on on how to ingress the code you mentioned for fixing the issue ??

Regards.