NetXMS Support Forum

English Support => General Support => Topic started by: xenth on May 13, 2008, 09:17:36 AM

Title: Server crashed again, possibly database writer queue?
Post by: xenth on May 13, 2008, 09:17:36 AM
I've had netxms running unattended for around 4-5 days and it crashed, it didn't collect any data after this occured  :(

The logfile shows the same problem as from before


[11-May-2008 17:32:46] Thread "Item Poller" does not respond to watchdog thread
[11-May-2008 17:34:06] Thread "Poll Manager" does not respond to watchdog thread
[11-May-2008 17:34:26] Thread "Syncer Thread" does not respond to watchdog thread


However, this time there were no substantial queues, to prove this here's a graph of all the pollers.

Yellow: Status poller queue
Blue:    Data collector queue
Purple: Config poller queue
Light blue: Database writer queue
Green: Average time to queue dci's

(http://xs227.xs.to/xs227/08202/newgraph752.png)

When I look at the entire graph over 4 days I see that the database writer can get pretty high from time to time, just for a moment, but still.

Here's a graph over 4 days, same colour codes apply.

(http://xs227.xs.to/xs227/08202/graph2511.png)
Title: Re: Server crashed again, possibly database writer queue?
Post by: xenth on May 13, 2008, 10:07:14 AM
Here is an example of how bad the database writer queue can get:

(http://xs227.xs.to/xs227/08202/danger433.png)


:(
Title: Re: Server crashed again, possibly database writer queue?
Post by: xenth on May 13, 2008, 11:30:51 AM
Happened again just now  >:(


[13-May-2008 09:40:58] Thread "Item Poller" does not respond to watchdog thread
[13-May-2008 09:41:58] Thread "Poll Manager" does not respond to watchdog thread
[13-May-2008 09:42:38] Thread "Syncer Thread" does not respond to watchdog thread
Title: Re: Server crashed again, possibly database writer queue?
Post by: Victor Kirhenshtein on May 13, 2008, 02:34:47 PM
Hello!

Please try to upgrade to 0.2.21 - it's available already at https://www.netxms.org/download/netxms-0.2.21.exe but not announced yet - we do some final testing. You problem looks very similar to one solved in 0.2.21, related to SNMP data collection.

Best regards,
Victor
Title: Re: Server crashed again, possibly database writer queue?
Post by: xenth on May 13, 2008, 03:37:44 PM
I am very interested in the changelog if you have it available  :)

I'm going to try upgrading the server when I have the time.

Thank you.

Title: Re: Server crashed again, possibly database writer queue?
Post by: Victor Kirhenshtein on May 13, 2008, 04:30:19 PM
Change log:

- Multiple network maps implemented
- Added parameter ListenAddress to all services (server, web server, agent)
- New possible value for UseInterfaceAliases - concatenate name with alias
- Added possibility to create custom message in event matching script and
  use it in alarms and actions
- WMI subagent added
- SNMP sysDescr and agent's uname now polled and displayed
- New features in Windows console:
        - Possibility to use non-local timezone in Windows console
        - Default graph settings can be changed
- AIX subagent: implemented System.CPU.LoadAvg* and System.Uptime parameters
- Fixed issues: #193, #194, #198, #204, #209, #211, #212, #213, #214, #215

Best regards,
Victor
Title: Re: Server crashed again, possibly database writer queue?
Post by: xenth on May 15, 2008, 10:23:28 AM
I'm using it now :)

I'll let you know if I experience strange things
Title: Re: Server crashed again, possibly database writer queue?
Post by: xenth on May 15, 2008, 03:30:25 PM
The problem with the database writer queue hasn't been fixed unfortunately  :(
Here's a graph showing my problem

(http://img396.imageshack.us/img396/4417/graphjeqz8.th.png) (http://img396.imageshack.us/my.php?image=graphjeqz8.png)
Title: Re: Server crashed again, possibly database writer queue?
Post by: Alex Kirhenshtein on May 16, 2008, 02:33:54 AM
It looks normal; most of the time queue is empty - these peaks are caused by housekeeping process which runs every hour (it loads quite bit amount of data, do vacuum (on postgresql), etc).

DB Writer queue size indicates problem only when it's value is above zero most of the time.

However, if db load will be too high, you can try to increase housekeeping intervals in server config - this should help a bit.
Title: Re: Server crashed again, possibly database writer queue?
Post by: xenth on May 16, 2008, 09:04:26 AM
Ahh I see :)

But, I want to get an alert if one of the queues is too high, what threshold do you recommend that I set on the databasewriterqueue?
Title: Re: Server crashed again, possibly database writer queue?
Post by: Alex Kirhenshtein on May 17, 2008, 12:26:47 AM
I'd set something like "average >= 10 for last 5 polls"