Adding Node Breaks Polls

Started by johnkv, February 23, 2011, 01:36:29 AM

johnkv

I am monitoring several nodes in my NetXMS install (Ubuntu, version 1.0.10, SQLite). I have been getting errors in the log, with the Internal DCIs AverageDCPollerQueueSize, DBWriterQueueSize, QueueingTime, PollerQueue and ConfigPollerQueueSize all changing to unsupported (manually clearing them buys me about 4 hours before they reappear). Then a bunch of services go down. A manual status poll shows it is being run against the wrong IP address (it seems to be using the management IP of a Cisco 6509 with 130+ interfaces). So I looked at the 6509 node, and it listed only the XMS server's interfaces (different from when I initially added it): the XMS public IP, the internal IP and the NAT adapter. I deleted the 6509 node and all the alarms cleared immediately. I then re-added the 6509, and it populated all 130+ interfaces (I know because of the 136 emails I received seconds later); all the alarms (DCI and services) went down and unavailable again, and the manual poll showed the 6509 address again.

Thoughts?

Victor Kirhenshtein

Hi!

This looks somewhat similar to this bug: https://www.netxms.org/bugtrack/view.php?id=319. Could you please run the server in debug mode (by adding the -D 6 switch to the netxmsd command line) and send me the logs (either here or to dump-at-netxms.org)?

Also, a question to all readers: does anybody have a Cisco device in a lab with such a large number of ports, and could you give me remote access to it for a few days? My best Cisco device has only 24 ports, and I can reproduce such a problem (either described here or listed in the bug record) on big Nortel switches :(

Best regards,
Victor

johnkv

Sorry, my attachment was too big, here it is broken in half.

Victor: I removed attachments to prevent unneeded information disclosure.

Victor Kirhenshtein

It looks like the management server has the same IP address as one of the 6509's interfaces. Could you check? Does the NetXMS server have only one IP?

Victor Kirhenshtein

As I found in the log, interface Vl1 has IP address 172.30.1.25, and interface eth0 on the management server has the same address. This causes the NetXMS server to interpret the switch as the local machine.
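
For anyone who wants to check their own setup for the same kind of overlap, here is a minimal Python sketch of the comparison. It is not NetXMS code and does not use any NetXMS API. The function name find_address_conflicts and the Vl20 entry are hypothetical; only the eth0 and Vl1 addresses come from the log above. It assumes you have already collected the interface lists by hand (for example from the server's interface configuration and from the switch).

```python
import ipaddress


def find_address_conflicts(server_ifaces, node_ifaces):
    """Return (server_if, node_if, address) for every IP present on both hosts.

    Both arguments map an interface name to its IP address as a string.
    """
    # Index the server's addresses so each node address is a single lookup.
    by_address = {}
    for name, addr in server_ifaces.items():
        by_address.setdefault(ipaddress.ip_address(addr), []).append(name)

    conflicts = []
    for name, addr in node_ifaces.items():
        ip = ipaddress.ip_address(addr)
        for server_if in by_address.get(ip, []):
            conflicts.append((server_if, name, str(ip)))
    return conflicts


# eth0 and Vl1 are the addresses reported in this thread.
# Vl20 is a made-up entry added only to show a non-conflicting interface.
server = {"eth0": "172.30.1.25"}
switch = {"Vl1": "172.30.1.25", "Vl20": "192.0.2.1"}

conflicts = find_address_conflicts(server, switch)
for server_if, node_if, ip in conflicts:
    print(f"{ip} is configured on server {server_if} and on node {node_if}")
```

Parsing the strings with ipaddress.ip_address means differently written forms of the same address still compare equal, and a malformed entry raises an error instead of being silently skipped.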

johnkv

Just to follow up, that was exactly it. Some interesting behavior I observed, for others who may be searching: I had intermittent up/down events (false positives), and when I looked at the switch it would sometimes show me the management server's interfaces and other times the switch's own.

Thanks for taking a look!