Topics - lweidig

#21
Feature Requests / DCI - Use Multiplier
October 09, 2020, 03:04:40 PM
It would be nice to be able to turn the Use Multiplier function on / off on a per-item basis, and of course to have a default setting for this in the Server Configuration options.  In most instances it is great and works as you would hope, but for things like Ethernet speed or the frequency that an access point is using it becomes odd.  Gigabit Ethernet shows as 1K, and an access point using 2412 for its frequency shows 2.41K, which is not helpful at all in that case.
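To illustrate the request, here is a minimal sketch of what a per-item switch could look like. The function name `format_value` and the `use_multiplier` flag are hypothetical and are not NetXMS code; the sketch only reproduces the 1K / 2.41K display described above and shows the raw value when the flag is off.

```python
def format_value(value, use_multiplier=True):
    """Format a collected value, optionally shortening it with K/M/G suffixes.

    Hypothetical helper for illustration only; not the NetXMS implementation.
    """
    if not use_multiplier:
        # Per-item opt-out: show the raw number (e.g. a frequency of 2412).
        return str(value)
    for factor, suffix in ((10**9, "G"), (10**6, "M"), (10**3, "K")):
        if abs(value) >= factor:
            # Three significant digits, trailing zeros dropped.
            return f"{value / factor:.3g}{suffix}"
    return str(value)


print(format_value(1000))                        # 1K
print(format_value(2412))                        # 2.41K
print(format_value(2412, use_multiplier=False))  # 2412
```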

Thanks!
#22
General Support / [SOLVED] GeoLocation function not found
December 20, 2018, 05:37:38 AM
Running a script like the one at:
https://wiki.netxms.org/index.php?title=Script_Example:_Setting_node_geolocation_from_SNMP&mobileaction=toggle_view_desktop

Gives an error on the GeoLocation line with function not found.  We are running 2.2.10.  How do we correct this?
#23
General Support / Agent Shared Secret Max Length
January 16, 2018, 07:33:31 PM
There seems to be a max length for the agent shared secret, at least if you are using nxencpasswd -a to encrypt it (and maybe without it; I have not tried).  We have successfully used a password that is 19 characters long, but it fails with 32-character passwords.

What is the limit, and can you please update the Wiki / Admin guides to show this limit so that others do not spend time on longer passwords that fail?  Thanks!
#24
General Support / Automated Agent Configuration
December 22, 2017, 03:45:57 AM
We have written a script which can analyze a server, determine the services that we care to monitor on that device, and then update the node.  Following that, templates apply to the node and start monitoring these services.  The issue we are having is how to push agent configuration changes to the servers AUTOMATICALLY, based on what we find.

As an example:

Server 1 has ssh, https and radius running
Server 2 has ssh and https

We have files that typically go into the nxagent.conf.d for each of these services that have ExternalParameters, Actions,...  At this point we have not found any automated way of getting these to the client.  Yes, I know you can push configurations, but that only seems to allow for an ENTIRE configuration to be pushed, not just the components needed.  We thought about doing this with Policies, but there does not seem to be any way to automatically install / update them.

Looking for suggestions on how to automate this process.  Pretty much want it to find a new node on the network and completely set it up for monitoring after the agent is setup.
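The selection step described above can be sketched as follows. This is only an illustration of mapping detected services to per-service configuration fragments; the fragment file names and the `plan` helper are hypothetical, and how the chosen files would actually be delivered to the agent is exactly the open question of this post.

```python
# Hypothetical mapping from a detected service to its config fragment file.
FRAGMENTS = {
    "ssh": "ssh.conf",
    "https": "https.conf",
    "radius": "radius.conf",
}


def fragments_for(services):
    """Return the sorted fragment file names for the detected services."""
    return sorted(FRAGMENTS[s] for s in services if s in FRAGMENTS)


def plan(nodes):
    """Map each node name to the fragments it would need."""
    return {node: fragments_for(services) for node, services in nodes.items()}


detected = {
    "Server 1": ["ssh", "https", "radius"],
    "Server 2": ["ssh", "https"],
}
for node, files in plan(detected).items():
    print(node, files)
```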

Thanks!
#25
General Support / Retrieve data FROM NetXMS
August 16, 2017, 05:30:21 AM
I am looking for a way to grab data FROM NetXMS that it has collected for display in a web application that is primarily Javascript based.  Installed the Web REST api and have it working to get node based information, but cannot figure out a way to get DCI information which is what I really want.

What suggestions do people have for polling data that NetXMS is storing? 
#26
I had a number of nodes that I had significantly changed the templates for and thought it would be a great idea to delete the nodes and let them get discovered again.  It has been longer than one activediscoveryinterval (900s) and they are not coming back.  Wondering if there is something I need to do to get them to come back or if I will just need to add them all manually.  This is on a 2.1 server.
#27
Installed the 2.1RC1 mobile agent APK on a Galaxy S7 (model: SM-G30R4) running Android 7.0.  As soon as I enabled Location Strategy of Network & GPS the app started crashing (and wanting to e-mail crash reports) - CONTINUALLY!!!!  Could not really do anything as it just kept opening the mail window wanting to send another crash report.  Finally, I was able to reboot the phone and uninstall the Agent before it started the crash loop.  I did try restarting the app before that to see if that would resolve it, but it kept on crash looping as previously.

The client needs to crash and stay crashed; this was really annoying and I suspect a typical user would have had no idea how to get out of it.  I am perfectly fine with it wanting to e-mail a crash dump and hopefully you have a few from me, but the crash / e-mail loop needs immediate resolution.

Seems 2.1 has a way to go from what we are seeing from a stability and functionality standpoint.  It is a great product and I am sure it can be overcome, thanks!
#28
We are running Ubuntu 16.04 from packages.  The agent running on the server will NOT however connect to the server itself.  Running the agent with -D 9 -f we only see:

[19-May-2017 13:35:09.775] [DEBUG] Incoming connection from <serverIp>
[19-May-2017 13:35:09.776] [DEBUG] Connection from <serverIp> rejected

The nxagentd.conf file is very basic:

MasterServers = 127.0.0.1
FileStore = /tmp
StartupDelay = 30
EnableActions = yes
SubAgent = portcheck.nsm
*portCheck
Timeout = 5000
*ping
Timeout = 5000

I see the modules all loading and things seem fine.  Have tried using the real IP for the MasterServers and no help.  Tried also debugging the server and did not get anything that helped from that either.
#29
This is an Ubuntu 16.04 system, fully patched, using the packages for installation.  Running the command:
nxdbmgr init /usr/share/netxms/sql/dbinit_mysql.sql

Results in the following:
NetXMS Database Manager Version 2.1-RC1 Build 9010 (2.1-RC1) (UNICODE)

Initializing database...
SQL query failed (Specified key was too long; max key length is 767 bytes):
CREATE TABLE user_profiles (   user_id integer not null,   var_name varchar(255) not null,   var_value longtext not null,   PRIMARY KEY(user_id,var_name) )
Database initialization failed


Suggestions on getting past this point?
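One possible explanation, which is an assumption and not something confirmed in this post, is that the database uses a multi-byte character set, so the var_name varchar(255) key column needs more bytes than the 767-byte limit quoted in the error. The arithmetic below is a rough sketch using the usual maximum bytes per character for three common MySQL character sets.

```python
MAX_KEY_BYTES = 767  # limit quoted in the error message above

# Maximum bytes per character for some common MySQL character sets.
BYTES_PER_CHAR = {"latin1": 1, "utf8": 3, "utf8mb4": 4}


def key_bytes(chars, charset):
    """Worst-case byte size of a varchar(chars) key column in the given charset."""
    return chars * BYTES_PER_CHAR[charset]


for charset in BYTES_PER_CHAR:
    size = key_bytes(255, charset)
    verdict = "fits" if size <= MAX_KEY_BYTES else "too long"
    print(f"{charset}: varchar(255) needs up to {size} bytes ({verdict})")
```

If that assumption holds, checking the character set the database was created with would be a reasonable first step.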
#30
General Support / MIB Issue 1.2.5
February 19, 2013, 08:21:02 PM
We are running an Ubuntu 12.04 server at version 1.2.5.  If you create the MIB file using the command:

nxmibc -d /usr/share/netxms/mibs/ -z

Then when you run the console from a Windows machine (1.2.5 as well) you get the following error:

Cannot load MIB file from server: Bad MIB file data

Omitting the -z option causes the error to go away, but of course increases the size of the file by about tenfold in our case.

#31
There does not seem to be a way to clear the proxy node if you copy a DCI from one template to another.  We have a group of templates for devices that cannot run the agent, which simply use the agent on the server to process them.  When we copy a DCI to a template for devices that do have an agent installed, we cannot seem to clear the value.
#32
Feature Requests / Graphs / RRD
October 05, 2012, 12:15:40 AM
I have always wondered why data collection and graphing are not using RRD for storage and display.  So I decided to make this a feature request to hopefully open some discussion.  While I 100% love NetXMS I do miss the capabilities most of the other NMS software gave through use of RRD (just browsing through http://oss.oetiker.ch/rrdtool/gallery/index.en.html brings tears to my eyes :) ).  Specifically, looking for things like:

- Easily being able to select date / time ranges and adjust them on the fly.  Opsview, ZenOSS give good examples
- There does not seem to be any facility for getting Min, Max, Average, Total,... and displaying these on the graphs.
- Custom set presets.  As an example for some links we would like to look at "last month" (9/1 - 9/30 for current example)

It is a code base that has existed and been enhanced for quite some time and really seems to be the de facto standard.  Curious what everybody thinks.  Thanks as always!
#33
General Support / Nagios plugin
October 03, 2012, 10:15:24 PM
I am trying to simply use the Nagios check_dhcp plugin to test some DHCP services in our network.  On the agent running on the same machine as the server I created the following entry:

ExternalParameterShellExec = Dhcp.Check(*):/usr/lib/nagios/plugins/check_dhcp -u -s $1 -m $2 > /dev/null ; echo $?


I then created a DCI for one of my machines using the NetXMS server as the proxy:

Dhcp.Check(1.2.3.4,"11:22:33:44:55:66")

The server shows the value in ERROR.  When I debug with agent I get:

[03-Oct-2012 13:57:13] [session:0] Requesting parameter "Dhcp.Check(1.2.3.4,"11:22:33:44:55:66")"
[03-Oct-2012 13:57:13] H_ExternalParameter called for "Dhcp.Check(1.2.3.4,"11:22:33:44:55:66")" "S/usr/lib/nagios/plugins/check_dhcp -u -s $1 -m $2 > /dev/null ; echo $?"
[03-Oct-2012 13:57:13] H_ExternalParameter: command line is "/usr/lib/nagios/plugins/check_dhcp -u -s 1.2.3.4 -m 11:22:33:44:55:66 > /dev/null ; echo ?"
[03-Oct-2012 13:57:13] H_ExternalParameter (shell exec): worker thread created
[03-Oct-2012 13:57:15] H_ExternalParameter (shell exec): execution timeout
[03-Oct-2012 13:57:15] H_ExternalParameter (shell exec): execution status 2
[03-Oct-2012 13:57:15] [session:0] GetParameterValue(): result is 500 (INTERNAL_ERROR)
[03-Oct-2012 13:57:15] [session:0] Sending message CMD_REQUEST_COMPLETED (size 32)
[03-Oct-2012 13:57:15] [session:0] Received message CMD_GET_PARAMETER
[03-Oct-2012 13:57:15] [session:0] Requesting parameter "Dhcp.Check(1.2.3.4,"11:22:33:44:55:66")"
[03-Oct-2012 13:57:15] H_ExternalParameter called for "Dhcp.Check(1.2.3.4,"11:22:33:44:55:66")" "S/usr/lib/nagios/plugins/check_dhcp -u -s $1 -m $2 > /dev/null ; echo $?"
[03-Oct-2012 13:57:15] H_ExternalParameter: command line is "/usr/lib/nagios/plugins/check_dhcp -u -s 1.2.3.4 -m 11:22:33:44:55:66 > /dev/null ; echo ?"
[03-Oct-2012 13:57:15] H_ExternalParameter (shell exec): worker thread created
[03-Oct-2012 13:57:15] H_ExternalParameter/POpenWorker: worker thread pipe read result: 0x7f84e0001070
[03-Oct-2012 13:57:17] H_ExternalParameter (shell exec): execution timeout
[03-Oct-2012 13:57:17] H_ExternalParameter (shell exec): execution status 2
[03-Oct-2012 13:57:17] [session:0] GetParameterValue(): result is 500 (INTERNAL_ERROR)
[03-Oct-2012 13:57:17] [session:0] Sending message CMD_REQUEST_COMPLETED (size 32)
[03-Oct-2012 13:57:17] [session:0] Received message CMD_GET_PARAMETER
[03-Oct-2012 13:57:17] [session:0] Requesting parameter "Dhcp.Check(1.2.3.4,"11:22:33:44:55:66")"
[03-Oct-2012 13:57:17] H_ExternalParameter called for "Dhcp.Check(1.2.3.4,"11:22:33:44:55:66")" "S/usr/lib/nagios/plugins/check_dhcp -u -s $1 -m $2 > /dev/null ; echo $?"
[03-Oct-2012 13:57:17] H_ExternalParameter: command line is "/usr/lib/nagios/plugins/check_dhcp -u -s 1.2.3.4 -m 11:22:33:44:55:66 > /dev/null ; echo ?"
[03-Oct-2012 13:57:17] H_ExternalParameter (shell exec): worker thread created
[03-Oct-2012 13:57:17] H_ExternalParameter/POpenWorker: worker thread pipe read result: 0x7f84e0001340
[03-Oct-2012 13:57:17] H_ExternalParameter/POpenWorker: worker thread pipe read result: 0x7f84e0001070


Have also tried this with the agent configuration as just ExternalParameter.  Still not 100% sure of the difference between the two.

Running the parsed command from the server produces the desired results, 0 if everything works and 2 if there is no offer.
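Two details stand out in the log above: the expanded command line ends in `echo ?` rather than `echo $?`, and each run ends with "execution timeout" about two seconds after it starts. One way to sidestep both, sketched here as an assumption and not as a confirmed fix, is a small wrapper script that runs the plugin itself, enforces its own time limit and prints the exit status. The `run_check` function and its 5-second default are hypothetical.

```python
import subprocess
import sys


def run_check(argv, timeout=5.0):
    """Run a check command and return its exit status as text.

    Returns "TIMEOUT" if the command does not finish within `timeout` seconds.
    """
    try:
        result = subprocess.run(
            argv,
            stdout=subprocess.DEVNULL,  # discard plugin output, like > /dev/null
            stderr=subprocess.DEVNULL,
            timeout=timeout,
        )
    except subprocess.TimeoutExpired:
        return "TIMEOUT"
    return str(result.returncode)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Example: wrapper.py /usr/lib/nagios/plugins/check_dhcp -u -s 1.2.3.4 -m 11:22:33:44:55:66
    print(run_check(sys.argv[1:]))
```

The agent's own execution timeout would still need to be longer than the plugin's run time for the value to come back.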
#34
I was documenting the PortCheck subagent this morning on the WIKI (complete BTW) and noticed while reading the code that you are not using the Header value that is passed in as an option.  It is using instead the Host (target) passed in as the first parameter in making the request.  Not a huge deal, but should be resolved.  Thanks!
#35
General Support / Template Services
July 24, 2012, 11:10:58 PM
At this point we have all of the SNMP and Agent monitoring setup using templates with automatic bind rules so that as we add a device to the network we are assured to at least get the minimum amount of information so we know about it, and record standard metrics.

Now, however I want to try and template services (SSH, Telnet, HTTP,...).  I can manually create them on devices after they are discovered, but to make this as automated as possible I would like to have it do this for me at node discovery.  I have come up with one way for doing this, but want to see if somebody has a more elegant solution.  We have 100's of services to monitor (maybe 1,000's) and would like to do this right the first time.

Here is the current thought:

1.  We already have an event fired when a node is added that e-mails us to make sure we actually want this device on our network and setup all the items manually that we currently need to.
2.  When this event is fired we would call an action that runs a process on the local server (something like nmap to get the open ports, or chkconfig to see the enabled services) so that we can see what is running on the node.
3.  We can then setup a push of data back to the NetXMS node nicely formatted with values like:
                          mynode      port      22
                          mynode      port      80
                          mynode      service  spamassassin
                          mynode      service  reallyneatservice
4.  These pushed back values are then used by script to create custom attributes on the node.
                          portScan          22:80
                          serviceScan      spamassassin:reallyneatservice
5.  Templates are setup looking to bind nodes that have the specific custom attribute / value pair.
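Steps 3 and 4 above can be sketched as follows. This is a minimal illustration of turning the pushed "node / type / value" lines into the colon-separated custom attribute values shown in the list; the function name `build_attributes` is hypothetical, and actually writing the attributes to the node is left out.

```python
from collections import defaultdict

# Record type in the pushed data -> custom attribute name from step 4.
ATTRIBUTE_NAMES = {"port": "portScan", "service": "serviceScan"}


def build_attributes(lines):
    """Group pushed 'node type value' lines into colon-separated attributes."""
    found = defaultdict(lambda: defaultdict(list))
    for line in lines:
        parts = line.split()
        if len(parts) != 3 or parts[1] not in ATTRIBUTE_NAMES:
            continue  # skip malformed or unknown records
        node, kind, value = parts
        found[node][ATTRIBUTE_NAMES[kind]].append(value)
    return {
        node: {name: ":".join(values) for name, values in attrs.items()}
        for node, attrs in found.items()
    }


pushed = [
    "mynode      port      22",
    "mynode      port      80",
    "mynode      service  spamassassin",
    "mynode      service  reallyneatservice",
]
print(build_attributes(pushed))
```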


This of course only works for templating services that are running on the node at the time it is discovered.  I could of course expand on this to scan again at a later time using one of many methods.

As mentioned, I am really hoping for feedback, either refining this idea or telling me: that is ridiculous, "xyz" is a much better way to do this.   Thanks to all in advance.

Now I am off to upgrade everything to 1.2.2 which I see just made it onto the site :) 
#36
General Support / Template and unsupported SNMP
July 22, 2012, 04:36:31 PM
We have created a template where one of the SNMP items applies to the majority of the devices the template is bound to, but not to all of them.  We are trying to make the system as automatic as possible, so we have created rules to assign templates to devices as they are discovered.  When one of the devices that does not support this SNMP item is discovered it kicks off an error:

Status of DCI 877 (SNMP: .1.3.6.1.4.1...) changed to UNSUPPORTED

This is fine, I would expect that during the initial application of the template.  We then go in and change this item to  Disabled, but leave it in there.

The problem is that if we go into the template to make any further changes, or even just to check things, it seems to update some sort of flag and tries to apply the template items again.  This causes all the nodes to once again process the unsupported message above, and we have to clear them and set the item to disabled again.

Maybe while applying templates it could check whether the item already exists for the node and is set to disabled.  If so, it should NOT fire this message, as it has likely already been seen and handled.
#37
General Support / Procurve SNMP Traps
July 20, 2012, 05:50:21 AM
I recently decided to start moving our trap notifications over to NetXMS from what I hope will soon be our old NMS.  We have a number of various model Procurve switches which the system seems to try to detect.  It shows the driver on the general tab as PROCURVE.  However, when we get a trap for link down and then a trap for link up it is NOT clearing the Link down event and it stays up on the device until we manually terminate.  It is detecting the ports correctly from the varbinds, just not clearing.

First we get trap .1.3.6.1.6.3.1.1.5.3 (link down), followed by .1.3.6.1.6.3.1.1.5.4 (link up).  These appear to be the "standard" down / up notices and are listed in the SNMP Trap Configuration section.
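The behaviour we expected can be sketched as follows: a link up trap for a port clears the outstanding link down for the same port. This is only an illustration of the expected correlation, not how NetXMS processes traps; the `apply_trap` helper and the node name are hypothetical.

```python
LINK_DOWN = ".1.3.6.1.6.3.1.1.5.3"
LINK_UP = ".1.3.6.1.6.3.1.1.5.4"


def apply_trap(active, node, trap_oid, if_index):
    """Update the set of outstanding link-down alarms for one received trap."""
    key = (node, if_index)
    if trap_oid == LINK_DOWN:
        active.add(key)       # raise (or keep) the alarm for this port
    elif trap_oid == LINK_UP:
        active.discard(key)   # link up clears the matching link down
    return active


alarms = set()
apply_trap(alarms, "switch1", LINK_DOWN, 7)
print(alarms)  # {('switch1', 7)}
apply_trap(alarms, "switch1", LINK_UP, 7)
print(alarms)  # set()
```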

Not sure how it comes up with the port drawings either, but it is wrong.  For example our 4000M switch shows an odd layout of ports 1-22 in slot 1, 0 - 23 in slot 2, 0 in slot 3 and 1 in slot 4.  In actuality there are 6 8 port cards, 1 1 port card.  It also pops up a components tab with nothing filled in.

Also, I would like to make a suggestion that you allow clipboard copy from the SNMP Trap Monitor display so it is easier to grab OIDs or messages.

Thanks as always!
#38
General Support / Server Configuration Options
July 17, 2012, 01:56:58 AM
I see that within the client I can add my own Server Configuration variable / value pairs.  How can I retrieve these values within a script, and/or how would I use my own additions?
#39
General Support / Server Performance
July 11, 2012, 07:29:33 PM
I know from the many NMS systems that I have implemented / tested that something is up.  Just not sure how to start digging to find the issue and hoping to get some pointers.  We are running on a quad core Xeon / 4GB RAM machine running Ubuntu 12.04, NetXMS 1.2.1 (from .deb files).  Currently have about 200 nodes and 1500 DCI's setup.  The machine is running NOTHING other than NetXMS and services to support the installation.  Here is a screen grab from top:


# top
top - 11:15:55 up 18:36,  1 user,  load average: 14.94, 14.19, 14.36
Tasks:  30 total,   1 running,  29 sleeping,   0 stopped,   0 zombie
Cpu(s): 23.2%us, 48.7%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.0%hi,  0.0%si, 28.0%st
Mem:   4194304k total,   436432k used,  3757872k free,        0k buffers
Swap:  2097152k total,        0k used,  2097152k free,   176968k cached

  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND                                             
  284 root      20   0 2282m  24m 3592 S  400  0.6   3679:06 netxmsd                                             
  236 mysql     20   0 1654m 211m 8000 S   24  5.2 617:58.69 mysqld                                               
  291 root      20   0  632m 4884 1460 S   10  0.1 194:04.51 nxagentd                                             
    1 root      20   0 24024 2024 1340 S    0  0.0   0:00.15 init                                                 
    2 root      20   0     0    0    0 S    0  0.0   0:00.00 kthreadd/105
...


Have considered recompiling from source, but prefer to use packages if available for ease of installation and hopefully optimal settings in build.  Can anybody else confirm they are running the .deb files under the latest Ubuntu LTS? 

I do have a question though about the software itself.  Are containers inside the "Infrastructure Services" node just considered logical groupings?  That is what I have assumed, so we do have nodes that can appear under 3+ containers.  We have set up an "all node" container, plus groupings by device type, by location and a few other ways we like to analyze the nodes in our network.  I am assuming the software polls each node once, no matter how many containers it appears under.  If that is NOT the case then I probably have too much activity going on.

Thanks!
#40
General Support / Discovered "ghost" devices
July 10, 2012, 06:11:22 PM
I am getting three devices "discovered" but they are not for anything real.  It simply finds the IP address, but then of course cannot detect anything by Agent or SNMP as they are not really devices.  There is nothing on these addresses and they do not show up in any ARP tables or respond to ping.  Looked through the logs, but they are not really showing anything. 

What I am looking for is how these devices are being discovered.  I suspect some device on my network is wrongly making a reference to them and this is getting picked up during examination of routing / ARP tables.  But that is a guess.