ServiceCheck.HTTP issue

Started by DanG, January 16, 2012, 01:15:58 PM


DanG

Hi

Our Linux hosting server hung recently (which is extremely rare), but I did not receive any alarms from the ServiceCheck.HTTP check that was configured for it.

Today I looked into it further. I set up a ServiceCheck.HTTP check to monitor a web site (Apache) on my machine. When I stop the web daemon, the DCI threshold triggers as expected. However, if I disconnect the server from the network, the threshold does not trigger. Is this by design, or is it a bug?

Regards,

Dan

Victor Kirhenshtein

Hi!

This could be either a bug or a configuration problem. I suspect that the TCP connection timeout (for the agent-to-web-server connection) is longer than the server's timeout for agent communication, so the server did not wait long enough to get a value for the ServiceCheck.HTTP parameter. Because there was no new value, the threshold was not triggered. I suggest you 1) specify a reasonable but short connection timeout in the ServiceCheck.HTTP parameter; 2) add an additional threshold for data collection errors.
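
To illustrate the reasoning, here is a minimal Python sketch of the timeout interplay. It is only a simplified model, not NetXMS code: the function name `poll_outcome`, the state names, and the example values in milliseconds are all assumptions made for illustration. It relies on the usual TCP behaviour that a stopped daemon refuses the connection immediately, while an unplugged host leaves the connection attempt hanging until the check's own timeout expires.

```python
def poll_outcome(target_state, check_timeout_ms, server_timeout_ms):
    """Model a single poll of a service check (hypothetical, simplified).

    target_state: "up", "refused" (daemon stopped, host still answers)
                  or "unreachable" (host off the network, no reply at all).
    check_timeout_ms: connection timeout used by the agent-side check.
    server_timeout_ms: how long the server waits for the agent's answer.
    """
    if target_state == "up":
        return "ok"
    if target_state == "refused":
        # The connection is rejected at once, so the agent answers quickly.
        return "failure reported"
    # Unreachable: the check blocks until its own connection timeout expires.
    if check_timeout_ms < server_timeout_ms:
        return "failure reported"
    # The server gives up first and never receives a value for this poll.
    return "no value"


# Illustrative values only (milliseconds).
print(poll_outcome("refused", 30000, 2000))
print(poll_outcome("unreachable", 30000, 2000))
print(poll_outcome("unreachable", 4000, 5000))
```

In this model a threshold on the value can only fire in the "failure reported" cases, which is why a separate threshold for data collection errors is useful for the "no value" case.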

Best regards,
Victor

DanG

Hi Victor,

Thanks for the prompt reply.

I changed the following:
-   server configuration variable AgentCommandTimeout set to 5000;
-   added the following to nxagentd.conf:
        *portCheck
        Timeout = 4000
The above solved the issue, thank you. So it is not a bug, but is this by design? My settings before the change (I believe they were the defaults) were 2 seconds on the server and 30 seconds for the portCheck subagent. Do these values not cause issues for other users?

Kind regards,
Dan

Victor Kirhenshtein

Hi!

No one has complained so far :) I agree that this is a flaw in the default configuration. I'll change the default values of AgentCommandTimeout and the portCheck timeout.

Best regards,
Victor