Node uptime statistics

Started by Egert143, April 23, 2021, 03:01:20 PM

Previous topic - Next topic

Egert143

Hello

Is it possible to do currently or is there plans to add support for Node uptime statistics. Basically possibility to list nodes ordered by most down time or sort them by site and see what site has most issues with uptime.

Egert

Victor Kirhenshtein

Hi,

the tricky part is what to consider uptime. If purely time when node was unreachable (like 100 - (intervals between SYS_NODE_DOWN and SYS_NODE_UP) / period length) then it is quite simple. In fact it can be done on current version with reporting. If uptime should include more conditions, like that all services are running on the node, then it became more complicated. This is what business services were meant for, but they are very hard to configure and definitely need rework.

Best regards,
Victor

Egert143

Thanks for the reply. Is there any reference in database for example when node went down/up ? Maybe i could extract it with another app and build statistics and link it back with tool menu. Currently one way i can think of is to parse event logs and look for specific events.

For example node would have tool lets call it uptime and it would show latest x amount of downtimes and durations and then more general uptime info for month/year etc.