Netris Healthchecks
Netris includes built-in healthchecks to monitor the status of various network services and applications. These healthchecks help ensure that your network infrastructure is functioning optimally by providing real-time insights into service availability and performance.
There are three main categories of healthchecks in Netris:
Node Health: Node-level health checks that validate whether a node is functioning properly.
Fabric Health: Control-plane and protocol-level checks that validate the network fabric as a whole is functioning properly.
Switch Port Health: Port-level checks that validate whether a specific switch port is functioning properly.
Check type |
Check name |
Description |
Trigger Logic |
Message example |
Comments |
|---|---|---|---|---|---|
Node Health |
check_disk |
Storage % used |
|
53% / Used from 5.4G |
|
Node Health |
check_fan |
Fan status |
|
Fan Tray 1, Fan 1(OK), Fan Tray 2, Fan 2(OK) |
|
Node Health |
check_load |
Load average |
|
Load average 0.39, 0.50, 0.66 |
|
Node Health |
check_memory |
RAM % used |
|
89% Used of 1709 MB |
|
Node Health |
check_psu |
Power Supply status |
|
PSU1(OK), PSU2(OK) |
|
Node Health |
check_ratio |
Detects unusually frequent configuration changes that may indicate abnormal or unstable behavior |
|
vxpd - 0% |
Critical threshold default 60 |
Node Health |
check_temp |
Temperature sensors status |
|
PSU1 Temp Sensor(OK), PSU2 Temp Sensor(OK) |
|
Node Health |
health_monitoring |
Checks whether the node is monitored by Netris |
|
|
If this check is CRITICAL, other healthchecks for the same nodes are marked Unknown, as the node monitoring if not functioning properly. |
Node Health |
sys_service |
Monitors service status:
|
|
rsyslog - active, collectd@mgmt - active, switchd - active, frr - active, vxrd - inactive, netris-portinfo-server - active, netris-swlb.service - active |
|
Node Health |
xc_service |
Netris agent healthcheck:
|
|
vxpd-nvue - OK, ifstats - OK |
|
Node Health |
xc_timesync |
NTP sync status |
|
Time is synchronized |
|
Fabric Health |
check_bgp_underlay |
Switch loopbacks reachability |
|
|
|
Fabric Health |
check_bgp |
BGP session status on port towards connected neighbor switch |
|
swp57s0 IPv4(State: Established, Prefix: 4, Uptime: 02:31:46) |
|
Fabric Health |
check_topology |
Compares LLDP information with declared Netris Topology. |
|
|
|
Switch Port Health |
check_port |
Checks
|
|
swp57s0 port is UP, 0% RX Utilized of 1 Gbps, 0% TX Utilized of 1 Gbps |
|